|
COS557/MOL557: Analysis & Visualization of Large Scale Genomic Data Sets |
|
Fridays 1:30-4:00pm Rm. 302 in CS building (35 Olden street) |
|
Expression of genes predictive of outcome in lung cancer. |
|
Course Info |
|
The goal of this course is to introduce students to computational issues involved in analysis and display of large-scale biological data sets. Techniques covered will include clustering and machine learning techniques for gene expression microarrays and proteomics data analysis, biological networks and pathways modeling, data integration in genomics, and visualization issues for large-scale data sets. A short introduction to the field of bioinformatics and the nature of biological data will be provided, no prior knowledge of biology is required. In depth knowledge of computer science is not required, but students must have some understanding of computation. The course will be taught in a mixed lectures and seminar format, and will involve completing a project and a final exam. The course is open to graduate and advanced undergraduate students from all departments. |
|
Level: Graduate and upper level undergraduate Background: Some understanding of computation (programming background not required) Format: Mixed lectures and seminar-style Instructor: Prof. Olga Troyanskaya Grading: 40% class presentations, 15% class participation (including attendance), 45% final project (15% project proposal, 30% final project report) Auditors: Auditors are welcome, but every auditor must participate in presentations and discussions (but does not need to do the final project). |
|
Administrative: |
|
There is no required book for this class. Material will be presented in lectures, and readings will be based on current literature. However, here are a few recommendations for the curious. If you need to catch up on molecular biology and genetics: R. Brent. Genomic Biology. Cell 100:169-183, 2000. L. Hunter. Molecular Biology for Computer Scientists. In Artificial Intelligence and Molecular Biology, L. Hunter editor, 1993, AAAI Press. Introduction to bioinformatics: P.L. Elkin. Primer on Medical Genomics Part V: Bioinformatics. In Mayo Clinic Proceedings. |
|
Presentations: Each presentation should be 30mins, with 10-15mins for discussion afterwards. Presentations should be in power point (or another slides format), and you must e-mail me the power point after your presentation before I can grade it. A good presentation would include: -a brief overview of the paper -outline of major methods and findings -analysis of what the paper did well -analysis of problems/issues with the approach -what is the future (don’t just retype the “future work” section, we’re looking for your analysis here) Course Announcements (check here often): |