include "course_info.php"; ?>
![]()
Princeton University
|
Computer Science 597FAdTopCS: Visualization & Analysis of large-scale genomic data sets echo $code; ?>
echo $course_name; ?>
Olga Troyanskaya
|
|
include "http://www.cs.princeton.edu/courses/descriptions/cs$code"; ?>
The goal of this course is to introduce students to computational issues involved in analysis and display of large-scale biological data sets. Algorithms covered will include clustering and machine learning techniques for gene expression and proteomics data analysis, biological networks, joint learning from multiple data sources, and visualization issues for large-scale biological data sets. No prior knowledge of biology or bioinformatics is required, and an introduction to the field of bioinformatics and the nature of biological data will be provided. In depth knowledge of computer science is not required, but some understanding of programming and computation will be helpful. The course will be taught in a mixed lectures and seminar format, and will involve completing a project.
The course is open to graduate and advanced undergraduate students from all departments.
SIGNING UP: You should be able to sign up for this course through SCORE using 25078 as the class no (course is COS 597F). Let Melissa Lawson know if this doesn't work.
Professor: echo $prof_fullname; ?> - echo $prof_room; ?> Olga Troyanskaya - 204 CS Building - 258-1749 ogt@cs.princeton.edu (e-mail is the best way to contact)
if(substr($code,0,1) == "5") { echo "Graduate Coordinator:\n"; echo "Graduate Coordinator: Melissa Lawson - 310 CS Building - 258-5387 mml@cs.princeton.edu
Course Format & Grading
This course will cover the following issues: microarray analysis, data integration, biological networks, visualization of large-scale biological data. The class will consist of a mixture of lectures, student presentations of current literature papers, and discussions of these papers.
Students will also complete a team or individual project. The project will need to have a significant content related to the course, but could contribute to the student's current research and reflect the student's computational background. For example, you could implement and evaluate a machine learning method application for microarray data (if you have computational background).
Grades will depend on class participation in discussions of assigned reading (20%), presentations (35%), and project (45%).
Books
There is no required book for this class. Readings will be based on current literature. However, here are a few book recommendations for the curious.
If you need to catch up on molecular biology and genetics:
R. Brent. Genomic Biology. Cell 100:169-183, 2000.
L. Hunter. Molecular Biology for Computer Scientists. In Artificial Intelligence and Molecular Biology, L. Hunter editor, 1993, AAAI Press.
Introduction to bioinformatics:
P.L. Elkin. Primer on Medical Genomics Part V: Bioinformatics. In Mayo Clinic Proceedings.
NCBI primer on microarray analysis
Approximate Schedule
Note: This schedule is approximate and may change.
S M Tu W Th F S Sep 14 15 16 17 18 19 20 introduction to biology, bioinformatics, data; first class 21 22 23 24 25 26 27 microarray analysis, types of experiments, databases 28 29 30 Oct 1 2 3 4 microarray analysis 5 6 7 8 9 10 11 microarray analysis 12 13 14 15 16 17 18 proteomics 19 20 21 22 23 24 25 data integration 26 27 28 29 30 31 fall break Nov 1 2 3 4 5 6 7 8 data integration 9 10 11 12 13 14 15 biological networks 16 17 18 19 20 21 22 biological networks 23 24 25 26 27 28 29 Thanksgiving 30 Dec 1 2 3 4 5 6 visualization 7 8 9 10 11 12 13 visualization; last class 14 15 16 17 18 19 20 winter break 21 22 23 24 25 26 27 28 29 30 31 Jan 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Slides
9/15 - Course details, molecular biology 101, challenges in functional genomics, intro to microarrays
9/17 - A (very) brief overview of database issues, data filtering, normalization, and clustering
Kai Li's guest lecture about visualization
Readings
NOTE: readings are listed for the date when they are DUE
(not they date on which they are assigned)
Each student presentation will be UNDER 30 minutes (INCLUDING questions), and there will be 2 student presentations
per class, followed by a 20 minute discussion. It is perfectly fine to have a presentation that takes 20 minutes, with questions
you will probably take around 25-30mins anyway. Aim at a mixed audience, but explain methods in details.