Princeton University
Computer Science Dept.

Computer Science 435
Information Retrieval, Discovery, and Delivery

Andrea LaPaugh

Spring 2008




WEEK 1
Mon. Feb. 4: Overview of course topics and organization. Begin classic information retrieval of text, if time permits.
Wed. Feb. 6: Inspiration: discussion of "As We May Think". Foundations: classic information retrieval of text.

WEEK 2
Mon. Feb. 11: Classic information retrieval continued: Boolean model, vector model.

Wed. Feb. 13: Vector model continued; Latent Semantic Indexing.

WEEK 3
Mon. Feb. 18: LSI continued; extended model. 

Wed. Feb. 20: Ranking Web pages

Assignment 2 (pdf) now available. Due Friday, February 29. You will also need files HW2_graph1.txt, HW2_graph2.txt, and HW2_graph3.txt.
Use alpha = 0.15 for Problem 2.


WEEK 4

Mon. Feb. 25: Ranking Web pages, cont.

Wed. Feb. 27: Evaluation of retrieval systems; spamming search engines

Friday, February 29: Assignment 2 (pdf) due. You will also need files HW2_graph1.txt, HW2_graph2.txt, and HW2_graph3.txt.
Use alpha = 0.15 for Problem 2.


Assignment 3 now available. Due Friday, March 7. Note that you must have your query for Problem 2 pre-approved!



WEEK 5
Mon. March 3: Indexing

Wed. March 5: Index construction

Friday, March 7: Assignment 3 due. Note that you must have your query for Problem 2 pre-approved!


WEEK 6
Mon.  March 10: Remarks on index construction and query evaluation; Index compression


Wed. March 12: Midterm review; compression continued.


Friday, March 14: Project proposal due. See Project Page for details.


Spring break


WEEK 7
Mon. March 24: Search refinement, especially using user feedback

Wed. March 26: Collaborative filtering; Clustering

Friday, March 28: Exam due.


WEEK 8
Mon. March 31: Clustering continued

Assignment 4 now available.  Due Monday April 7.

Wed. April 2: Clustering; Databases versus Information Retrieval Systems, Semi-structured information and  XML

WEEK 9

Mon. April 7: XML continued; detecting near-duplicate documents

Assignment 5 (pdf) now available.  Due Monday April 14.


Wed. April 9: Detecting near-duplicate documents continued; Crawling the Web

WEEK 10
Project check-point discussions this week!

Mon.  April 14:  Tenth week review; Crawling the Web continued; characteristics of the changing Web

Wed. April 16:  Music retrieval. 
Presentation by guest Rebecca Fiebrink



WEEK 11

Mon. April 21: Image retrieval

Wed.  April 23:  Privacy in Information Retrieval.  Presentation by Joe Calandrino

WEEK 12
Mon. April 28: Student presentations


Wed.  April 30:  Student presentations; wrap-up








Last revised Wed Apr 30 15:55 EDT 2008
Copyright © 2008 Andrea S. LaPaugh