Princeton University
Computer Science Dept.

Computer Science 435
Information Retrieval, Discovery, and Delivery

Andrea LaPaugh

Spring 2014


General Information | Schedule and Assignments |  Project Page | Announcements

EVOLVING:  CHECK BACK FOR UPDATES

The reading for a class should be completed before class.


WEEK 1
Mon. Feb. 3:
Overview of course topics and organization.   Models of information.
Wed. Feb. 5   SNOW DAY


WEEK 2
Mon. Feb 10: 
  Foundations: classic information retrieval of text. 

Wed. Feb. 12: 
Extending the models. Using links.


WEEK 3


Mon. Feb. 17:  
Evaluation of retrieval systems

Wed. Feb. 19:   Evaluation, cont.; Index structure and use.
Thurs. Feb. 20:  Problem set 2 is now available, due 11:55pm Wed. Feb 26.  Note that your query must be pre-approved by Prof. LaPaugh


WEEK 4

Mon.  Feb 24:  Index structure and use continued

Wed. Feb. 26: 
Index construction
Thur. Feb 27Problem Set 3 (pdf) is now available,  due Wed. March 5.


WEEK 5

Mon. Mar. 3 Index compression

Wed. Mar. 5: Index compression continued

Fri. Mar. 7:  Project proposal due


WEEK 6

Mon. Mar. 10Distributed computation for index building and query execution.   

Wed. Mar. 12:  Distributed computation for index building (continuation of Mar. 10); Crawling the Web

Take-home midterm exam distributed Wednesday March 12, 2013 at the end of class.  Due 4:30 PM sharp Friday March 14, 2014.



Spring break



WEEK 7

Mon. March 24:  Using users behavior: search refinement and recommender systems

Wed. March 26: 
Recommender systems: collaborative filtering
Fri. March 28:   Problem Set 4 (pdf) is now available,  due Wed. April 2.


WEEK 8
Mon. March 31:  Collaborative filtering, continued;  Latent semantic indexing

Wed. April 2:  Clustering
Fri. April 4:   Problem Set 5 (pdf) is now available,  due Wed. April 9.


WEEK 9

Project progress meetings with Professor LaPaugh NEXT week  -watch Piazza for sign-up instructions

Mon. April 7:    Clustering continued

Wed. April 9: 
Detecting near-duplicate documents

Fri. April 11:   Problem Set 6(pdf) is now available,  due Wed. April 16.



WEEK 10


Project progress meetings with Professor LaPaugh
THIS week  - see
Piazza for instructions.

Mon. April 14:  Non-text retrieval: image retrieval

Wed. April 16:  Deep Web Search

WEEK 11

Mon. April 21 Semi-structured information and XML

Wed. April 23  Extracting information from Social Networks

WEEK 12

 Mon. April 28  Social Networks: structure; Privacy Issues in Information Systems: technical aspects


Wed. April 30  Discussion:  privacy issues in practice; wrap-up

Second take-home exam distributed Wednesday April 30, 2014 at the end of class,  due 4:30 PM sharp Friday May 2, 2013.


Project Report due 5:00 pm Dean's Date, Tuesday May 13, 2014
Project Demonstrations between May 14 and May 19

* on reserve in the Engineering Library


last revised  Wed Jun  4 17:20:03 EDT 2014
Copyright  2010, 2011, 2012, 2013, 2014 Andrea S. LaPaugh