Princeton University
Computer Science Dept.

Computer Science 425

Database & Information Management Systems


Andrea LaPaugh

Schedule and Readings

Fall 2006


General Information | Schedule and Readings | Project Page | Announcements

Readings

Except for Chapter 1, reading should be done before the topic is begun in class to facilitate class discussion.

Problem Sets

Click on the problem set number in the schedule below to reach the problem set due that week. (If there is no link, the problem set is not ready for distribution.) Note that problems sets are not always due on the same day of the week.  Problem sets are due at the beginning of class on the due date given. Late penalties will be applied, unless there are extraordinary circumstances and/or prior arrangements. Late penalties are:

Schedule

Subject to change. Be sure to check readings and assignments for changes up to 5 days before due.

prob. set due date Topics Readings in Ramakrishnan 
and Gehrke
PART 1:  MODELS and QUERIES
- Mon 9/18 Introduction
pdf of slides:  intro,  ER model
Chapter 1
9/20 The entity-relationship model
pdf of slides: continuation of ER
Chapter 2
- Mon 9/25 The relational model
pdf of slides:  final concept ER
pdf of slides:  intro to relational
Chapter 3
9/27 Relational algebra
pdf of slides: relational constraints
pdf of slides: relational algebra
UPDATED pfd summary rel'al op properties
Chapter 4, Sections 1 and 2
PS 1 due Mon. 10/2
Mon 10/2
10/4 Relational calculus
pdf of slides: final for relational algebra
pdf of slides: relational calculus
10/23: correction of cal. example (pdf)
Chapter 4, Sections 3 and 4
PS 2 due Wed. 10/11
Mon 10/9 The SQL Language
pdf of slides: all of SQL - final version
Chapter 5
10/11
PS 3 due Fri. 10/20
Mon 10/16
XML and the tree model
Slides:  XML Part 1:  PowerPoint,  pdf
Slides:  XML Part 2:  PowerPoint,  pdf
XML Schema spec. for "books" (pdf)
Chapter 7, Sections 1-4;
Chapter 27, Sections 6 and 7
Strongly recommended: Silberschatz1
10.1, 10.2, 10.3, 10.4.1, 10.4.2,  10.7
10/18
EXAM 1: 
out Wed 10/25 after class;
 due  Fri. 10/27 by 5pm

Mon 10/23
Information retrieval (IR)
summary of pagerank and HITS
slides (pdf)  used 11/6: 
evaluation of search results
Chapter 27, Sections 1,2, 4 and 5
Recommended: Chakrabarti2
Chapter 7
: intro, 7.1, 7.2 thru pg. 215, 7.3
10/25
fall break
PART 2:  STORING and ANSWERING
Project Proposal
Due Wed. 11/8
Mon 11/6  File Organization
pdf of slides: table of file access costs
Chapter 8, Sections 1-3 
(Chapter 9 background)
11/8 Tree-Structured Indexing
pdf of slides:  general indexing and B+ trees
Chapter 10
PS 4 due Mon. 11/13 Mon 11/13 Hash-Based Indexing
pdf of slides: dynamic hashing
Chapter 11,  Sections 1 and 2
Chapter 8, Sections 4 and 5
 11/15 Relational Query Evaluation
Chapter 12, Sections 1-3
Chapter 13, Sections1-3
Chapter 14
PS 5 due Mon. 11/20 Mon 11/20
11/22
Relational Query Optimization
NEWpdf of full set slides:  eval unary ops and multi-op queries

Chapter 12, Sections 4-6
Reminder:
oral progress reports
for projects
this week
 - see Project Page
PS 6 due Wed. 11/29
Mon  11/27
11/29 Inverted indexes and search
Chapter 27, Sections 3
NEWStrongly recommended:
Baeza-Yates
3 7.4.5, 8.1, 8.2
EXAM 2: 
out Wed 12/6 after class;
 due  Fri. 12/8 by 5pm

Mon 12/4 Indexes and eval for XML
Chapter 27, Section 8
NEWRecommended:  Index-driven XQuery
procesing in the eXist XML database

PART 3: TRANSACTIONS:  CORRECTNESS and DURABILITY
12/6 Transaction Management  and Concurrency Control
pdf of slides: transactions, concurrency and locking
Chapter 16
Chapter 17: Sections 1 - 4

Mon 12/11 Crash Recovery 
pdf of slides: aborting and intro to crash recovery
Chapter 18
12/13
Finish Crash Recovery
pdf of slides:  crash recovery on restart
pdf of slides:  example done on board
Final Remarks
pdf of slides: final remarks
--
Project report due 5pm Tuesday, Jan. 16, 2007 (Dean's Date)
Project demonstrations scheduled individually, completed by 5:00pm Monday Jan 22.


1. Silberschatz refers to Abraham Silberschatz, Henry Korth, and S. Sudarshan, Database System Concepts, Fifth Edition, McGraw-Hill, 2006, on reserve at the Engineering  Library.
2. Chakrabarti refers to Soumen Chakrabarti,  Mining the Web: Discovering Knowledge from Hypertext Data, Elsevier (Morgan_Kaufmann Division), 2003, on reserve at the Engineering Library.
3. Baeza-Yates refres to Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval, Addison-Wesley, 1999, on reserve at the Engineering Library.


A.S. LaPaugh Wed Dec 13 17:38:25 EST 2006