| - Week - | - Topics / Activities - | - Students' responsibilities - (During and/or after class) |
|---|---|---|
| Textbook IR: general topics |
||
* 1 * Tue, |
Introduction and overview of the course. | Get familiar with the course website. Set up your course website on
scils. Play with Model.xls to solve the homework.If you need to, do some Excel practice. |
* 2 * Tue, |
Introduction to IR. Information vs. data retrieval. What do we want from IR ? Introduction to evaluation. |
|
* 3 * Tue, |
IR concepts. Aboutness. Relevance. Rationalist vs. empiricist approaches (AI vs. Stats) Design decisions for IRS; automatic vs. manual/intellectual systems. |
Parsing labwork. (Please install Python on your laptops before class. Recommended: ActiveState) |
* 4 * Tue, |
Indexing. Document and query representation. Manual vs. automatic indexing. |
WebClusterLite lab work. (Please install WebClusterLite onto your laptops before class.) Look at an example of a document
collection, a stopword
list, an indexed
collection and an inverted
file. |
* 5 * Tue, |
|
Lab work. Homework (to be graded). |
* 6 * Tue, |
Models of IR. Ranking / relevance estimation models.
|
See Lavrenko's tutorial on Language Models. |
* 7 * Tue, |
Models of IR. Interaction models. Information Retrieval as interaction. Evaluation of interactive systems. |
|
| * 8 * Tue, |
Spring break, no class. |
|
* 9 * Tue, |
Evaluation of IR systems. Introduction to Statistics and Hypothesis Testing (HTML and PDF). |
Lab work / homework - Performance evaluation . |
* 10 * Tue, |
Evaluation of IR systems. TREC |
|
* 11 * Tue, |
Evaluation of IR systems. TREC |
Lab work / homework - TREC-type evaluation. |
* 12 * Tue, |
User interfaces and Information Visualization for IR. Part I: Interaction models. Gass: Designing user interfaces; |
Proposed Lemur-based term project(s) |
* 13 * Tue, |
User interfaces and Information Visualization for IR. Part II : Tools and techniques. Kirkyla: Supporting |
|
| Advanced IR: current research
topics |
||
* 14 * Tue, Slides in HTML and PDF |
Structure. Clustering vs. classification. Gass: The Semantic Web. Blogs and wikis. Informetrics and IR. Collaborative and recommender systems. Personalization and user modeling. Topic modeling. Implicit vs. explicit feedback. |
See Ravi Kumar's tutorial on Internet Search. |
* 15 * Tue, Slides in HTML and PDF |
AI and IR. Machine learning and data mining for IR. Natural language processing for IR. Information extraction. Cross-language IR. Multimedia IR (image, video, music, ...). IR for structured documents. INEX. |
(Also see a tutorial on ML ) |
* 16 * Tue, |
Student presentations of term projects. Nicholson: Informetrics |
|