Schedule for 16:194:614

This schedule is subject to alterations.

(Legend for the readings: MBK = Meadow, Boyce and Kraft, vR = van Rijsbergen, RB = Rik Belew, SJW = Sparck-Jones and Willett)

- Week - - Topics / Activities - - Students' responsibilities -
(During and/or after class)
Textbook IR: general topics

* 1 *

Fri,
Sep 03

Slides in HTML and PDF

Introduction and overview of the course.

Get familiar with the course website. Set up your course website on eden.
Send me email with your details (use students.xml template).

* 2 *

Fri,
Sep 10

Slides in HTML and PDF

Introduction to IR. Information vs data retrieval.

What do we want from IR ? Introduction to evaluation.

 

* 3 *

Fri,
Sep 17

Slides in HTML and PDF

IR concepts. Aboutness. Relevance.

Rationalist vs. empriricist approaches (AI vs. Stats)

Design decisions for IRS; automatic vs. manual/intellectual systems.

Student presentations:
MarinaMalysheva: "Indexing by latent semantic analysis";
Cathy Smith: "Relevance - A conceptual and practical overview";
Iliana Chaleva: "Empiricists vs. Rationalists".

 

* 4 *

Fri,
Sep 24

Slides in HTML and PDF

Indexing.

Document and query representation. Manual vs. automatic indexing.

Look at an example of a document collection, a stopword list, an indexed collection and an inverted file.
Formulate a few boolean queries and figure out the result of a boolean search.

* 5 *

Fri,
Oct 01

Slides in HTML and PDF


Automatic indexing. Lexical analysis. Weighting. Data structures.

Lab work.

Homework (to be graded).

* 6 *

Fri,
Oct 08

Slides in HTML and PDF

Models of IR.

Interaction models. Indexing models. Language models. Topic models. User models.

Information Retrieval as interaction. Evaluation of interactive systems.

 

* 7 *

Fri,
Oct 15

Slides in HTML and PDF

User interfaces and Information Visualization for IR Part I: Interaction models. Part II : Tools and techniques.

ClusterBook (HTML and PDF).

Lab work. Homework.

* 8 *

Fri,
Oct 22

Invited lecture:
Dr. Anselm Spoerri, SCILS - MetaCrystal.

Student presentations.

 

* 9 *

Fri,
Oct 29

Slides in HTML and PDF

Evaluation of interactive systems.

Homework.

* 10 *

Fri,
Nov 05

Slides in HTML and PDF

Evaluation of IR systems. Lab work / homework.

* 11 *

Fri,
Nov 12

Evaluation of IR systems.  
Advanced IR: current research topics

* 12 *

Fri,
Nov 19

Project work.

 

* 13 *

Fri,
Nov 26

Thanksgiving, no class.  

* 14 *

Fri,
Dec 03

Slides

AI and IR.

Machine learning and data mining for IR.

Invited lecture:
Dr. James G. Shanahan
Principal Research Scientist
Clairvoyance Corporation

(Also see Lewis' tutorial)

* 15 *

Fri,
Dec 10

Cathy Smith: Statistical model for IR.
Language models.

Iliana Chaleva: IR on the WWW.

Topic modeling.

Web IR

Structure. Clustering vs. classification.

Informetrics and IR.

The Semantic Web.

 

* 16 *

Fri,
Dec 17

Yoo-Jin Ha: Cross-language IR.

Marina Malysheva: Natural language processing for IR.

Collaborative and recommender systems.

Personalization and user modeling.

Implicit vs. explicit feedback.

Document summarization.

Information extraction.

Multimedia IR (image, video, music, ...).

IR for structured documents. INEX.