Language and Information

Schedule of Topics


This is the schedule of topics for Language and Information, Spring 2009.

Textbook readings are from "Jurafsky and Martin. 2008. Speech and Language Processing (2nd Edition). Pearson", and from NLTK book (for lab sessions) The Discussion column has link to readings for the discussion part of the class. You will find the the articles on SAKAI in the Readings corresponding to the week number.

< < <
Week Date Topic Textbook Readings Discussion Assignments
1 01/21 Course administrivia, Overview of corpus-driven and computational linguistics      
2 01/28 Word Frenquncy, Zip's law, Regular Expressions J & M Chapter 2
  • A. Kilgarriff and G. Grefenstette, Introduction to the special issue on the web as corpus, Computational Linguistics 29(3): 333-348 (2003).
 
3 02/4 Morphology; FSA and FST J & M Chapter 3.1-3.5, 3.7  
4 02/11 N-grams J & M Chapter 4.1-4.5.1, 4.8
  • * Palmer. D.d (2000). Tokenization and sentence segmentation. In Dale, r. , Moisl, H. and Somers, H. (eds) Handbook of natural language processing. [see Sakai site Resources/Readings/Week4; or check Rutgers Libraries for entire book "Handbook of natural language processing" which is e-book] [ Miriam Benovitz ].
 
5 02/18 Word classes; Part of Speech tagging J & M Chapter 5.1-5.5.2, 5.6
 
6 02/25 Formal grammars of english (Context Free Grammars(CFG); treebanks; PCFG) J & M Chapter 12.1-12.7.1; 14.1 (only 14.1.1)
 
7 03/4 Syntactic parsing (CKY; chunking; statistical parsing (PCKY)) J & M Chapter 13.1-13.4.1; 13.5; 14.2
 
8 03/11 Semantics (meaning representation; lexical semantics;WordNet; Propbank; FrameNet) J & M Chapter 17.1; Chapter 19
 
  03/18 SPRING BREAK Have fun!
   
9 03/25 Computational Semantics (semantic distance; semantic role labelling; Intro to classification) J & M Chapter 18.1; 20.1; 20.6-20.9
 
10 4/1 Text Classification (Feature Extraction; application on Sentiment Analysis)
  • Philip Resnik and Mona Diab (2000). Measuring Verb Similarity, Twenty Second Annual Meeting of the Cognitive Science Society (COGSCI2000), Philadelphia, August 2000.[ Jun Zhang ]
  • Pradhan, S. S., Ward, W., and Martin, J. H. (2007).Towards robust semantic role labeling . Proceedings of HLT-NAACL 2007. [ Gayatree Ganu ]
 
11 4/8 Computational Discourse (text coherence;reference phenomena; Penn Discourse Treebank) J & M Chapter 21.2; 21.3  
12 4/15 Dialogue (human conversation; dialogue systems design and evaluation J & M Chapter 24.1; 24.2; 24.4  
13 4/22 Language and Complexity J&M Chapter 16  
14 4/29 Student Project Presentations      
15 5/6 Student Project Presentations