Date |
Topic |
Reading |
Assignments |
16 Oct |
(no class - introductory week)
|
|
|
19 Oct |
(no class - introductory week)
|
|
|
23 Oct |
Introduction
|
|
|
26 Oct |
Crash course in Python and NLTK
|
|
A1
|
30 Oct |
Generative statistical models; n-grams
|
-
Jurafsky & Martin,
Chapters 4.1-4.2
-
*
Kevin Murphy,
Binomial and multinomial distributions
-
*
Jurafsky & Martin (Smoothing),
Chapters 4.5-4.7
-
*
Manning & Schütze (Smoothing),
Chapters 6.2-6.3
|
|
02 Nov |
More Python
|
|
|
06 Nov |
POS tagging with HMMs
|
-
Jurafsky & Martin,
Chapters 5.1-5.5, 6.1-6.4
|
A1 due;
A2
|
09 Nov |
HMM training
|
|
|
13 Nov |
Discussion of Assignment 1
|
|
|
16 Nov |
Context-free grammars
|
|
|
20 Nov |
Complexity of algorithms and the CKY parser
|
|
A2 due;
A3
|
23 Nov |
Probabilistic CFGs
|
-
Jurafsky & Martin,
Chapter 14.1-14.5, 14.7
|
|
27 Nov |
Discussion of Assignment 2
|
|
|
30 Nov |
Training PCFGs
|
|
|
04 Dec |
More accurate PCFG parsing
|
-
Mark Johnson (1998),
PCFG Models of Linguistic Tree Representations
(esp. on parent annotations)
-
Michael Collins,
Lexicalized PCFGs
-
Dan Klein & Chris Manning (2003),
Accurate unlexicalized parsing
-
*
Michael Collins (1997),
Three Generative, Lexicalized Models for Statistical Parsing
-
*
David Magerman (1995),
Description of head percolation tables
-
*
Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein (2006),
Learning accurate, compact, and interpretable tree annotation
-
William Morgan,
Statistical Hypothesis Tests for NLP
(explains approximate randomization)
-
Berg-Kirkpatrick et al. (2012),
An empirical investigation of statistical significance for NLP
(explains bootstrap testing)
|
A3 due;
A4
|
07 Dec |
Advanced PCFG parsing algorithms
|
-
Joshua Goodman (1999),
Semiring parsing
(not Section 4, 6; ignore Earley parser if you like)
-
Charniak et al. (2006),
Multilevel coarse-to-fine PCFG parsing
-
Dan Klein & Chris Manning (2003),
A* Parsing - Fast exact Viterbi parse selection
-
*
Stuart Shieber, Yves Schabes, and Fernando Pereira (1993),
Principles and implementation of deductive parsing
-
*
Eugene Charniak, Sharon Goldwater, and Mark Johnson (1998),
Edge-based best-first chart parsing
-
*
Liang Huang and David Chiang (2005),
Better k-best parsing
-
*
Michael Collins and Terry Koo (2005),
Discriminative reranking for natural language parsing
-
*
Eugene Charniak and Mark Johnson (2005),
Coarse-to-fine n-best parsing and MaxEnt discriminative reranking
|
|
11 Dec |
Discussion of Assignment 3
|
|
|
14 Dec |
Dependency parsing
|
-
Ryan McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajic (2005),
Non-projective Dependency Parsing using Spanning Tree Algorithms
-
Joakim Nivre (2008),
Algorithms for Deterministic Incremental Dependency Parsing
-
*
Joakim Nivre, Ryan McDonald (2007),
Characterizing the Errors of Data-Driven Dependency Parsing Models
-
*
Jason Eisner (1996),
Three New Probabilistic Models for Dependency Parsing -- An Exploration
-
*
Eliyahu Kiperwasser and Yoav Goldberg (2016),
Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations
-
*
Google,
SyntaxNet
|
|
18 Dec |
Statistical machine translation - Alignments
|
|
A4 due;
A5
|
21 Dec |
Statistical machine translation - Translation
|
|
|
25 Dec |
(Christmas break)
|
|
|
28 Dec |
(Christmas break)
|
|
|
01 Jan |
(Christmas break)
|
|
|
04 Jan |
(Christmas break)
|
|
|
08 Jan |
Discussion of Assignment 4
|
|
|
11 Jan |
More expressive grammar formalisms
|
|
|
15 Jan |
Bayesian methods - LDA
|
|
A5 due;
A6
|
18 Jan |
Bayesian methods - Grammar induction
|
|
|
22 Jan |
Discussion of Assignment 5
|
|
|
25 Jan |
Semantic parsing
|
|
|
29 Jan |
(AK away)
|
|
A6 due
|
01 Feb |
Lexical semantics
|
|
|
05 Feb |
Discussion of Assignment 6
|
|
|
08 Feb |
Discussion of final projects
|
|
|