Spring 2018 | Spring 2016

In this course you will apply several machine learning techniques to deal with structure predictions problems related to translation. Our course is composed of 3 blocks of lectures: lexical alignment (3 lectures), statistical machine translation (4 lectures), and neural machine translation (4 lectures). We will also have lab sessions related to project assignments.

Lectures

Introduction: parallel corpora
Khalil Sima'an. 2018-04-06.
Abstract Slides

Lexical alignment

IBM 1 and 2: Models over words and MLE via EM for categorical distributions
Khalil Sima'an. 2018-04-09.
Abstract Slides Class material Background reading Further reading
Cont. IBM 1 and 2: Models over words and MLE via EM for categorical distributions
Khalil Sima'an. 2018-04-13.
Abstract Slides Class material Background reading Further reading
Bayesian IBM1: Dirichlet priors and posterior inference
Wilker Aziz. 2018-04-16 and 2018-04-26.
Abstract Slides Class material Background reading Further reading Discussion

Statistical machine translation

Bitext Parsing: Alignment and Word Order Difference
Khalil Simaan. 2018-04-20.
Abstract Class material Background reading Further reading
Factorisation of permutations and reordering grammar
Khalil Simaan. 2018-04-23.
Abstract Slides Background reading
Linear models (part 1): phrase-based and hierarchical phrase-based SMT
Miguel Rios and Iacer Calixto. 2018-04-30.
Class material Background reading
Linear models (part 2): phrase-based and hierarchical phrase-based SMT
Miguel Rios and Iacer Calixto. 2018-05-4.
Class material Background reading
Machine Translation Evaluation
Iacer Calixto and Miguel Rios. 2018-05-7.
Abstract Slides Background reading Further reading

Neural machine translation

Neural language models
Iacer Calixto, Miguel Rios and Joost Bastings. 2018-05-14.
Slides Background reading Further reading
Neural Machine Translation: encoder-decoder
Joost Bastings, Iacer Calixto and Miguel Rios. 2018-05-18.
Abstract Slides Background reading Further reading
Remaining papers, Projects and closing remarks
Khalil Simaan, Miguel Rios and Iacer Calixto. 2018-05-25.

Labs

  • Recap (1 session)
    • alignment
  • Project 1 (2 sessions)
    • word alignment models
  • Project 2 (2 sessions)
    • seq2seq
    • attention