Statistical and Neural Machine Translation

This website contains resources for research in statistical and neural machine translation, i.e. the translation of text from one human language to another by a computer that learned how to translate from vast amounts of translated text.

Events

Conference on machine translation: 2022, 2021, 2020, 2019, 2018, 2017, 2016.
Workshop on machine translation: 2015. 2014. 2013. 2012. 2011. 2010. 2009. 2008. 2007. 2006.
Workshop on building and using parallel text 2015
Machine Translation Marathon: 2022, 2019, 2018, 2017, 2016, 2015, 2014, 2013, 2012, 2011b, 2011a, 2010, 2009, 2008, 2007.
Machine Translation Marathon of the Americas: 2022, 2019, 2018, 2017, 2016, 2015.

Resources

External Historic Links: Introduction to Statistical MT Research

The Mathematics of Statistical Machine Translation by Brown, Della Petra, Della Pietra, and Mercer
Statistical MT Handbook by Kevin Knight
SMT Tutorial (2003) by Kevin Knight and Philipp Koehn
ESSLLI Summer Course on SMT (2005), day1, 2, 3, 4, 5 by Chris Callison-Burch and Philipp Koehn.
MT Archive by John Hutchins, electronic repository and bibliography of articles, books and papers on topics in machine translation and computer-based translation tools

External Historic Software

Giza++ a training tool for IBM Model 1-5 (version for gcc-4)
Moses, a complete SMT system
UCAM-SMT, the Cambridge Statistical Machine Translation system
Phrasal, a toolkit for phrase-based SMT
cdec, a decoder for syntax-based SMT
Joshua, a decoder for syntax-based SMT
Jane, decoder for syntax-based SMT
Pharaoh a decoder for phrase-based SMT
Rewrite a decoder for IBM Model 4
BLEU scoring tool for machine translation evaluation

External Parallel Corpora

maintained by Philipp Koehn