Pre-Reordering

Since reordering is a hard problem, there have been efforts to handle it in a separate prior translation stage, so that the main translation model can focus on the lexical aspects.

Publications

Reordering in pre-processing by a hand-crafted component has been explored for German–English (Collins et al., 2005), Japanese–English (Komachi et al., 2006), Chinese–English (Wang et al., 2007), and English–Hindi (Ramanathan et al., 2008). Zwarts and Dras (2007) point out that translation improvements are due to both a reduction of reordering needed during decoding and the increased learning of phrases of syntactic dependents. Nguyen and Shimazu (2006) also use manual rules for syntactic transformation in a preprocessing step. Such a reordering component may also be learned automatically from parsed training data, as shown for French–English (Xia and McCord, 2004), Arabic–English (Habash, 2007), and Chinese–English (Crego and Mariño, 2007) — the latter work encodes different orderings in a input lattice to the decoder. Li et al. (2007) propose a maximum entropy pre-reordering model based on syntactic parse trees in the source language. It may be beneficial to train different such pre-reordering models for different sentence types (questions etc.) (Zhang et al., 2008). Preprocessing the input to a machine translation system may also include splitting it up into smaller sentences (Lee et al., 2008).

Reordering patterns may also be learned over part-of-speech tags, allowing the input to be converted into a reordering graph (Crego and Mariño, 2006) or enabling a rescoring approach with the patterns as features (Chen et al., 2006). The reordering rules may also be integrated into an otherwise monotone decoder (Tillmann, 2008). Such rules may also be used in a separate reordering model. Such rules may be based on automatic word classes (Costa-jussà and Fonollosa, 2006; Crego et al., 2006), which was shown to outperform part-of-speech tags (Costa-jussà and Fonollosa, 2007), or they may be based on syntactic chunks (Zhang et al., 2007; Zhang et al., 2007b; Crego and Habash, 2008). Scoring for rule applications may be encoded in the reordering graph, or done once the target word order is established which allows for rewarding reorderings that happened due to phrase-internal reordering (Elming, 2008; Elming, 2008b).

Benchmarks

Discussion

New Publications

Thai Phuong Nguyen and Akira Shimazu (2006): Improving phrase-based statistical machine translation with morphosyntactic transformation, Machine Translation
add
@article{MTJ:2006:Nguyen,
author = {Thai Phuong Nguyen and Akira Shimazu},
title = {Improving phrase-based statistical machine translation with morphosyntactic transformation},
url = {http://www.mt-archive.info/AMTA-2006-Nguyen.pdf},
googlescholar = {16426137314611909296},
pages = {147--166},
journal = {Machine Translation},
volume = {20},
number = {3},
month = {September},
year = 2006
}
Nguyen and Shimazu (2006)
Holmqvist, Maria and Stymne, Sara and Foo, Jody and Ahrenberg, Lars (2009): Improving Alignment for SMT by Reordering and Augmenting the Training Corpus, Proceedings of the Fourth Workshop on Statistical Machine Translation
add
@InProceedings{holmqvist-EtAl:2009:WMT-09,
author = {Holmqvist, Maria and Stymne, Sara and Foo, Jody and Ahrenberg, Lars},
title = {Improving Alignment for {SMT} by Reordering and Augmenting the Training Corpus},
booktitle = {Proceedings of the Fourth Workshop on Statistical Machine Translation},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {120--124},
url = {http://www.aclweb.org/anthology/W/W09/W09-0421},
year = 2009
}
Holmqvist et al. (2009)
Li, Jin-Ji and Kim, Jungi and Kim, Dong-Il and Lee, Jong-Hyeok (2009): Chinese Syntactic Reordering for Adequate Generation of Korean Verbal Phrases in Chinese-to-Korean SMT, Proceedings of the Fourth Workshop on Statistical Machine Translation
add
@InProceedings{li-EtAl:2009:WMT-092,
author = {Li, Jin-Ji and Kim, Jungi and Kim, Dong-Il and Lee, Jong-Hyeok},
title = {{C}hinese Syntactic Reordering for Adequate Generation of {K}orean Verbal Phrases in {C}hinese-to-{K}orean {SMT}},
booktitle = {Proceedings of the Fourth Workshop on Statistical Machine Translation},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {190--196},
url = {http://www.aclweb.org/anthology/W/W09/W09-0433},
year = 2009
}
Li et al. (2009)
Xu, Peng and Kang, Jaeho and Ringgaard, Michael and Och, Franz (2009): Using a Dependency Parser to Improve SMT for Subject-Object-Verb Languages, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
add
@InProceedings{xu-EtAl:2009:NAACLHLT09,
author = {Xu, Peng and Kang, Jaeho and Ringgaard, Michael and Och, Franz},
title = {Using a Dependency Parser to Improve {SMT} for Subject-Object-Verb Languages},
booktitle = {Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
month = {June},
address = {Boulder, Colorado},
publisher = {Association for Computational Linguistics},
pages = {245--253},
url = {http://www.aclweb.org/anthology/N/N09/N09-1028},
year = 2009
}
Xu et al. (2009)
Elming, Jakob and Habash, Nizar (2009): Syntactic Reordering for English-Arabic Phrase-Based Machine Translation, Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages
add
@InProceedings{elming-habash:2009:Semitic,
author = {Elming, Jakob and Habash, Nizar},
title = {Syntactic Reordering for {E}nglish-{A}rabic Phrase-Based Machine Translation},
booktitle = {Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {69--77},
url = {http://www.aclweb.org/anthology/W09-0809},
year = 2009
}
Elming and Habash (2009)
Genzel, Dmitriy (2010): Automatically Learning Source-side Reordering Rules for Large Scale Machine Translation, Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
add
@InProceedings{genzel:2010:PAPERS,
author = {Genzel, Dmitriy},
title = {Automatically Learning Source-side Reordering Rules for Large Scale Machine Translation},
booktitle = {Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {376--384},
url = {http://www.aclweb.org/anthology/C10-1043},
year = 2010
}
Genzel (2010)
Khalilov, Maxim and Sima'an, Khalil (2010): A Discriminative Syntactic Model for Source Permutation via Tree Transduction, Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation
add
@InProceedings{khalilov-simaan:2010:SSST,
author = {Khalilov, Maxim and Sima'an, Khalil},
title = {A Discriminative Syntactic Model for Source Permutation via Tree Transduction},
booktitle = {Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {92--100},
url = {http://www.aclweb.org/anthology/W10-3812},
year = 2010
}
Khalilov and Sima'an (2010)
Chooi-Ling Goh and Takashi Onishi and Eiichiro Sumita (2011): Rule-based Reordering Constraints for Phrase-based SMT, Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)
add
@inproceedings{eamt11:Goh,
author = {Chooi-Ling Goh and Takashi Onishi and Eiichiro Sumita},
title = {Rule-based Reordering Constraints for Phrase-based {SMT}},
url = {http://mt-archive.info/EAMT-2011-Goh.pdf},
googlescholar = {13488968312707614244},
pages = {113--120},
booktitle = {Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)},
location = {Leuven, Belgium},
editor = {Mikel L. Forcada and Heidi Depraetere and Vincent Vandeghinste},
year = 2011
}
Goh et al. (2011)
Bisazza, Arianna and Federico, Marcello (2010): Chunk-Based Verb Reordering in VSO Sentences for Arabic-English Statistical Machine Translation, Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
add
@InProceedings{bisazza-federico:2010:WMT,
author = {Bisazza, Arianna and Federico, Marcello},
title = {Chunk-Based Verb Reordering in VSO Sentences for Arabic-English Statistical Machine Translation},
booktitle = {Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {241--249},
url = {http://www.aclweb.org/anthology/W10-1735},
year = 2010
}
Bisazza and Federico (2010)
Isozaki, Hideki and Sudoh, Katsuhito and Tsukada, Hajime and Duh, Kevin (2010): Head Finalization: A Simple Reordering Rule for SOV Languages, Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
add
@InProceedings{isozaki-EtAl:2010:WMT,
author = {Isozaki, Hideki and Sudoh, Katsuhito and Tsukada, Hajime and Duh, Kevin},
title = {Head Finalization: A Simple Reordering Rule for SOV Languages},
booktitle = {Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {250--257},
url = {http://www.aclweb.org/anthology/W10-1737},
year = 2010
}
Isozaki et al. (2010)
Badr, Ibrahim and Zbib, Rabih and Glass, James (2009): Syntactic Phrase Reordering for English-to-Arabic Statistical Machine Translation, Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)
add
@InProceedings{badr-zbib-glass:2009:EACL,
author = {Badr, Ibrahim and Zbib, Rabih and Glass, James},
title = {Syntactic Phrase Reordering for {E}nglish-to-{A}rabic Statistical Machine Translation},
booktitle = {Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {86--93},
url = {http://www.aclweb.org/anthology/E09-1011},
year = 2009
}
Badr et al. (2009)
Jiang, Jie and Du, Jinhua and Way, Andy (2010): Source-side Syntactic Reordering Patterns with Functional Words for Improved Phrase-based SMT, Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation
add
@InProceedings{jiang-du-way:2010:SSST,
author = {Jiang, Jie and Du, Jinhua and Way, Andy},
title = {Source-side Syntactic Reordering Patterns with Functional Words for Improved Phrase-based SMT},
booktitle = {Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {19--27},
url = {http://www.aclweb.org/anthology/W10-3803},
year = 2010
}
Jiang et al. (2010)
Katz-Brown, Jason and Petrov, Slav and McDonald, Ryan and Och, Franz and Talbot, David and Ichikawa, Hiroshi and Seno, Masakazu and Kazawa, Hideto (2011): Training a Parser for Machine Translation Reordering, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
add
@InProceedings{katzbrown-EtAl:2011:EMNLP,
author = {Katz-Brown, Jason and Petrov, Slav and McDonald, Ryan and Och, Franz and Talbot, David and Ichikawa, Hiroshi and Seno, Masakazu and Kazawa, Hideto},
title = {Training a Parser for Machine Translation Reordering},
booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Edinburgh, Scotland, UK.},
publisher = {Association for Computational Linguistics},
pages = {183--192},
url = {http://www.aclweb.org/anthology/D11-1017},
year = 2011
}
Katz-Brown et al. (2011)
Howlett, Susan and Dras, Mark (2011): Clause Restructuring For SMT Not Absolutely Helpful, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Techologies
add
@InProceedings{howlett-dras:2011:ACL-HLT2011,
author = {Howlett, Susan and Dras, Mark},
title = {Clause Restructuring For {SMT} Not Absolutely Helpful},
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Techologies},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {384--388},
url = {http://www.aclweb.org/anthology/P11-2067},
year = 2011
}
Howlett and Dras (2011)
Andreas, Jacob and Habash, Nizar and Rambow, Owen (2011): Fuzzy Syntactic Reordering for Phrase-based Statistical Machine Translation, Proceedings of the Sixth Workshop on Statistical Machine Translation mentioned in Syntactic Prereordering and POS Chunk Prereordering
add
@InProceedings{andreas-habash-rambow:2011:WMT,
author = {Andreas, Jacob and Habash, Nizar and Rambow, Owen},
title = {Fuzzy Syntactic Reordering for Phrase-based Statistical Machine Translation},
booktitle = {Proceedings of the Sixth Workshop on Statistical Machine Translation},
month = {July},
address = {Edinburgh, Scotland},
publisher = {Association for Computational Linguistics},
pages = {227--236},
url = {http://www.aclweb.org/anthology/W11-2127},
year = 2011
}
Andreas et al. (2011)

MT Research Survey Wiki

A Comprehensive Survey of Neural and Statistical Machine Translation Research Publications

Search Descriptions

Pre-Reordering

Publications

Benchmarks

Discussion

Related Topics

New Publications