Search Descriptions

Main Topics

Search Publications


author

title

other

year

Unknown Words

Even with large training corpora, unknown words in the test data will appear.

Unknown Words is the main subject of 12 publications.

Publications

Habash (2008) points out that these come in different types, which require different solutions: unknown names need to be transliterated, unknown morphological variants need to be matched to known ones, and spelling errors need to be corrected. Another case are abbreviation, which may be expanded with a corpus-driven method (Karakos et al., 2008).

Benchmarks

Discussion

Related Topics

New Publications

  • Servan and Dymetman (2015)
  • Tsvetkov and Dyer (2015)
  • Singla et al. (2014)
  • Fishel and Sennrich (2014)
  • Razmara et al. (2013)
  • Huang et al. (2010)
  • Banerjee et al. (2012)
  • Habash and Metsky (2008)
  • Zhang et al. (2009)
  • Mirkin et al. (2009)

Actions

Download

Contribute