Cognate Production using Character-based Machine Translation

Beinborn, Lisa; Zesch, Torsten; Gurevych, Iryna

doi:10.17185/duepublico/72180

Tagungsbeitrag 2013 CC BY-NC-SA 3.0

Veröffentlicht

Cognate Production using Character-based Machine Translation

Beinborn, Lisa ; Zesch, Torsten ; Gurevych, Iryna

Cognates are words in different languages that are associated with each other by language learners. Thus, cognates are important indicators for the prediction of the perceived difficulty of a text. We introduce a method for automatic cognate production using character-based machine translation. We show that our approach is able to learn production patterns from noisy training data and that it works for a wide range of language pairs. It even works across different alphabets, e.g. we obtain good results on the tested language pairs English-Russian, English-Greek, and English-Farsi. Our method performs significantly better than similarity measures used in previous work on cognates.

Vorschau

Einordnung

Konferenz:: Sixth International Joint Conference on Natural Language Processing, October 14-18, 2013, Nagoya, Japan
Datum der Veröffentlichung:: 2013
URN:: urn:nbn:de:hbz:464-20211027-095940-6
DOI:: 10.17185/duepublico/72180
Sprache:: Englisch
Ressourcentyp:: Text
Kollektion:: E-Publikationen
Sachgruppen der Deutschen Nationalbibliographie:: 004 Informatik
Link URL:: https://aclanthology.org/I13-1000
Einrichtung:: Fakultät für Ingenieurwissenschaften, Informatik und Angewandte Kognitionswissenschaft, Informatik, Sprachtechnologie
Informationen zur Erstveröffentlichung:: Beinborn, L., Zesch, T, Gurevych, I. (2013): Cognate Production using Character-based Machine Translation. In: Proceedings of the Sixth International Joint Conference on Natural Language Processing, pages 883-891. https://aclanthology.org/I13-1000