Conference paper CC BY 4.0
Published

Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks

The lack of a sufficient amount of data tailored for a task is a well-recognized problem for many statistical NLP methods. In this paper, we explore whether data sparsity can be successfully tackled when classifying language proficiency levels in the domain of learner-written output texts. We aim to overcome data sparsity by incorporating knowledge into the trained model from another domain consisting of input texts written by teaching professionals for learners. We compare different domain adaptation techniques and find that a weighted combination of the two types of data performs best and can even rival systems based on considerably larger amounts of in-domain data. Moreover, we show that normalizing errors in learners' texts can substantially improve classification when level-annotated in-domain data is not available.
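To illustrate the "weighted combination of the two types of data" mentioned in the abstract, the sketch below shows one common way to realize such a scheme: training a single classifier on pooled coursebook (out-of-domain) and learner (in-domain) examples while up-weighting the scarce in-domain instances. This is a minimal, hypothetical illustration using scikit-learn; the feature matrices, the classifier, the weight value, and the function name are assumptions and do not reproduce the paper's actual setup.

```python
# Minimal sketch of an instance-weighted combination of coursebook and learner data.
# All inputs (feature matrices, labels) and the weight value are hypothetical placeholders.
import numpy as np
from sklearn.linear_model import LogisticRegression


def train_weighted_combination(X_coursebook, y_coursebook,
                               X_learner, y_learner,
                               learner_weight=5.0):
    """Train a proficiency-level classifier on pooled coursebook (out-of-domain)
    and learner (in-domain) examples, up-weighting the in-domain instances."""
    X = np.vstack([X_coursebook, X_learner])
    y = np.concatenate([y_coursebook, y_learner])
    # Coursebook examples keep weight 1.0; learner examples receive a larger weight
    # so the small in-domain sample still shapes the decision boundary.
    weights = np.concatenate([
        np.ones(len(y_coursebook)),
        np.full(len(y_learner), learner_weight),
    ])
    clf = LogisticRegression(max_iter=1000)
    clf.fit(X, y, sample_weight=weights)
    return clf
```

In this kind of setup, the relative weight given to the in-domain learner data would typically be tuned on held-out data; the fixed value shown here is purely illustrative.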


Rights

Use and reproduction:
This work may be used under a Creative Commons Attribution 4.0 License (CC BY 4.0).