Fast or Accurate? – A Comparative Evaluation of PoS Tagging Models

Horsmann, Tobias; Erbs, Nicolai; Zesch, Torsten

doi:10.17185/duepublico/72101

Tagungsbeitrag 2015 CC BY-NC-SA 4.0

Veröffentlicht

Fast or Accurate? – A Comparative Evaluation of PoS Tagging Models

Horsmann, Tobias ; Erbs, Nicolai ; Zesch, Torsten

We perform a comparison of 22 PoS tagger models for English and German offered by 9 different implementations. By evaluating on a mix of corpora from different domains, we simulate a black-box usage where researchers select a tagger (because of popularity, ease of use, etc.) and apply it to all sorts of text. We find the expected trade-off between fast models with relatively low accuracy and slower models with higher accuracy. The choice of the model, even for the same tagger, does matter and the model should always be chosen for the task at hand. Our evaluation provides researchers with a basis for selecting taggers according to their needs.

Vorschau

Einordnung

Konferenz:: International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2015), Sep 30 – Oct 2, 2015, University of Duisburg-Essen, Germany
Datum der Veröffentlichung:: 2015
URN:: urn:nbn:de:hbz:464-20211021-164937-9
DOI:: 10.17185/duepublico/72101
Sprache:: Englisch
Ressourcentyp:: Text
Kollektion:: E-Publikationen
Sachgruppen der Deutschen Nationalbibliographie:: 004 Informatik
Link URL:: https://konvens.org/proceedings/2015/index.html
Einrichtung:: Fakultät für Ingenieurwissenschaften, Informatik und Angewandte Kognitionswissenschaft, Informatik, Sprachtechnologie
Informationen zur Erstveröffentlichung:: Horsmann, T., Erbs, N., Zesch, T. (2015): Fast or Accurate? – A Comparative Evaluation of PoS Tagging Models. In: Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2015), Sep 30 – Oct 2, 2015, University of Duisburg-Essen, Germany, pp. 22-30. https://www.ltl.uni-due.de/wp-content/uploads/posTaggerEvaluation.pdf