Reflective View on Text SimilarityA

Bär, Daniel; Zesch, Torsten; Gurevych, Iryna

doi:10.17185/duepublico/72190

Tagungsbeitrag 2011 CC BY-NC-SA 3.0

Veröffentlicht

A Reflective View on Text Similarity

Bär, Daniel ; Zesch, Torsten ; Gurevych, Iryna

While the concept of similarity is well grounded in psychology, text similarity is less well-defined. Thus, we analyze text similarity with respect to its definition and the datasets used for evaluation. We formalize text similarity based on the geometric model of conceptual spaces along three dimensions inherent to texts: structure, style, and content. We empirically ground these dimensions in a set of annotation studies, and categorize applications according to these dimensions. Furthermore, we analyze the characteristics of the existing evaluation datasets, and use those datasets to assess the performance of common text similarity measures.

Vorschau

Einordnung

Konferenz:: International Conference Recent Advances in Natural Language Processing 2011, Hissar, Bulgaria 12-14 September, 2011
Datum der Veröffentlichung:: 2011
URN:: urn:nbn:de:hbz:464-20211028-162008-6
DOI:: 10.17185/duepublico/72190
Sprache:: Englisch
Ressourcentyp:: Text
Kollektion:: E-Publikationen
Sachgruppen der Deutschen Nationalbibliographie:: 004 Informatik
Link URL:: https://aclanthology.org/R11-1071
Einrichtung:: Fakultät für Ingenieurwissenschaften, Informatik und Angewandte Kognitionswissenschaft, Informatik, Sprachtechnologie
Abgrenzungspolitik:: Bär, D., Zesch, T., Gurevych, I. (2011) A Reflective View on Text Similarity. In: Proceedings of the International Conference Recent Advances in Natural Language Processing 2011, pp. 515–520. Association for Computational Linguistics. https://aclanthology.org/R11-1071