The More the Better? Assessing the Influence of Wikipedia's Growth on Semantic Relatedness Measures
Wikipedia has been used as a knowledge source in many areas of natural language processing. Because most studies use only a single Wikipedia snapshot, the influence of Wikipedia’s massive growth on their results is largely unknown. For the first time, we perform an in-depth analysis of this influence, using semantic relatedness as an example application that tests a wide range of Wikipedia’s properties. We find that the growth of Wikipedia has almost no effect on the correlation of semantic relatedness measures with human judgments, while coverage steadily increases.
Rights
Use and reproduction:
This work may be used under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 license (CC BY-NC-SA 3.0).