Proceedings chapter CC BY-NC-SA 3.0
The More the Better? Assessing the Influence of Wikipedia's Growth on Semantic Relatedness Measures
Wikipedia has been used as a knowledge source in many areas of natural language processing. As most studies only use a certain Wikipedia snapshot, the influence of Wikipedia’s massive growth on the results is largely unknown. For the first time, we perform an in-depth analysis of this influence using semantic relatedness as an example application that tests a wide range of Wikipedia’s properties. We find that the growth of Wikipedia has almost no effect on the correlation of semantic relatedness measures with human judgments, while the coverage steadily increases.
Use and reproduction: This work may be used under a Creative Commons Attribution - NonCommercial - ShareAlike 3.0 License (CC BY-NC-SA 3.0)