New Measures to Investigate Term Typology by Distributional Data
2013 (English). In: Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013), May 22–24, 2013, Oslo University, Norway. NEALT Proceedings Series 16 / [ed] Stephan Oepen, Kristin Hagen, Janne Bondi Johannessen, Linköping: Linköping University Electronic Press, 2013. Conference paper, Poster (Refereed)
This report describes a series of exploratory experiments to establish whether terms of different semantic types can be distinguished in useful ways in a semantic space constructed from distributional data. The hypotheses explored in this paper are that some words vary more in their distribution than others; that the varying semantic character of words is reflected in their distribution; and that this distributional difference is encoded in current distributional models but is not accessible through the methods typically used to apply them. The paper proposes new measures for exploring variation that is encoded in distributional models but is not usually put to use in understanding the character of the words represented in them. The exploratory findings show that some of the proposed measures vary widely across words of various types.
Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2013.
Term typology, distributional semantics
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-122920
ISBN: 1650-3740
OAI: oai:DiVA.org:kth-122920
DiVA: diva2:624092
19th Nordic Conference of Computational Linguistics (NODALIDA 2013)
Funder: Swedish Research Council
QC 20130710
2013-05-30, 2013-05-30, 2013-07-10. Bibliographically approved