Semantic Analysis of Natural Language and Definite Clause Grammar using Statistical Parsing and Thesauri
Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Services that rely on the semantic computations of users’ natural linguistic inputs are becoming more frequent. Computing semantic relatedness between texts is problematic due to the inherit ambiguity of natural language. The purpose of this thesis was to show how a sentence could be compared to a predefined semantic Definite Clause Grammar (DCG). Furthermore, it should show how a DCG-based system could benefit from such capabilities.
Our approach combines openly available specialized NLP frameworks for statistical parsing, part-of-speech tagging and word-sense disambiguation. We compute the semantic relatedness using a large lexical and conceptual-semantic thesaurus. Also, we extend an existing programming language for multimodal interfaces, which uses static predefined DCGs: COactive Language Definition (COLD). That is, every word that should be acceptable by COLD needs to be explicitly defined. By applying our solution, we show how our approach can remove dependencies on word definitions and improve grammar definitions in DCG-based systems.
Place, publisher, year, edition, pages
2013. , 27 p.
COLD, WordNet, thesaurus, semantic analysis, NLP, natural language processing, DFG, definite clause grammar, semantic relatedness
IdentifiersURN: urn:nbn:se:mdh:diva-26142OAI: oai:DiVA.org:mdh-26142DiVA: diva2:757050
Subject / course