A Natural Language Interface for Querying Linked Data
2020 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE credits
Student thesis
Abstract [en]
The thesis introduces a proof of concept idea that could spark great interest from many industries. The idea consists of a remote Natural Language Interface (NLI), for querying Knowledge Bases (KBs). The system applies natural language technology tools provided by the Stanford CoreNLP, and queries KBs with the use of the query language SPARQL. Natural Language Processing (NLP) is used to analyze the semantics of a question written in natural language, and generates relational information about the question. With correctly defined relations, the question can be queried on KBs containing relevant Linked Data. The Linked Data follows the Resource Description Framework (RDF) model by expressing relations in the form of semantic triples: subject-predicate-object.
With our NLI, any KB can be understood semantically. By providing correct training data, the AI can learn to understand the semantics of the RDF data stored in the KB. The ability to understand the RDF data allows for the process of extracting relational information from questions about the KB. With the relational information, questions can be translated to SPARQL and be queried on the KB.
Place, publisher, year, edition, pages
2020. , p. 47
Keywords [en]
SPARQL, NLP, RDF, Semantic Web, Knowledge Base, Knowledge Graph
National Category
Language Technology (Computational Linguistics) Computer Sciences
Identifiers
URN: urn:nbn:se:kau:diva-78921OAI: oai:DiVA.org:kau-78921DiVA, id: diva2:1449614
External cooperation
Redpill-Linpro
Subject / course
Computer Science
Educational program
Engineering: Computer Engineering (300 ECTS credits)
Supervisors
Examiners
2020-06-302020-06-302020-06-30Bibliographically approved