An Evaluation of NLP Toolkits for Information Quality Assessment
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Documentation is often the first source a user consults to solve a problem or to learn the conditions of use of a product. It should therefore be clear and understandable. But what does “understandable” mean, and how can we detect whether a text is unclear? This thesis addresses those questions.
The main idea of this work is to measure the clarity of textual information using natural language processing capabilities. There are three main steps to achieve this goal: to define criteria for poor clarity of textual information, to evaluate different natural language toolkits and select those suitable for our purposes, and to implement a prototype system that, given a text, measures its clarity.
The thesis project is planned for inclusion in VizzAnalyzer (a quality analysis tool that processes information at the structure level). Its main task is to perform a clarity analysis of the text information that VizzAnalyzer extracts from different XML files.
Place, publisher, year, edition, pages
2012. 34 p.
Natural language processing analysis, information quality, clarity guidelines, natural language processing toolkits, graph format
Computer Science Software Engineering
Identifiers
URN: urn:nbn:se:lnu:diva-22606
OAI: oai:DiVA.org:lnu-22606
DiVA: diva2:571819
Software Technology Programme, Master Programme, 120 credits
2012-09-26, D1136, SE-351 95, Vaxjo, 17:15 (English)
Uppsok
Physics, Chemistry, Mathematics
Ericsson, Morgan
Hall, Johan
Wingkvist, Anna