Natural Language Processing from a Software Engineering Perspective
Independent thesis, Advanced level (degree of Master (One Year)). Student thesis
This thesis addresses the processing of naturally occurring texts, known as natural language processing (NLP). The subject is approached from a software engineering perspective, and the problem description is formulated accordingly. The thesis is divided into two major parts. The first part is a literature study covering fundamental concepts and algorithms. We discuss both serial and parallel architectures, and conclude that different scenarios call for different architectures. The second part is an empirical evaluation of an NLP framework or toolkit chosen from among a few candidates, conducted to illustrate the theoretical part of the thesis. We argue that component-based development in a portable language could increase reusability in the NLP community, where reuse is currently low. The recent emergence of the initiatives we identified and the great potential of many applications in this area point to a bright future for NLP.
Place, publisher, year, edition, pages
2004. 67 p.
Keywords
Natural Language Processing, Software Engineering, Language Engineering, Architecture
Identifiers
URN: urn:nbn:se:bth-2056
Local ID: oai:bth.se:arkivex01D2D1B5124AF6D4C1256EB6003364A0
OAI: oai:DiVA.org:bth-2056
DiVA: diva2:829319