Change search
ReferencesLink to record
Permanent link

Direct link
Pattern Acquisition Methods for Information Extraction Systems
Blekinge Institute of Technology, School of Engineering, Department of Systems and Software Engineering.
2007 (English)Independent thesis Advanced level (degree of Master (One Year))Student thesis
Abstract [en]

This master thesis treats about Event Recognition in the reports of Polish stockholders. Event Recognition is one of the Information Extraction tasks. This thesis provides a comparison of two approaches to Event Recognition: manual and automatic. In the manual approach regular expressions are used. Regular expressions are used as a baseline for the automatic approach. In the automatic approach three Machine Learning methods were applied. In the initial experiment the Decision Trees, naive Bayes and Memory Based Learning methods are compared. A modification of the standard Memory Based Learning method is presented which goal is to create a classifier that uses only positives examples in the classification task. The performance of the modified Memory Based Learning method is presented and compared to the baseline and also to other Machine Learning methods. In the initial experiment one type of annotation is used and it is the meeting date annotation. The final experiment is conducted using three types of annotations: the meeting time, the meeting date and the meeting place annotation. The experiments show that the classification can be performed using only one class of instances with the same level of performance.

Place, publisher, year, edition, pages
2007. , 68 p.
Keyword [en]
Natural Language Processing, Information Extraction, Patterns Acquisition, Linguistic Patterns, Memory Based Learning, Event Recognition
National Category
Computer Science Software Engineering
URN: urn:nbn:se:bth-4291Local ID: diva2:831623
(+48)669808616Available from: 2015-04-22 Created: 2007-10-08 Last updated: 2015-06-30Bibliographically approved

Open Access in DiVA

fulltext(1087 kB)38 downloads
File information
File name FULLTEXT01.pdfFile size 1087 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Systems and Software Engineering
Computer ScienceSoftware Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 38 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 25 hits
ReferencesLink to record
Permanent link

Direct link