Semantic Processing of a Hungarian Ethnographic Corpus
2010 (English)In: Proceedings of the 1st International AMICUS Workshop, October 21, 2010, Vienna, Austria, 112-115 p.Article in journal (Refereed)
In this poster, a Hungarian ethnographic database containing
linguistic annotation is presented. The corpus contains
texts from three domains, namely, folk beliefs, t altos texts
and tales. All the possible morphosyntactic analyses assigned
to each word and the appropriate one selected from
them (based on contextual information) are also marked.
Syntactic (dependency) annotation is added semi-automatically
to the corpus texts at a second phase of the processing.
With the help of these enriched linguistic attributes,
the texts can be semantically analyzed and clustered. The
research and development team is working on a semantic
search tool enabling to browse the texts on the basis of their
semantic meaning. The proposed technology may result in
a new approach to the ethnographic research and may open
a new type of access to the databases.
Place, publisher, year, edition, pages
University of Szeged , 2010. 112-115 p.
ethnography, folktales, folk beliefs, semantic processing, semantic annotation, natural language processing, NLP, Computational ethnography
Language Technology (Computational Linguistics) Information Studies
IdentifiersURN: urn:nbn:se:hb:diva-2895Local ID: 2320/7171ISBN: 978-963-306-069-8OAI: oai:DiVA.org:hb-2895DiVA: diva2:870989