Report on the Fourth Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR 11)
KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS. ORCID iD: 0000-0003-4042-4919
2012 (English) In: SIGIR Forum, ISSN 0163-5840, E-ISSN 1558-0229, Vol. 46, no 1, 56-64 p. Article in journal (Refereed) Published
Abstract [en]

There is an increasing amount of structure on the Web as a result of modern Web languages, user tagging and annotation, and emerging robust NLP tools. These meaningful semantic annotations hold the promise of significantly enhancing information access by increasing the depth of analysis of today’s systems. Currently, we have only started to explore the possibilities and only begun to understand how these valuable semantic cues can be put to fruitful use. The workshop had an interactive format consisting of keynotes, boasters and posters, breakout groups and reports, and a final discussion, which was prolonged into the evening. There was a strong feeling that we made substantial progress. Specifically, each of the breakout groups contributed to our understanding of the way forward. First, annotations and use cases come in many different shapes and forms depending on the domain at hand, but at a higher level there are remarkable commonalities in annotation tools, indexing methods, user interfaces, and general methodology. Second, we gained insight into the "exploitation" aspects, leading to a clear separation between low-level annotations that give context or meaning to small units of information (e.g., NLP, sentiments, entities) and annotations that bring out the structure inherent in the data (e.g., sources, data schemas, document genres). Third, the plan to enrich ClueWeb with various document-level (e.g., PageRank and spam scores, but also reading level) and lower-level (e.g., named entities or sentiments) annotations was embraced by the workshop as a concrete next step to promote research in semantic annotations.

Place, publisher, year, edition, pages
DBLP, 2012. Vol. 46, no 1, 56-64 p.
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-116364
OAI: oai:DiVA.org:kth-116364
DiVA: diva2:589141
Note

QC 20130129

Available from: 2013-01-29. Created: 2013-01-17. Last updated: 2017-12-06. Bibliographically approved.

Open Access in DiVA

fulltext (694 kB), 114 downloads
File information
File name: FULLTEXT01.pdf
File size: 694 kB
Checksum (SHA-512): b9d2e451c09ed5cf8d5e6c264384601a47ddf9ea40278023c71c78b8662a89ac28814540861f12548d5ba0cf4482df55aac0382a88494ad03c615d34fd138f9c
Type: fulltext
Mimetype: application/pdf

Authority records

Karlgren, Jussi
