Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Inferring the location of authors from words in their texts
Gavagai.
KTH, Skolan för datavetenskap och kommunikation (CSC), Teoretisk datalogi, TCS.ORCID-id: 0000-0003-4042-4919
Stockholms universitet.
Stockholms universitet.
2015 (engelsk)Inngår i: Proceedings of the 20th Nordic Conference of Computational Linguistics, Linköping University Electronic Press, 2015Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

For the purposes of computational dialec- tology or other geographically bound text analysis tasks, texts must be annotated with their or their authors’ location. Many texts are locatable but most have no ex- plicit annotation of place. This paper describes a series of experiments to de- termine how positionally annotated mi- croblog posts can be used to learn loca- tion indicating words which then can be used to locate blog texts and their authors. A Gaussian distribution is used to model the locational qualities of words. We in- troduce the notion of placeness to describe how locational words are.

We find that modelling word distributions to account for several locations and thus several Gaussian distributions per word, defining a filter which picks out words with high placeness based on their local distributional context, and aggregating lo- cational information in a centroid for each text gives the most useful results. The re- sults are applied to data in the Swedish language. 

sted, utgiver, år, opplag, sider
Linköping University Electronic Press, 2015.
Serie
Linköping Electronic Conference Proceedings, ISSN 1650-3740 ; 109
HSV kategori
Forskningsprogram
Informations- och kommunikationsteknik
Identifikatorer
URN: urn:nbn:se:kth:diva-169619ISBN: 978-91-7519-098-3 (tryckt)OAI: oai:DiVA.org:kth-169619DiVA, id: diva2:823404
Konferanse
NoDaLiDa,May 11–13, 2015 in Vilnius, Lithuania
Prosjekter
SINUS (Spridning av innovationer i nutida svenska)
Forskningsfinansiär
Swedish Research Council
Merknad

Qc 20150618

Tilgjengelig fra: 2015-06-18 Laget: 2015-06-18 Sist oppdatert: 2018-01-11bibliografisk kontrollert

Open Access i DiVA

fulltext(26529 kB)131 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 26529 kBChecksum SHA-512
7d751056fd05248cc1de77ed1a51d9e35a4e74e67fb7d9533bcc26904869da99fe1e15f580d131b132d526944b2f1ee26e31dec9cdb0cf439bf0c47649bac6b3
Type fulltextMimetype application/pdf

Andre lenker

http://aclweb.org/anthology/W/W15/W15-1826.pdfConference website

Søk i DiVA

Av forfatter/redaktør
Karlgren, Jussi
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 131 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 603 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf