Tracking Online Trend Locations using a Geo-Aware Topic Model
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
In automatically categorizing massive corpora of text, various topic models have been applied with good success. Much work has been done on applying machine learning and NLP methods on Internet media, such as Twitter, to survey online discussion. However, less focus has been placed on studying how geographical locations discussed in online fora evolve over time, and even less on associating such location trends with topics. Can online discussions be geographically tracked over time? This thesis attempts to answer this question by evaluating a geo-aware Streaming Latent Dirichlet Allocation (SLDA) implementation which can recognize location terms in text. We show how the model can predict time-dependent locations of the 2016 American primaries by automatic discovery of election topics in various Twitter corpora, and deduce locations over time.
Place, publisher, year, edition, pages
2016. , 62 p.
geo-aware topic model lda latent dirichlet geographic location
IdentifiersURN: urn:nbn:se:kth:diva-190887OAI: oai:DiVA.org:kth-190887DiVA: diva2:953540
Subject / course
Master of Science in Engineering - Computer Science and Technology
2016-06-16, KTH D building, KTH, Stockholm, 05:10 (English)
Brynielsson, Joel, Adjunct Associate ProfessorLozano, Marianela Garcia
Lansner, Anders, Professor