Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Faces, Fights, and Families: Topic Modeling and Gendered Themes in Two Corpora of Swedish Prose Fiction
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology.
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Arts, Department of Literature, Sociology of Literature.ORCID iD: 0000-0001-7280-1112
2019 (English)In: DHN 2019 Copenhagen, Proceedings of 4th Conference of The Association Digital Humanities in the Nordic Countries Copenhagen, March 6-8 2019 / [ed] Constanza Navaretta et al., 2019, p. 92-111Conference paper, Published paper (Refereed)
Abstract [en]

This paper explores topic modeling (TM) as a tool for “dis- tant reading” of two Swedish literary corpora. We investigate what kinds of insight and knowledge a TM-based approach can provide to Swedish literary history, and which methodological difficulties are associated with this endeavour. The TM is based on 12- and 24-term chunks of selected verb and common noun lemmas. We generate models with 20, 40, and 100 topics. We also propose a method for a quantitative and qualita- tive gendered thematic analysis by combining TM with a study of how the topics relate to gender in characters and authors. The two corpora contain, respectively, Swedish classics (1821–1941) and recent bestsellers (2004–2017). We find that most of the topics proposed by the TM are easy to interpret as conceptual themes, and that the “same” themes ap- pear for the two corpora and for different TM settings. The study allows us to make interesting observations concerning different aspects of gender and topic distribution.

Place, publisher, year, edition, pages
2019. p. 92-111
Keywords [en]
Topic Modeling · Distant Reading; Gender Analysis; Literary Methodology; Swedish Prose Fiction; Bestsellers
National Category
General Literature Studies Language Technology (Computational Linguistics)
Research subject
Literature; Computational Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-382230OAI: oai:DiVA.org:uu-382230DiVA, id: diva2:1306386
Conference
DHN 2019, 4th Digital Humanities in the Nordic Countries 2019, University of Copenhagen, Copenhagen, Denmark, March 6–8, 2019
Available from: 2019-04-23 Created: 2019-04-23 Last updated: 2019-09-12Bibliographically approved

Open Access in DiVA

fulltext(509 kB)19 downloads
File information
File name FULLTEXT02.pdfFile size 509 kBChecksum SHA-512
5f853d0ab95067fbb56a034033d59240e27c9a33f4af92d6e505107d8bac27869272c500cbf27a2ce13f6d44879d28d29af33ff11ffb900b9db9c136b908eab0
Type fulltextMimetype application/pdf

Other links

ProceedingsPaper

Search in DiVA

By author/editor
Dahllöf, MatsBerglund, Karl
By organisation
Department of Linguistics and PhilologySociology of Literature
General Literature StudiesLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 60 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 203 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf