Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Swe-Clarin: Language Resources and Technology for Digital Humanities
Språkbanken, Department of Swedish, University of Gothenburg.
Språkbanken, Department of Swedish, University of Gothenburg.
Språkbanken, Department of Swedish, University of Gothenburg.
Swedish National Data Service, University of Gothenburg.
Show others and affiliations
2016 (English)In: Extended Papers of the International Symposium on Digital Humanities, 2016, p. 29-51Conference paper, Published paper (Refereed)
Abstract [en]

CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an eresearch tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council.

In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes.

While the pilot projects clearly make substantial contributions to DH, they also reveal some needs for more development, and in particular a need for document-level access to the text materials. As a consequence of this, work has now been initiated in Swe-Clarin to meet this need, so that Swe-Clarin together with HSS scholars investigating intricate research questions can take on the methodological challenges of big-data language-based digital humanities.

Place, publisher, year, edition, pages
2016. p. 29-51
Keywords [en]
Swe-Clarin, CLARIN, digital humanities, language technology
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-337520OAI: oai:DiVA.org:uu-337520DiVA, id: diva2:1169893
Conference
International Symposium on Digital Humanities, Nov. 7-8, 2016, Växjö, Sweden
Available from: 2017-12-30 Created: 2017-12-30 Last updated: 2018-01-13Bibliographically approved

Open Access in DiVA

fulltext(1892 kB)12 downloads
File information
File name FULLTEXT01.pdfFile size 1892 kBChecksum SHA-512
aee9c16fddacfb701e005f9d439b96e9cc959e5840d0b431681c02a020ab4228d66737f47dabaac6153d5e155cee23c6a7ba3a55efcde18e620ade94152770ab
Type fulltextMimetype application/pdf

Other links

http://ceur-ws.org/Vol-2021/paper2.pdf

Search in DiVA

By author/editor
Viklund, JonMegyesi, BeátaNäsman, JesperPalmér, Anne
By organisation
Department of LiteratureDepartment of Linguistics and PhilologyDepartment of Scandinavian Languages
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 12 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 58 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf