SVALA: Annotation of Second-Language Learner Text Based on Mostly Automatic Alignment of Parallel Corpora
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics. ORCID iD: 0000-0003-4040-3544
2019 (English). In: Selected papers from the CLARIN Annual Conference 2018, Pisa, 8-10 October 2018 / [ed] Inguna Skadina, Maria Eskevich, Linköping: Linköping University Electronic Press, 2019, pp. 222-234, article id 023. Conference paper, published paper (peer-reviewed)
Abstract [en]

Annotation of second-language learner text is a cumbersome manual task which in turn requires interpretation to postulate the intended meaning of the learner’s language. This paper describes SVALA, a tool which separates the logical steps in this process while providing rich visual support for each of them. The first step is to pseudonymize the learner text to fulfil the legal and ethical requirements for a distributable learner corpus. The second step is to correct the text, which is carried out in the simplest possible way by text editing. During the editing, SVALA automatically maintains a parallel corpus with alignments between words in the learner source text and corrected text, while the annotator may repair inconsistent word alignments. Finally, the actual labelling of the corrections (the postulated errors) is performed. We describe the objectives, design and workflow of SVALA, and our plans for further development.
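The idea of keeping word alignments between the learner source text and its corrected version can be illustrated with a minimal sketch. This is not SVALA's own alignment procedure, which the abstract does not specify. It assumes whitespace tokenization and uses the generic sequence matcher from Python's standard `difflib` module; the function name `align_tokens` and the example sentences are hypothetical.

```python
from difflib import SequenceMatcher


def align_tokens(source, target):
    """Return alignment groups as (source_tokens, target_tokens) pairs.

    Illustrative only: a generic diff-based aligner, not SVALA's method.
    """
    matcher = SequenceMatcher(a=source, b=target, autojunk=False)
    groups = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged words are linked one-to-one.
            groups.extend(([s], [t]) for s, t in zip(source[i1:i2], target[j1:j2]))
        else:
            # Replaced, deleted, or inserted spans are kept as one group;
            # one side is empty for a pure deletion or insertion.
            groups.append((source[i1:i2], target[j1:j2]))
    return groups


# Hypothetical learner sentence and its correction.
source = "I has two cat".split()
target = "I have two cats".split()
for src, tgt in align_tokens(source, target):
    print(src, "->", tgt)
```

Each group whose two sides differ corresponds to a correction that an annotator could later label; a diff-based aligner like this can link words wrongly, which is why a tool of this kind lets the annotator repair alignments by hand.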

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2019. pp. 222-234, article id 023
Series
Linköping Electronic Conference Proceedings, ISSN 1650-3686, E-ISSN 1650-3740 ; 159
Keywords [en]
Normalization, Error annotation, Learner corpora, Parallel corpora, Word alignment
National Category
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:su:diva-170363; ISBN: 978-91-7685-034-3 (print); OAI: oai:DiVA.org:su-170363; DiVA, id: diva2:1332091
Conference
CLARIN Annual Conference, Pisa, Italy, 8-10 October, 2018
Funder
Riksbankens Jubileumsfond, IN16-0464:1
Available from: 2019-06-27. Created: 2019-06-27. Last updated: 2019-06-28. Bibliographically approved.

By author/editor
Wirén, Mats; Volodina, Elena