Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Bootstrapping Language Description: The case of Mpiemo (Bantu A, Central African Republic)
Department of Computing Science, Chalmers University, Gothenburg.
Department of African Languages, Gothenburg University, Gothenburg.
Department of African Languages, Gothenburg University, Gothenburg.
Uppsala universitet, Humanistisk-samhällsvetenskapliga vetenskapsområdet, Språkvetenskapliga fakulteten, Institutionen för lingvistik och filologi.
2008 (engelsk)Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a time-consuming process, which has already adapted to improved technology for recording and storage. We present here a novel application of NLP techniques to bootstrap analysis of collected data and speed-up manual selection work. To be more precise, we argue that unsupervised induction of morphology and part-of-speech analysis from raw text data is mature enough to produce useful results. Experiments with Latent Semantic Analysis were less fruitful. We exemplify this on Mpiemo, a so-far essentially undescribed Bantu language of the Central African Republic, for which raw text data was available.

sted, utgiver, år, opplag, sider
2008.
Emneord [en]
Mpiemo, Bantu A, Central African Republic, NLP, Latent Semantic Analysis, bootstrapping
HSV kategori
Identifikatorer
URN: urn:nbn:se:uu:diva-126666OAI: oai:DiVA.org:uu-126666DiVA, id: diva2:326014
Konferanse
Sixth international conference on Language Resources and Evaluation, LREC 2008, 28-30 May 2008, Marrakech
Tilgjengelig fra: 2010-06-30 Laget: 2010-06-21 Sist oppdatert: 2018-12-06bibliografisk kontrollert

Open Access i DiVA

fulltekst(153 kB)182 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 153 kBChecksum SHA-512
1003785b34ed450fe11dcc96e0a7b606b55f1b40919d417fb02764670007aa786fb2ae8302fccdfa937ea1083e0262933fb53a4be264757235f8218b498d8324
Type fulltextMimetype application/pdf

Andre lenker

http://www.lrec-conf.org/proceedings/lrec2008/pdf/848_paper.pdf

Søk i DiVA

Av forfatter/redaktør
Hammarström, HaraldWesterlund, Torbjörn
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 182 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

urn-nbn

Altmetric

urn-nbn
Totalt: 501 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf