Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Swedish Natural Language Processing with Long Short-term Memory Neural Networks: A Machine Learning-powered Grammar and Spell-checker for the Swedish Language
Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM).
Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM).
2018 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

Natural Language Processing (NLP) is a field studying computer processing of human language. Recently, neural network language models, a subset of machine learning, have been used to great effect in this field. However, research remains focused on the English language, with few implementations in other languages of the world. This work focuses on how NLP techniques can be used for the task of grammar and spelling correction in the Swedish language, in order to investigate how language models can be applied to non-English languages. We use a controlled experiment to find the hyperparameters most suitable for grammar and spelling correction on the Göteborgs-Posten corpus, using a Long Short-term Memory Recurrent Neural Network. We present promising results for Swedish-specific grammar correction tasks using this kind of neural network; specifically, our network has a high accuracy in completing these tasks, though the accuracy achieved for language-independent typos remains low.

Place, publisher, year, edition, pages
2018. , p. 33
Keywords [en]
natural language processing, machine learning, long short-term memory, recurrent neural networks, grammar correction, spelling correction, Swedish language
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:lnu:diva-76819OAI: oai:DiVA.org:lnu-76819DiVA, id: diva2:1232482
Subject / course
Computer Science
Presentation
2018-05-29, 15:15 (English)
Supervisors
Examiners
Available from: 2018-07-12 Created: 2018-07-11 Last updated: 2018-07-12Bibliographically approved

Open Access in DiVA

fulltext(706 kB)91 downloads
File information
File name FULLTEXT01.pdfFile size 706 kBChecksum SHA-512
b62d841d0717eea7d3367365605f2693e1f904e19bfcb4506164231117c8faee57c93459d6f2b7189c03c6d53d3881afbb49dbff6270a441139ab23cea340659
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Gudmundsson, JohanMenkes, Francis
By organisation
Department of computer science and media technology (CM)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 91 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 244 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf