Change search
ReferencesLink to record
Permanent link

Direct link
Can a graded reader of authentic material be generated?
Linköping University, Department of Computer and Information Science. Linköping University, The Institute of Technology.
2013 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The thesis investigates if a graded reader for English leveled to the CEFR levels by using the English Vocabulary Profile (EVP) dictionary can be generated from a corpus of authentic material. It was tested on Wikipedia and the ukWaC corpus. There were some problems in making correctmatches between the words in the EVP word lists with the tagged words of the corpora. The results show it might be possible to find enough suitable texts to generate a graded reader for at least the higher CEFR levels if only lemmas are considered. If also the POS tags should be matched between the word list and the corpora the errors were too big to be able to give a conclusive answer.

Place, publisher, year, edition, pages
2013. , 69 p.
National Category
Computer Science
URN: urn:nbn:se:liu:diva-100131ISRN: LIU-IDA/LITH-EX-A--13/050--SEOAI: diva2:660072
Subject / course
Computer and information science at the Institute of Technology
Available from: 2014-03-04 Created: 2013-10-28 Last updated: 2014-03-04Bibliographically approved

Open Access in DiVA

fulltext(732 kB)207 downloads
File information
File name FULLTEXT01.pdfFile size 732 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Computer and Information ScienceThe Institute of Technology
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 207 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 93 hits
ReferencesLink to record
Permanent link

Direct link