Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Developing Optical Character Recoginition for Ethiopic Scripts
Dalarna University, School of Technology and Business Studies, Computer Engineering.
2011 (English)Independent thesis Advanced level (degree of Master (Two Years))Student thesis
Abstract [en]

The Amharic language is the Official language of over 70 million people mainly in Ethiopia. An extensive literature survey and the government report reveal no single Amharic character recognition is found in the country. The Amharic script has 33 basic characters each with seven orders giving 310 distinct characters, including numbers and punctuation symbols. The characters are visually similar; there is a typeface, but no capitalization. Beside this there is no any standard font to use the language in the computer but they use different fonts developed by different stakeholders without keeping a standard on their own way and interest and this create a problem of incompatibility between different fonts and documents. This project is to investigate the reason why Amharic optical character recognition is not addressed by local and international researchers and developers and finally to develop Amharic optical character recognition uses the features and facilities of Microsoft windows Vista or 7 using Unicode standard.

Place, publisher, year, edition, pages
Borlänge, 2011. , 76 p.
Keyword [en]
Ethiopic, Geez, Amharic, SVM, OCR, Latin, Non-Latin.
Identifiers
URN: urn:nbn:se:du-5541OAI: oai:dalea.du.se:5541DiVA: diva2:519067
Uppsok
Technology
Supervisors
Available from: 2011-06-01 Created: 2011-06-01 Last updated: 2012-04-24Bibliographically approved

Open Access in DiVA

fulltext(1667 kB)