Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automatic identification and cropping of rectangular objects in digital images
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2012 (English)Independent thesis Advanced level (degree of Master (One Year)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Today, digital images are commonly used to preserve and present analogue media. To minimize the need for digital storage space, it is important that the object covers as large part of the image as possible. This paper presents a robust methodology, based on common edge and line detection techniques, to automatically identify rectangular objects in digital images. The methodology is tailored to identify posters, photographs and books digitized at the National Library of Sweden (the KB). The methodology has been implemented as a part of DocCrop, a computer program written in Java to automatically identify and crop documents in digital images. With the aid of the developed tool, the KB hopes to decrease the time and manual labour required to crop their digital images.

Three multi-paged documents digitized at the KB have been used to evaluate the tool's performance. Each document features different characteristics. The overall identification results, as well as an in-depth analysis of the different methodology stages, are presented in this paper. In average, the developed software identified 98% of the digitized document pages successfully. The software's identification success rate never went below 95% for any of the three documents. The robustness and execution speed of the methodology suggests that the methodology can be a compelling alternative to the manual identification used at the KB today.

Place, publisher, year, edition, pages
2012.
Series
IT, 12 040
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:uu:diva-180381OAI: oai:DiVA.org:uu-180381DiVA: diva2:549806
Educational program
Freestanding course
Uppsok
Technology
Supervisors
Examiners
Available from: 2012-09-05 Created: 2012-09-05 Last updated: 2012-09-05Bibliographically approved

Open Access in DiVA

fulltext(12370 kB)4872 downloads
File information
File name FULLTEXT01.pdfFile size 12370 kBChecksum SHA-512
623ccef4e4deb4e5adb72e092b21e823f535081cd4c1e1df610c08a85031657ad1e87679bfc8e6d29b684eaa87e87cfa4bb2cef95e8c60af1690df1e39edcc7d
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 4872 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 479 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf