Semantic and Verbatim Word Spotting using Deep Neural Networks
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction.
2016 (English), Conference paper (Refereed)
Abstract [en]

In the last few years, deep convolutional neural networks have become ubiquitous in computer vision, achieving state-of-the-art results on problems like object detection, semantic segmentation, and image captioning. However, they have not yet been widely investigated in the document analysis community. In this paper, we present a word spotting system based on convolutional neural networks. We train a network to extract a powerful image representation, which we then embed into a word embedding space. This allows us to perform word spotting using both query-by-string and query-by-example in a variety of word embedding spaces, both learned and handcrafted, for verbatim as well as semantic word spotting. Our novel approach is versatile, and the evaluation shows that it outperforms the previous state of the art for word spotting on standard datasets.

Place, publisher, year, edition, pages
2016.
Keyword [en]
handwritten word spotting, convolutional neural networks, deep learning, word embeddings
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computerized Image Processing
Identifiers
URN: urn:nbn:se:uu:diva-306667
OAI: oai:DiVA.org:uu-306667
DiVA: diva2:1044046
Conference
International Conference on Frontiers in Handwriting Recognition (ICFHR), October 23-26, 2016, Shenzhen, China.
Projects
q2b
Funder
Swedish Research Council, 2012-5743
Riksbankens Jubileumsfond, NHS14-2068:1
Available from: 2016-11-01 Created: 2016-11-01 Last updated: 2016-11-03

Open Access in DiVA

fulltext (699 kB), 32 downloads
File information
File name: FULLTEXT01.pdf
File size: 699 kB
Checksum SHA-512:
fbc1d8ffa156dfe637ae2423fd2c85ee2055d9a5dc7d1d6498844f88cfe45b7d6f05e7b3aa1893944720538fcc8a5c2f2f3469eb847371531838764bbc85f36d
Type: fulltext
Mimetype: application/pdf

Authors
Wilkinson, Tomas; Brun, Anders
