Digitala Vetenskapliga Arkivet

Biologically-Based Interactive Neural Network Models for Visual Attention and Object Recognition
Linköping University, Department of Computer and Information Science. Linköping University, The Institute of Technology.
2012 (English). Doctoral thesis, monograph (Other academic)
Abstract [en]

The main focus of this thesis is the development of biologically-based computational models for object recognition. A series of models for attention and object recognition were developed in order of increasing functionality and complexity. These models are based on information processing in the primate brain and are especially inspired by the theory of visual information processing along the two parallel processing pathways of the primate visual cortex. To capture the incremental, constraint-satisfaction style of processing in the visual system, interactive neural networks were used to implement the models. Results from eye-tracking studies on the relevant visual tasks, as well as our hypotheses regarding information processing in the primate visual system, were implemented in the models and tested with simulations.

As a first step, a model based on the ventral pathway was developed to recognize single objects. Through systematic testing, the structural and algorithmic parameters of this model were fine-tuned for optimal task performance. In the second step, the model was extended to include the dorsal pathway, which enables simulation of visual attention as an emergent phenomenon. The extended model was then investigated on visual search tasks. In the last step, we focussed on the recognition of occluded and overlapped objects. A couple of eye-tracking studies were conducted in this regard, and on the basis of their results we formulated hypotheses regarding information processing in the primate visual system. The models were then developed further along the lines of these hypotheses and simulated on occluded and overlapped object recognition tasks.

On the basis of the results and analysis of our simulations, we found that the generalization performance of interactive hierarchical networks improves when a small amount of Hebbian learning is added to otherwise purely error-driven learning. We also concluded that the size of the receptive fields in our networks is an important parameter for the generalization task and depends on the object of interest in the image. Our results show that networks using hard-coded feature extraction perform better than networks that use Hebbian learning to develop feature detectors. We have demonstrated the emergence of visual attention within an interactive network, as well as the role of context in the search task. Simulation results with occluded and overlapped objects support our extended interactive processing approach, which combines the interactive and top-down approaches, to the segmentation-recognition issue. Furthermore, the simulation behavior of our models is in line with known human behavior on similar tasks.
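The abstract does not state the learning rule itself. As a rough illustration of what adding a small amount of Hebbian learning to error-driven learning can look like, here is a minimal sketch for a single linear unit. Everything in it is an assumption made for illustration: the function name `mixed_weight_update`, the mixing parameter `k_hebb`, its default value, and the choice of a plain delta rule as the error-driven term are not taken from the thesis.

```python
def mixed_weight_update(w, x, y, target, lr=0.1, k_hebb=0.01):
    """Return updated weights for one unit (illustrative sketch only).

    w       -- list of current weights
    x       -- list of input activations
    y       -- the unit's current output activation
    target  -- the desired output activation
    lr      -- learning rate (hypothetical value)
    k_hebb  -- fraction of the update taken from the Hebbian term
               (hypothetical value; 0 gives pure error-driven learning)
    """
    new_w = []
    for w_i, x_i in zip(w, x):
        # Error-driven term: a delta rule stands in for whatever
        # error-driven algorithm the actual networks use.
        err_term = (target - y) * x_i
        # Hebbian term: when the unit is active, pull the weight
        # toward the input activation.
        hebb_term = y * (x_i - w_i)
        # Blend the two terms; a small k_hebb keeps learning mostly error-driven.
        dw = lr * (k_hebb * hebb_term + (1.0 - k_hebb) * err_term)
        new_w.append(w_i + dw)
    return new_w


if __name__ == "__main__":
    weights = [0.5, 0.5]
    inputs = [1.0, 0.0]
    print(mixed_weight_update(weights, inputs, y=0.5, target=1.0))
```

Setting `k_hebb` to 0 reduces the update to the pure error-driven case, so the effect of the Hebbian component can be examined by varying that single parameter.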

In general, the work in this thesis will improve the understanding and performance of biologically-based interactive networks for object recognition and provide a biologically plausible solution to the recognition of occluded and overlapped objects. Moreover, our models suggest possible underlying neural mechanisms and strategies behind biological object recognition.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2012, p. 200
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 1465
Keywords [en]
Biologically-Based Models, Object Recognition, Visual Attention, Interactive Neural Network, Occlusion, Overlapping
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:liu:diva-79336
ISBN: 978-91-7519-838-5 (print)
OAI: oai:DiVA.org:liu-79336
DiVA, id: diva2:541831
Public defence
2012-09-20, Visionen, Building B, Campus Valla, Linköpings universitet, Linköping, 13:15 (English)
Opponent
Supervisors
Available from: 2012-07-26. Created: 2012-07-10. Last updated: 2019-12-10. Bibliographically approved.

Open Access in DiVA

Biologically-Based Interactive Neural Network Models for Visual Attention and Object Recognition (9586 kB), 56675 downloads
File information
File name: FULLTEXT02.pdf; File size: 9586 kB; Checksum: SHA-512
fa46fefd9bc796ba237116b54c8f4951cebb7f088ce389d626fbbb8283b191648f5e1a2c502e93fcfcd0075da1e7eb05a60aacc1440b6d6336a9145a8fce6199
Type: fulltext; Mimetype: application/pdf
Cover (40 kB), 99 downloads
File information
File name: COVER01.pdf; File size: 40 kB; Checksum: SHA-512
4be51a679cd33aab4bdbab8862fd938ed1544de3f3bcf4770eaaefc87465b7f93a60fa614adc562e916eeee9f5c3bbdf69e7cba78927006afb7561d1d2227bc4
Type: cover; Mimetype: application/pdf
By author/editor
Saifullah, Mohammad