Change search
ReferencesLink to record
Permanent link

Direct link
Channel Coding for Joint Colour and Depth Segmentation
Linköping University, Department of Electrical Engineering, Computer Vision. Linköping University, The Institute of Technology.
Linköping University, Department of Electrical Engineering, Computer Vision. Linköping University, The Institute of Technology.ORCID iD: 0000-0002-6096-3648
Linköping University, Department of Electrical Engineering, Computer Vision. Linköping University, The Institute of Technology.ORCID iD: 0000-0002-5698-5983
Institut de Robotica i Informatica Industrial (CSIC-UPC) Llorens i Artigas 4-6, 08028 Barcelona, Spain.
2011 (English)In: Proceedings of  Pattern Recognition 33rd DAGM Symposium, Frankfurt/Main, Germany, August 31 - September 2 / [ed] Rudolf Mester and Michael Felsberg, SpringerLink , 2011, 306-315 p.Conference paper (Refereed)
Abstract [en]

Segmentation is an important preprocessing step in many applications. Compared to colour segmentation, fusion of colour and depth greatly improves the segmentation result. Such a fusion is easy to do by stacking measurements in different value dimensions, but there are better ways. In this paper we perform fusion using the channel representation, and demonstrate how a state-of-the-art segmentation algorithm can be modified to use channel values as inputs. We evaluate segmentation results on data collected using the Microsoft Kinect peripheral for Xbox 360, using the superparamagnetic clustering algorithm. Our experiments show that depth gradients are more useful than depth values for segmentation, and that channel coding both colour and depth gradients makes tuned parameter settings generalise better to novel images.

Place, publisher, year, edition, pages
SpringerLink , 2011. 306-315 p.
, Lecture Notes in Computer Science, ISSN 0302-9743 ; 6835
National Category
Engineering and Technology
URN: urn:nbn:se:liu:diva-70708DOI: 10.1007/978-3-642-23123-0_31ISBN: 978-3-642-23122-3OAI: diva2:441260
33rd DAGM Symposium, Frankfurt 29 August - 2 September 2011
Available from: 2011-09-15 Created: 2011-09-15 Last updated: 2016-05-04Bibliographically approved
In thesis
1. Components of Embodied Visual Object Recognition: Object Perception and Learning on a Robotic Platform
Open this publication in new window or tab >>Components of Embodied Visual Object Recognition: Object Perception and Learning on a Robotic Platform
2013 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Object recognition is a skill we as humans often take for granted. Due to our formidable object learning, recognition and generalisation skills, it is sometimes hard to see the multitude of obstacles that need to be overcome in order to replicate this skill in an artificial system. Object recognition is also one of the classical areas of computer vision, and many ways of approaching the problem have been proposed. Recently, visually capable robots and autonomous vehicles have increased the focus on embodied recognition systems and active visual search. These applications demand that systems can learn and adapt to their surroundings, and arrive at decisions in a reasonable amount of time, while maintaining high object recognition performance. Active visual search also means that mechanisms for attention and gaze control are integral to the object recognition procedure. This thesis describes work done on the components necessary for creating an embodied recognition system, specifically in the areas of decision uncertainty estimation, object segmentation from multiple cues, adaptation of stereo vision to a specific platform and setting, and the implementation of the system itself. Contributions include the evaluation of methods and measures for predicting the potential uncertainty reduction that can be obtained from additional views of an object, allowing for adaptive target observations. Also, in order to separate a specific object from other parts of a scene, it is often necessary to combine multiple cues such as colour and depth in order to obtain satisfactory results. Therefore, a method for combining these using channel coding has been evaluated. Finally, in order to make use of three-dimensional spatial structure in recognition, a novel stereo vision algorithm extension along with a framework for automatic stereo tuning have also been investigated. All of these components have been tested and evaluated on a purpose-built embodied recognition platform known as Eddie the Embodied.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2013. 64 p.
Linköping Studies in Science and Technology. Thesis, ISSN 0280-7971 ; 1607
computer vision, object recognition, stereo vision, classification
National Category
Signal Processing Computer Vision and Robotics (Autonomous Systems)
urn:nbn:se:liu:diva-93812 (URN)978-91-7519-564-3 (print) (ISBN)
2013-08-16, Visionen, Hus B, Campus Valla, Linköpings universitet, Linköping, 13:15 (English)
Embodied Visual Object Recognition
Swedish Research Council
Available from: 2013-07-09 Created: 2013-06-10 Last updated: 2015-12-10Bibliographically approved

Open Access in DiVA

fulltext(3199 kB)780 downloads
File information
File name FULLTEXT01.pdfFile size 3199 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Wallenberg, MarcusFelsberg, MichaelForssén, Per-Erik
By organisation
Computer VisionThe Institute of Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 780 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 446 hits
ReferencesLink to record
Permanent link

Direct link