Change search
ReferencesLink to record
Permanent link

Direct link
Improving Image Classification Performance using Joint Feature Selection
KTH, School of Computer Science and Communication (CSC), Computer Vision and Active Perception, CVAP.
2014 (English)Doctoral thesis, monograph (Other academic)
Abstract [en]

In this thesis, we focus on the problem of image classification and investigate how its performance can be systematically improved. Improving the performance of different computer vision methods has been the subject of many studies. While different studies take different approaches to achieve this improvement, in this thesis we address this problem by investigating the relevance of the statistics collected from the image.

We propose a framework for gradually improving the quality of an already existing image descriptor. In our studies, we employ a descriptor which is composed the response of a series of discriminative components for summarizing each image. As we will show, this descriptor has an ideal form in which all categories become linearly separable. While, reaching this form is not possible, we will argue how by replacing a small fraction of these components, it is possible to obtain a descriptor which is, on average, closer to this ideal form. To do so, we initially identify which components do not contribute to the quality of the descriptor and replace them with more robust components. As we will show, this replacement has a positive effect on the quality of the descriptor.

While there are many ways of obtaining more robust components, we introduce a joint feature selection problem to obtain image features that retains class discriminative properties while simultaneously generalising between within class variations. Our approach is based on the concept of a joint feature where several small features are combined in a spatial structure. The proposed framework automatically learns the structure of the joint constellations in a class dependent manner improving the generalisation and discrimination capabilities of the local descriptor while still retaining a low-dimensional representations.

The joint feature selection problem discussed in this thesis belongs to a specific class of latent variable models that assumes each labeled sample is associated with a set of different features, with no prior knowledge of which feature is the most relevant feature to be used. Deformable-Part Models (DPM) can be seen as good examples of such models. These models are usually considered to be expensive to train and very sensitive to the initialization. Here, we focus on the learning of such models by introducing a topological framework and show how it is possible to both reduce the learning complexity and produce more robust decision boundaries. We will also argue how our framework can be used for producing robust decision boundaries without exploiting the dataset bias or relying on accurate annotations.

To examine the hypothesis of this thesis, we evaluate different parts of our framework on several challenging datasets and demonstrate how our framework is capable of gradually improving the performance of image classification by collecting more robust statistics from the image and improving the quality of the descriptor.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2014. , 135 p.
TRITA-CSC-A, ISSN 1653-5723 ; 2014:08
Keyword [en]
Image Classification, Latent Variable Models
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computer Science
URN: urn:nbn:se:kth:diva-144896ISBN: 978-91-7595-139-3OAI: diva2:715228
Public defence
2014-05-21, F3, Lindstedtsv 26, KTH, Stockholm, 09:30 (English)

QC 20140506

Available from: 2014-05-06 Created: 2014-05-02 Last updated: 2014-05-06Bibliographically approved

Open Access in DiVA

heydar_maboudi_thesis.pdf(8168 kB)236 downloads
File information
File name FULLTEXT01.pdfFile size 8168 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Maboudi Afkham, Heydar
By organisation
Computer Vision and Active Perception, CVAP
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 236 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 216 hits
ReferencesLink to record
Permanent link

Direct link