Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
One Shot Object Detection: For Tracking Purposes
KTH, School of Information and Communication Technology (ICT).
2017 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

One of the things augmented reality depends on is object tracking, which is a problem classically found in cinematography and security. However, the algorithms designed for the classical application are often too expensive computationally or too complex to run on simpler mobile hardware. One of the methods to do object tracking is with a trained neural network, this has already led to great results but is unfortunately still running into some of the same problems as the classical algorithms. For this reason a neural network designed specifically for object tracking on mobile hardware needs to be developed. This thesis will propose two di erent neural networks designed for object tracking on mobile hardware. Both are based on a siamese network structure and methods to improve their accuracy using filtering are also introduced. The first network is a modified version of “CNN architecture for geometric matching” that utilizes an a ne regression to perform object tracking. This network was shown to underperform in the MOT benchmark as-well as the VOT benchmark and therefore not further developed. The second network is an object detector based on “SqueezeDet” in a siamese network structure utilizing the performance optimized layers of “MobileNets”. The accuracy of the object detector network is shown to be competitive in the VOT benchmark, placing at the 16th place compared to trackers from the 2016 challenge. It was also shown to run in real-time on mobile hardware. Thus the one shot object detection network used for a tracking application can improve the experience of augmented reality applications on mobile hardware. 

Place, publisher, year, edition, pages
2017. , p. 66
Series
TRITA-ICT-EX ; 2017:141
Keyword [en]
Object tracking, Deep learning, Siamese neural network, Affine regression network, One shot learning, Object detector, PID controller
National Category
Computer Sciences Embedded Systems
Identifiers
URN: urn:nbn:se:kth:diva-217252OAI: oai:DiVA.org:kth-217252DiVA, id: diva2:1161376
External cooperation
Viorama GmBh.
Subject / course
Computer Science
Educational program
Master of Science - Embedded Systems
Presentation
2017-09-06, Alonzo, Isafjordsgatan 22, Kista, 10:00 (English)
Supervisors
Examiners
Available from: 2017-12-10 Created: 2017-11-29 Last updated: 2018-01-13Bibliographically approved

Open Access in DiVA

fulltext(10156 kB)226 downloads
File information
File name FULLTEXT02.pdfFile size 10156 kBChecksum SHA-512
acfd7c65896fbcbdfb5c03779f4d12aeef5231a4f26c8252719903dc79177a9d623ab0f7b0883b897465591d6be24190e1ebb955e6d28ebf8bccfdbc3cf577d4
Type fulltextMimetype application/pdf

By organisation
School of Information and Communication Technology (ICT)
Computer SciencesEmbedded Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 226 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 766 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf