Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Adaptive Supervision Online Learning for Vision Based Autonomous Systems
Linköpings universitet, Institutionen för systemteknik, Datorseende. Linköpings universitet, Tekniska fakulteten.
2016 (Engelska)Doktorsavhandling, monografi (Övrigt vetenskapligt)
Abstract [en]

Driver assistance systems in modern cars now show clear steps towards autonomous driving and improvements are presented in a steady pace. The total number of sensors has also decreased from the vehicles of the initial DARPA challenge, more resembling a pile of sensors with a car underneath. Still, anyone driving a tele-operated toy using a video link is a demonstration that a single camera provides enough information about the surronding world.  

Most lane assist systems are developed for highway use and depend on visible lane markers. However, lane markers may not be visible due to snow or wear, and there are roads without lane markers. With a slightly different approach, autonomous road following can be obtained on almost any kind of road. Using realtime online machine learning, a human driver can demonstrate driving on a road type unknown to the system and after some training, the system can seamlessly take over. The demonstrator system presented in this work has shown capability of learning to follow different types of roads as well as learning to follow a person. The system is based solely on vision, mapping camera images directly to control signals.  

Such systems need the ability to handle multiple-hypothesis outputs as there may be several plausible options in similar situations. If there is an obstacle in the middle of the road, the obstacle can be avoided by going on either side. However the average action, going straight ahead, is not a viable option. Similarly, at an intersection, the system should follow one road, not the average of all roads.  

To this end, an online machine learning framework is presented where inputs and outputs are represented using the channel representation. The learning system is structurally simple and computationally light, based on neuropsychological ideas presented by Donald Hebb over 60 years ago. Nonetheless the system has shown a cabability to learn advanced tasks. Furthermore, the structure of the system permits a statistical interpretation where a non-parametric representation of the joint distribution of input and output is generated. Prediction generates the conditional distribution of the output, given the input.  

The statistical interpretation motivates the introduction of priors. In cases with multiple options, such as at intersections, a prior can select one mode in the multimodal distribution of possible actions. In addition to the ability to learn from demonstration, a possibility for immediate reinforcement feedback is presented. This allows for a system where the teacher can choose the most appropriate way of training the system, at any time and at her own discretion.  

The theoretical contributions include a deeper analysis of the channel representation. A geometrical analysis illustrates the cause of decoding bias commonly present in neurologically inspired representations, and measures to counteract it. Confidence values are analyzed and interpreted as evidence and coherence. Further, the use of the truncated cosine basis function is motivated.  

Finally, a selection of applications is presented, such as autonomous road following by online learning and head pose estimation. A method founded on the same basic principles is used for visual tracking, where the probabilistic representation of target pixel values allows for changes in target appearance.

Ort, förlag, år, upplaga, sidor
Linköping: Linköping University Electronic Press, 2016. , s. 176
Serie
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 1749
Nationell ämneskategori
Datorseende och robotik (autonoma system)
Identifikatorer
URN: urn:nbn:se:liu:diva-125916DOI: 10.3384/diss.diva-125916ISBN: 978-91-7685-815-8 (tryckt)OAI: oai:DiVA.org:liu-125916DiVA, id: diva2:916645
Disputation
2016-05-20, Visionen, B-building, Campus Valla, Linköping, 09:00 (Engelska)
Opponent
Handledare
Forskningsfinansiär
EU, FP7, Sjunde ramprogrammetVetenskapsrådetTillgänglig från: 2016-04-19 Skapad: 2016-03-08 Senast uppdaterad: 2018-01-10Bibliografiskt granskad

Open Access i DiVA

fulltext(5398 kB)581 nedladdningar
Filinformation
Filnamn FULLTEXT02.pdfFilstorlek 5398 kBChecksumma SHA-512
50bbd88b0dc56e8820cb50c9a8a25e466217b795fa543b1d61c1a779c3505f7761c978d3f2011dbffc45503c9854cd8bf23837e1d219e494e2d9d11c34314b7a
Typ fulltextMimetyp application/pdf
omslag(361 kB)50 nedladdningar
Filinformation
Filnamn COVER01.pdfFilstorlek 361 kBChecksumma SHA-512
5af1613f62e4591b02904c7365896feaa55905da035f67fb1c49a7919e46e149498915ab43292a95ffe650b46ade3eb44f88f1901c9477e872dbf2e6bac6e5d3
Typ coverMimetyp application/pdf
Supplementary files with videos(62 kB)26 nedladdningar
Filinformation
Filnamn ATTACHMENT03.pdfFilstorlek 62 kBChecksumma SHA-512
1eed3c69434a9e2694fd01e95b83ed0cb495866b0f54ef2daad7533a7cdba5411e2534286560898cea5856695c4ab636fa45d19691de5f8a46aa5757dadcdda7
Typ attachmentMimetyp application/pdf
Channel vector curves, Four channels, 3D space(5573 kB)132 nedladdningar
Filinformation
Filnamn MOVIE01.mp4Filstorlek 5573 kBChecksumma SHA-512
ddb6bd937d01e46d234c344753ba0e481d038440a251ddc4d6e04d73827b163f04e6140fe5d0c8a48e5676ec2e8a2d0cb3e8f518126edff0722971c6f48a4087
Typ movieMimetyp video/mp4
Channel vector curves, Five channels, 4D space(3862 kB)96 nedladdningar
Filinformation
Filnamn MOVIE02.mp4Filstorlek 3862 kBChecksumma SHA-512
ac83d61c0fbfdda0059b5dd116027a7d7efa4f3a80706c46084171f26a78ba12aaf1de986b116d395e266bcf471dc5ce72c3ad437858eb444dfd1535aad313f9
Typ movieMimetyp video/mp4
Channel vector curves, Seven channels, 6D space(6091 kB)90 nedladdningar
Filinformation
Filnamn MOVIE03.mp4Filstorlek 6091 kBChecksumma SHA-512
59715ae071d2c0fc0c71519f5c5f020f831f25f912903b5b6226d2920a1bc023e234589f00147fc05e38f3bd4cb4f8cdb03db802527f8f2679eb7484c4379604
Typ movieMimetyp video/mp4
Cone, Three channels, 3D space(11233 kB)77 nedladdningar
Filinformation
Filnamn MOVIE04.mp4Filstorlek 11233 kBChecksumma SHA-512
ae4aafb4563b5cb079c8e526c274682306db669fea421b38b1052448bb848d876933224eecbd0000605feebceea6b3d095737d7e8c968e4728c3a4758e257b66
Typ movieMimetyp video/mp4
Cone, Four channels, 4D space(21066 kB)92 nedladdningar
Filinformation
Filnamn MOVIE05.mp4Filstorlek 21066 kBChecksumma SHA-512
5339c565420dd8109986651711362a1063a56ea774e120cb0044c1cb0936888c8a590ceaedfde1c8c0987edf1f59b7d95db093849e7570534833491d1d5212c2
Typ movieMimetyp video/mp4
Cone, Seven channels, 7D space(37349 kB)82 nedladdningar
Filinformation
Filnamn MOVIE06.mp4Filstorlek 37349 kBChecksumma SHA-512
37e0ab435981553023739a4e494ab27e6c7a59eb4b44ed915838e8c21a91b1cea331c8e7f6c61f77d1e79e466acc3d3072ee1849e348cff44c03d39303b1304e
Typ movieMimetyp video/mp4
Associative learning illustration(2842 kB)91 nedladdningar
Filinformation
Filnamn MOVIE07.mp4Filstorlek 2842 kBChecksumma SHA-512
857a3df7676ccd43674592f5937d50bb95e85f0980ddb98809c58c931857e948d9f85bbc26286efeab7e31ca66c1b750531583502936ea9f62fcfc6f56d85375
Typ movieMimetyp video/mp4
Decoding of five pixels in a sequence(26451 kB)107 nedladdningar
Filinformation
Filnamn MOVIE17.mp4Filstorlek 26451 kBChecksumma SHA-512
2b1a6a5a6eee7e4ded10dddb4fa736c01b35dbecfbb12af490ebe2dad0f4a3f469a4b34133dcc8cf2aca10d9c15aec93897cae9a427faccdc80b940fe8d362b4
Typ movieMimetyp video/mp4
Sequence with translating cameraman image(742 kB)69 nedladdningar
Filinformation
Filnamn MOVIE14.mp4Filstorlek 742 kBChecksumma SHA-512
e326c2316ef99e50f5af6d6a565741078f4c9e40bc2075210e2bca90d06ec7cc6f43f618704b18b496d5fbb5bb264cc28728e9e5a714c1c685fe92caaa46aecd
Typ movieMimetyp video/mp4
Video from UAV(8388 kB)88 nedladdningar
Filinformation
Filnamn MOVIE15.mp4Filstorlek 8388 kBChecksumma SHA-512
f77d3b93b71089dcee36a0e1b76b545b1b0d5ada2e94400fc60bb6bab7ef08e4061d67c59512260902c28a08ba53737c168a8f60932cb633d650d6ee7c67c508
Typ movieMimetyp video/mp4
Original video from the UAV(16423 kB)78 nedladdningar
Filinformation
Filnamn MOVIE16.mp4Filstorlek 16423 kBChecksumma SHA-512
7bc1c4b91b77b162c9cde2a393626824cd2a988c6a103b7a3210901bf0093262486497eb9a88c4fd41c7d9152cef34878b9a10ab2fd5d53c4fe1e390e13a82d0
Typ movieMimetyp video/mp4
Autonomous Road Following Application, Use case demo(270078 kB)129 nedladdningar
Filinformation
Filnamn MOVIE12.mp4Filstorlek 270078 kBChecksumma SHA-512
35fb150aa7eeafa6238ef917b4207cd437ae106cfadd63c9068650ddbe94abbc0d368a2e8133b7483e01e69c005992d3ee30f596e15bb4a9aea489e87d5e63a2
Typ movieMimetyp video/mp4
Autonomous Road Following Application, Demonstrator system(159031 kB)184 nedladdningar
Filinformation
Filnamn MOVIE13.mp4Filstorlek 159031 kBChecksumma SHA-512
e738f3ee1e2f102da7db238a4875052b8e1676b7b85ba0a4cadb75437e39b397b8126d1a46c0f12b02f838379fc535af3945152fdfcbe62e720eab616841df29
Typ movieMimetyp video/mp4

Övriga länkar

Förlagets fulltext

Sök vidare i DiVA

Av författaren/redaktören
Öfjäll, Kristoffer
Av organisationen
DatorseendeTekniska fakulteten
Datorseende och robotik (autonoma system)

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 587 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 5817 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf