Digitala Vetenskapliga Arkivet

Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Cognition Rehearsed: Recognition and Reproduction of Demonstrated Behavior
Umeå universitet, Institutionen för datavetenskap.ORCID-id: 0000-0002-6568-9342
2012 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Resurstyp
Text
Alternativ tittel
Robotövningar : Igenkänning och återgivande av demonstrerat beteende (svensk)
Abstract [en]

The work presented in this dissertation investigates techniques for robot Learning from Demonstration (LFD). LFD is a well established approach where the robot is to learn from a set of demonstrations. The dissertation focuses on LFD where a human teacher demonstrates a behavior by controlling the robot via teleoperation. After demonstration, the robot should be able to reproduce the demonstrated behavior under varying conditions. In particular, the dissertation investigates techniques where previous behavioral knowledge is used as bias for generalization of demonstrations.

The primary contribution of this work is the development and evaluation of a semi-reactive approach to LFD called Predictive Sequence Learning (PSL). PSL has many interesting properties applied as a learning algorithm for robots. Few assumptions are introduced and little task-specific configuration is needed. PSL can be seen as a variable-order Markov model that progressively builds up the ability to predict or simulate future sensory-motor events, given a history of past events. The knowledge base generated during learning can be used to control the robot, such that the demonstrated behavior is reproduced. The same knowledge base can also be used to recognize an on-going behavior by comparing predicted sensor states with actual observations. Behavior recognition is an important part of LFD, both as a way to communicate with the human user and as a technique that allows the robot to use previous knowledge as parts of new, more complex, controllers.

In addition to the work on PSL, this dissertation provides a broad discussion on representation, recognition, and learning of robot behavior. LFD-related concepts such as demonstration, repetition, goal, and behavior are defined and analyzed, with focus on how bias is introduced by the use of behavior primitives. This analysis results in a formalism where LFD is described as transitions between information spaces. Assuming that the behavior recognition problem is partly solved, ways to deal with remaining ambiguities in the interpretation of a demonstration are proposed.

The evaluation of PSL shows that the algorithm can efficiently learn and reproduce simple behaviors. The algorithm is able to generalize to previously unseen situations while maintaining the reactive properties of the system. As the complexity of the demonstrated behavior increases, knowledge of one part of the behavior sometimes interferes with knowledge of another parts. As a result, different situations with similar sensory-motor interactions are sometimes confused and the robot fails to reproduce the behavior.

One way to handle these issues is to introduce a context layer that can support PSL by providing bias for predictions. Parts of the knowledge base that appear to fit the present context are highlighted, while other parts are inhibited. Which context should be active is continually re-evaluated using behavior recognition. This technique takes inspiration from several neurocomputational models that describe parts of the human brain as a hierarchical prediction system. With behavior recognition active, continually selecting the most suitable context for the present situation, the problem of knowledge interference is significantly reduced and the robot can successfully reproduce also more complex behaviors.

sted, utgiver, år, opplag, sider
Umeå: Department of Computing Science, Umeå University , 2012. , s. 30
Serie
Report / UMINF, ISSN 0348-0542 ; 11.16
Emneord [en]
Behavior Recognition, Learning and Adaptive Systems, Learning from Demonstration, Neurocomputational Modeling, Robot Learning
HSV kategori
Identifikatorer
URN: urn:nbn:se:his:diva-12139ISBN: 978-91-7459-349-5 (tryckt)OAI: oai:DiVA.org:his-12139DiVA, id: diva2:1076501
Disputas
2012-01-26, S1031, Norra Beteendevetarhuset, Umeå Universitet, 13:15 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2017-04-18 Laget: 2017-02-22 Sist oppdatert: 2023-05-03bibliografisk kontrollert
Delarbeid
1. Cognitive Perspectives on Robot Behavior
Åpne denne publikasjonen i ny fane eller vindu >>Cognitive Perspectives on Robot Behavior
2010 (engelsk)Inngår i: Proceedings of the 2nd International Conference on Agents and Artificial Intelligence: Volume 2 / [ed] Joaquim Filipe, Ana Fred and Bernadette Sharp, SciTePress, 2010, s. 373-382Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

A growing body of research within the field of intelligent robotics argues for a view of intelligence drastically different from classical artificial intelligence and cognitive science. The holistic and embodied ideas expressed by this research promote the view that intelligence is an emergent phenomenon. Similar perspectives, where numerous interactions within the system lead to emergent properties and cognitive abilities beyond that of the individual parts, can be found within many scientific fields. With the goal of understanding how behavior may be represented in robots, the present review tries to grasp what this notion of emergence really means and compare it with a selection of theories developed for analysis of human cognition, including the extended mind, distributed cognition and situated action. These theories reveal a view of intelligence where common notions of objects, goals, language and reasoning have to be rethought. A view where behavior, as well as the agent as such, is defined by the observer rather than given by their nature. Structures in the environment emerge by interaction rather than recognized. In such a view, the fundamental question is how emergent systems appear and develop, and how they may be controlled.

sted, utgiver, år, opplag, sider
SciTePress, 2010
Emneord
Behavior based control, Cognitive artificial intelligence, Distributed cognition, Ontology, Reactive robotics, Sensory-motor coordination, Situated action
HSV kategori
Forskningsprogram
data- och systemvetenskap
Identifikatorer
urn:nbn:se:his:diva-12141 (URN)10.5220/0002782103730382 (DOI)978-989-674-022-1 (ISBN)
Konferanse
2nd International Conference on Agents and Artificial Intelligence (ICAART 2010), Valencia, Spain, January 22-24, 2010
Tilgjengelig fra: 2017-02-22 Laget: 2017-02-22 Sist oppdatert: 2018-01-13bibliografisk kontrollert
2. Behavior recognition for segmentation of demonstrated tasks
Åpne denne publikasjonen i ny fane eller vindu >>Behavior recognition for segmentation of demonstrated tasks
2008 (engelsk)Inngår i: IEEE SMC International Conference on Distributed Human-Machine Systems (DHMS), 2008, s. 228-234Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

One common approach to the robot learning technique Learning From Demonstration, is to use a set of pre-programmed skills as building blocks for more complex tasks. One important part of this approach is recognition of these skills in a demonstration comprising a stream of sensor and actuator data. In this paper, three novel techniques for behavior recognition are presented and compared. The first technique is function-oriented and compares actions for similar inputs. The second technique is based on auto-associative neural networks and compares reconstruction errors in sensory-motor space. The third technique is based on S-Learning and compares sequences of patterns in sensory-motor space. All three techniques compute an activity level which can be seen as an alternative to a pure classification approach. Performed tests show how the former approach allows a more informative interpretation of a demonstration, by not determining "correct" behaviors but rather a number of alternative interpretations.

Emneord
Learning from demonstration, Segmentation, Generalization, Sequence Learning, Auto-associative neural networks, S-Learning
HSV kategori
Identifikatorer
urn:nbn:se:his:diva-12145 (URN)978-80-01-04027-0 (ISBN)
Konferanse
IEEE SMC International Conference on Distributed Human-Machine Systems (DHMS)
Tilgjengelig fra: 2008-03-19 Laget: 2017-02-22 Sist oppdatert: 2018-01-13bibliografisk kontrollert
3. A formalism for learning from demonstration
Åpne denne publikasjonen i ny fane eller vindu >>A formalism for learning from demonstration
2010 (engelsk)Inngår i: Paladyn - Journal of Behavioral Robotics, ISSN 2080-9778, E-ISSN 2081-4836, Vol. 1, nr 1, s. 1-13Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

The paper describes and formalizes the concepts and assumptions involved in Learning from Demonstration (LFD), a common learning technique used in robotics. LFD-related concepts like goal, generalization, and repetition are here defined, analyzed, and put into context. Robot behaviors are described in terms of trajectories through information spaces and learning is formulated as mappings between some of these spaces. Finally, behavior primitives are introduced as one example of good bias in learning, dividing the learning process into the three stages of behavior segmentation, behavior recognition, and behavior coordination. The formalism is exemplified through a sequence learning task where a robot equipped with a gripper arm is to move objects to specific areas. The introduced concepts are illustrated with special focus on how bias of various kinds can be used to enable learning from a single demonstration, and how ambiguities in demonstrations can be identified and handled.

sted, utgiver, år, opplag, sider
De Gruyter Open, 2010
Emneord
Learning from demonstration, ambiguities, behavior, bias, generalization, robot learning
HSV kategori
Identifikatorer
urn:nbn:se:his:diva-12143 (URN)10.2478/s13230-010-0001-5 (DOI)
Tilgjengelig fra: 2017-02-22 Laget: 2017-02-22 Sist oppdatert: 2018-01-13bibliografisk kontrollert
4. Predictive learning from demonstration
Åpne denne publikasjonen i ny fane eller vindu >>Predictive learning from demonstration
2011 (engelsk)Inngår i: Agents and Artificial Intelligence: Second International Conference, ICAART 2010, Valencia, Spain, January 22-24, 2010. Revised Selected Papers / [ed] Joaquim Filipe; Ana Fred; Bernadette Sharp, Berlin: Springer Berlin/Heidelberg, 2011, 1, s. 186-200Kapittel i bok, del av antologi (Fagfellevurdert)
Abstract [en]

A model-free learning algorithm called Predictive Sequence Learning (PSL) is presented and evaluated in a robot Learning from Demonstration (LFD) setting. PSL is inspired by several functional models of the brain. It constructs sequences of predictable sensory-motor patterns, without relying on predefined higher-level concepts. The algorithm is demonstrated on a Khepera II robot in four different tasks. During training, PSL generates a hypothesis library from demonstrated data. The library is then used to control the robot by continually predicting the next action, based on the sequence of passed sensor and motor events. In this way, the robot reproduces the demonstrated behavior. PSL is able to successfully learn and repeat three elementary tasks, but is unable to repeat a fourth, composed behavior. The results indicate that PSL is suitable for learning problems up to a certain complexity, while higher level coordination is required for learning more complex behaviors.

sted, utgiver, år, opplag, sider
Berlin: Springer Berlin/Heidelberg, 2011 Opplag: 1
Serie
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 129
HSV kategori
Identifikatorer
urn:nbn:se:his:diva-12148 (URN)10.1007/978-3-642-19890-8_14 (DOI)000302748200014 ()2-s2.0-84879489110 (Scopus ID)978-3-642-19889-2 (ISBN)978-3-642-19890-8 (ISBN)
Konferanse
Second International Conference, ICAART 2010, Valencia, Spain, January 22-24, 2010
Tilgjengelig fra: 2017-02-22 Laget: 2017-02-22 Sist oppdatert: 2023-05-03bibliografisk kontrollert
5. Behavior recognition for learning from demonstration
Åpne denne publikasjonen i ny fane eller vindu >>Behavior recognition for learning from demonstration
2010 (engelsk)Inngår i: 2010 IEEE International Conference on Robotics and Automation / [ed] Nancy M. Amato et. al, 2010, s. 866-872Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Two methods for behavior recognition are presented and evaluated. Both methods are based on the dynamic temporal difference algorithm Predictive Sequence Learning (PSL) which has previously been proposed as a learning algorithm for robot control. One strength of the proposed recognition methods is that the model PSL builds to recognize behaviors is identical to that used for control, implying that the controller (inverse model) and the recognition algorithm (forward model) can be implemented as two aspects of the same model. The two proposed methods, PSLE-Comparison and PSLH-Comparison, are evaluated in a Learning from Demonstration setting, where each algorithm should recognize a known skill in a demonstration performed via teleoperation. PSLH-Comparison produced the smallest recognition error. The results indicate that PSLH-Comparison could be a suitable algorithm for integration in a hierarchical control system consistent with recent models of human perception and motor control.

Serie
IEEE International Conference on Robotics and Automation ICRA, ISSN 1050-4729
Emneord
learning and adaptive systems, neurorobotics, autonomous agents
HSV kategori
Forskningsprogram
data- och systemvetenskap
Identifikatorer
urn:nbn:se:his:diva-12149 (URN)10.1109/ROBOT.2010.5509912 (DOI)2-s2.0-77955785914 (Scopus ID)978-1-4244-5040-4 (ISBN)978-1-4244-5038-1 (ISBN)
Konferanse
IEEE International Conference on Robotics and Automation (ICRA), Anchorage, Alaska, USA, May 3-7, 2010
Tilgjengelig fra: 2017-02-22 Laget: 2017-02-22 Sist oppdatert: 2023-05-03bibliografisk kontrollert
6. Robot learning from demonstration using predictive sequence learning
Åpne denne publikasjonen i ny fane eller vindu >>Robot learning from demonstration using predictive sequence learning
2012 (engelsk)Inngår i: Robotic systems: applications, control and programming / [ed] Ashish Dutta, Kanpur, India: IN-TECH , 2012, s. 235-250Kapittel i bok, del av antologi (Fagfellevurdert)
Abstract [en]

In this chapter, the prediction algorithm Predictive Sequence Learning (PSL) is presented and evaluated in a robot Learning from Demonstration (LFD) setting. PSL generates hypotheses from a sequence of sensory-motor events. Generated hypotheses can be used as a semi-reactive controller for robots. PSL has previously been used as a method for LFD, but suffered from combinatorial explosion when applied to data with many dimensions, such as high dimensional sensor and motor data. A new version of PSL, referred to as Fuzzy Predictive Sequence Learning (FPSL), is presented and evaluated in this chapter. FPSL is implemented as a Fuzzy Logic rule base and works on a continuous state space, in contrast to the discrete state space used in the original design of PSL. The evaluation of FPSL shows a significant performance improvement in comparison to the discrete version of the algorithm. Applied to an LFD task in a simulated apartment environment, the robot is able to learn to navigate to a specific location, starting from an unknown position in the apartment.

sted, utgiver, år, opplag, sider
Kanpur, India: IN-TECH, 2012
HSV kategori
Forskningsprogram
data- och systemvetenskap
Identifikatorer
urn:nbn:se:his:diva-12146 (URN)10.5772/26165 (DOI)978-953-307-941-7 (ISBN)
Tilgjengelig fra: 2012-01-02 Laget: 2017-02-22 Sist oppdatert: 2023-05-03bibliografisk kontrollert
7. Simultaneous control and recognition of demonstrated behavior
Åpne denne publikasjonen i ny fane eller vindu >>Simultaneous control and recognition of demonstrated behavior
2011 (engelsk)Rapport (Annet vitenskapelig)
Abstract [en]

A method for Learning from Demonstration (LFD) is presented and evaluated on a simulated Robosoft Kompai robot. The presented algorithm, called Predictive Sequence Learning (PSL), builds fuzzy rules describing temporal relations between sensory-motor events recorded while a human operator is tele-operating the robot. The generated rule base can be used to control the robot and to predict expected sensor events in response to executed actions. The rule base can be trained under different contexts, represented as fuzzy sets. In the present work, contexts are used to represent different behaviors. Several behaviors can in this way be stored in the same rule base and partly share information. The context that best matches present circumstances can be identified using the predictive model and the robot can in this way automatically identify the most suitable behavior for precent circumstances. The performance of PSL as a method for LFD is evaluated with, and without, contextual information. The results indicate that PSL without contexts can learn and reproduce simple behaviors. The system also successfully identifies the most suitable context in almost all test cases. The robot's ability to reproduce more complex behaviors, with partly overlapping and conflicting information, significantly increases with the use of contexts. The results support a further development of PSL as a component of a dynamic hierarchical system performing control and predictions on several levels of abstraction. 

sted, utgiver, år, opplag, sider
Umeå: Umeå University, Department of Computing Science, 2011. s. 22
Serie
Report / UMINF, ISSN 0348-0542 ; 15
Emneord
Behavior Recognition, Context Dependent, Fuzzy Logic, Learning and Adaptive Systems, Learning from Demonstration
HSV kategori
Identifikatorer
urn:nbn:se:his:diva-12147 (URN)
Tilgjengelig fra: 2017-02-22 Laget: 2017-02-22 Sist oppdatert: 2023-05-03bibliografisk kontrollert

Open Access i DiVA

fulltekst(494 kB)137 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 494 kBChecksum SHA-512
074287820f9a6ac6daef5635aff823222768afeb8bed0c33dd8b4365915ae21920ad60fe1b1e7b52efcdc398a8c17dfe6ebba892c5e89eb99ab496819bb62a22
Type fulltextMimetype application/pdf

Andre lenker

http://www.cognitionreversed.com

Søk i DiVA

Av forfatter/redaktør
Billing, Erik

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 137 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 356 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf