Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Data Consistency Approach to Model Validation
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control.ORCID iD: 0000-0002-7957-3711
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control.
(English)In: Article in journal (Refereed) Submitted
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:uu:diva-357607OAI: oai:DiVA.org:uu-357607DiVA, id: diva2:1239823
Funder
Swedish Foundation for Strategic Research Swedish Research CouncilAvailable from: 2018-08-17 Created: 2018-08-17 Last updated: 2018-10-01
In thesis
1. Machine learning with state-space models, Gaussian processes and Monte Carlo methods
Open this publication in new window or tab >>Machine learning with state-space models, Gaussian processes and Monte Carlo methods
2018 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Numbers are present everywhere, and when they are collected and recorded we refer to them as data. Machine learning is the science of learning mathematical models from data. Such models, once learned from data, can be used to draw conclusions, understand behavior, predict future evolution, and make decisions. This thesis is mainly concerned with two particular statistical models for this purpose: the state-space model and the Gaussian process model, as well as a combination thereof. To learn these models from data, Monte Carlo methods are used, and in particular sequential Monte Carlo (SMC) or particle filters.

The thesis starts with an introductory background on state-space models, Gaussian processes and Monte Carlo methods. The main contribution lies in seven scientific papers. Several contributions are made on the topic of learning nonlinear state-space models with the use of SMC. An existing SMC method is tailored for learning in state-space models with little or no measurement noise. The SMC-based method particle Gibbs with ancestor sampling (PGAS) is used for learning an approximation of the Gaussian process state-space model. PGAS is also combined with stochastic approximation expectation maximization (EM). This  method, which we refer to as particle stochastic approximation EM, is a general method for learning parameters in nonlinear state-space models. It is later applied to the particular problem of maximum likelihood estimation in jump Markov linear models. An alternative and non-standard approach for how to use SMC to estimate parameters in nonlinear state-space models is also presented.

There are also two contributions not related to learning state-space models. One is how SMC can be used also for learning hyperparameters in Gaussian process regression models. The second is a method for assessing consistency between model and data. By using the model to simulate new data, and compare how similar that data is to the observed one, a general criterion is obtained which follows directly from the model specification. All methods are implemented and illustrated, and several are also applied to various real-world examples.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2018. p. 74
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 1709
Keywords
Machine learning, State-space models, Gaussian processes
National Category
Signal Processing Probability Theory and Statistics
Research subject
Electrical Engineering with specialization in Automatic Control
Identifiers
urn:nbn:se:uu:diva-357611 (URN)978-91-513-0417-5 (ISBN)
Public defence
2018-10-12, ITC 2446, Lägerhyddsvägen 2, Uppsala, 10:15 (English)
Opponent
Supervisors
Available from: 2018-09-18 Created: 2018-08-21 Last updated: 2018-10-02

Open Access in DiVA

No full text in DiVA

Search in DiVA

By author/editor
Svensson, AndreasZachariah, DaveStoica, PeterSchön, Thomas B.
By organisation
Division of Systems and ControlAutomatic control
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 39 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf