Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Sliding Window-based Fault Detection from High-dimensional Data Streams
Luleå University of Technology, Department of Civil, Environmental and Natural Resources Engineering, Operation, Maintenance and Acoustics.ORCID iD: 0000-0001-7310-5717
Luleå University of Technology, Department of Civil, Environmental and Natural Resources Engineering, Operation, Maintenance and Acoustics.ORCID iD: 0000-0002-7458-6820
Luleå University of Technology, Department of Civil, Environmental and Natural Resources Engineering, Operation, Maintenance and Acoustics.ORCID iD: 0000-0002-0055-2740
Number of Authors: 3
2017 (English)In: IEEE Transactions on Systems, Man & Cybernetics. Systems, ISSN 2168-2216, Vol. 47, no 2, 289-303 p., 7509594Article in journal (Refereed) Published
Abstract [en]

High-dimensional data streams are becoming increasingly ubiquitous in industrial systems. Efficient detection of system faults from these data can ensure the reliability and safety of the system. The difficulties brought about by high dimensionality and data streams are mainly the ``curse of dimensionality'' and concept drifting, and one current challenge is to simultaneously address them. To this purpose, this paper presents an approach to fault detection from nonstationary high-dimensional data streams. An angle-based subspace anomaly detection approach is proposed to detect low-dimensional subspace faults from high-dimensional datasets. Specifically, it selects fault-relevant subspaces by evaluating vectorial angles and computes the local outlier-ness of an object in its subspace projection. Based on the sliding window strategy, the approach is further extended to an online mode that can continuously monitor system states. To validate the proposed algorithm, we compared it with the local outlier factor-based approaches on artificial datasets and found the algorithm displayed superior accuracy. The results of the experiment demonstrated the efficacy of the proposed algorithm. They also indicated that the algorithm has the ability to discriminate low-dimensional subspace faults from normal samples in high-dimensional spaces and can be adaptive to the time-varying behavior of the monitored system. The online subspace learning algorithm for fault detection would be the main contribution of this paper.

Place, publisher, year, edition, pages
2017. Vol. 47, no 2, 289-303 p., 7509594
National Category
Other Civil Engineering
Research subject
Operation and Maintenance
Identifiers
URN: urn:nbn:se:ltu:diva-12680DOI: 10.1109/TSMC.2016.2585566ISI: 000396231100008Scopus ID: 2-s2.0-85009962444Local ID: bd869fb2-6668-49d9-9511-535c94003f4fOAI: oai:DiVA.org:ltu-12680DiVA: diva2:985631
Note

Validerad; 2017; Nivå 2; 2017-01-23 (andbra)

Available from: 2016-09-29 Created: 2016-09-29 Last updated: 2017-11-24Bibliographically approved
In thesis
1. Big Data Analytics for Fault Detection and its Application in Maintenance
Open this publication in new window or tab >>Big Data Analytics for Fault Detection and its Application in Maintenance
2016 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Big Data Analytics för Feldetektering och Applicering inom Underhåll
Abstract [en]

Big Data analytics has attracted intense interest recently for its attempt to extract information, knowledge and wisdom from Big Data. In industry, with the development of sensor technology and Information & Communication Technologies (ICT), reams of high-dimensional, streaming, and nonlinear data are being collected and curated to support decision-making. The detection of faults in these data is an important application in eMaintenance solutions, as it can facilitate maintenance decision-making. Early discovery of system faults may ensure the reliability and safety of industrial systems and reduce the risk of unplanned breakdowns.

Complexities in the data, including high dimensionality, fast-flowing data streams, and high nonlinearity, impose stringent challenges on fault detection applications. From the data modelling perspective, high dimensionality may cause the notorious “curse of dimensionality” and lead to deterioration in the accuracy of fault detection algorithms. Fast-flowing data streams require algorithms to give real-time or near real-time responses upon the arrival of new samples. High nonlinearity requires fault detection approaches to have sufficiently expressive power and to avoid overfitting or underfitting problems.

Most existing fault detection approaches work in relatively low-dimensional spaces. Theoretical studies on high-dimensional fault detection mainly focus on detecting anomalies on subspace projections. However, these models are either arbitrary in selecting subspaces or computationally intensive. To meet the requirements of fast-flowing data streams, several strategies have been proposed to adapt existing models to an online mode to make them applicable in stream data mining. But few studies have simultaneously tackled the challenges associated with high dimensionality and data streams. Existing nonlinear fault detection approaches cannot provide satisfactory performance in terms of smoothness, effectiveness, robustness and interpretability. New approaches are needed to address this issue.

This research develops an Angle-based Subspace Anomaly Detection (ABSAD) approach to fault detection in high-dimensional data. The efficacy of the approach is demonstrated in analytical studies and numerical illustrations. Based on the sliding window strategy, the approach is extended to an online mode to detect faults in high-dimensional data streams. Experiments on synthetic datasets show the online extension can adapt to the time-varying behaviour of the monitored system and, hence, is applicable to dynamic fault detection. To deal with highly nonlinear data, the research proposes an Adaptive Kernel Density-based (Adaptive-KD) anomaly detection approach. Numerical illustrations show the approach’s superiority in terms of smoothness, effectiveness and robustness.

Place, publisher, year, edition, pages
Luleå, Sweden: Luleå University of Technology, 2016. 72 p.
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
Keyword
Big Data analytics, eMaintenance, fault detection, high-dimensional data, stream data mining, nonlinear data
National Category
Other Civil Engineering
Research subject
Operation and Maintenance
Identifiers
urn:nbn:se:ltu:diva-60423 (URN)978-91-7583-769-7 (ISBN)978-91-7583-770-3 (ISBN)
Public defence
2017-01-27, F1031, Luleå University of Technology, 971 87, Luleå, 10:00 (English)
Opponent
Supervisors
Available from: 2016-11-15 Created: 2016-11-15 Last updated: 2017-11-24Bibliographically approved

Open Access in DiVA

fulltext(3313 kB)305 downloads
File information
File name FULLTEXT01.pdfFile size 3313 kBChecksum SHA-512
b80142405157b99821e2bbccae4dae14d1695bdbd8c303a0c5aa6aee90fb21381716c7b0e19cf83d217cc45a53f88b512f94f5f85139f6e023a269c7a8237429
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopushttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7509594&newsearch=true&queryText=Sliding%20Window-based%20Fault%20Detection%20from%20High-dimensional%20Data%20Streams Ospecifierade

Search in DiVA

By author/editor
Zhang, LiangweiLin, JanetKarim, Ramin
By organisation
Operation, Maintenance and Acoustics
Other Civil Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 305 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 415 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf