An Angle-based Subspace Anomaly Detection Approach to High-dimensional Data: With an Application to Industrial Fault Detection
Luleå University of Technology, Department of Civil, Environmental and Natural Resources Engineering, Operation, Maintenance and Acoustics. ORCID iD: 0000-0001-7310-5717
Luleå University of Technology, Department of Civil, Environmental and Natural Resources Engineering, Operation, Maintenance and Acoustics. ORCID iD: 0000-0002-7458-6820
Luleå University of Technology, Department of Civil, Environmental and Natural Resources Engineering, Operation, Maintenance and Acoustics. ORCID iD: 0000-0002-0055-2740
2015 (English). In: Reliability Engineering & System Safety, ISSN 0951-8320, E-ISSN 1879-0836, Vol. 142, p. 482-497. Article in journal (Refereed), Published
Abstract [en]

The accuracy of traditional anomaly detection techniques implemented on full-dimensional spaces degrades significantly as dimensionality increases, thereby hampering many real-world applications. This work proposes an approach to selecting a meaningful feature subspace and conducting anomaly detection in the corresponding subspace projection. The aim is to maintain detection accuracy in high-dimensional settings. For each anomaly candidate, the suggested approach assesses the angle between pairs of lines: the first line connects the candidate data point to the center of its adjacent points; the second is one of the axis-parallel lines. The dimensions that form a relatively small angle with the first line are then chosen to constitute the axis-parallel subspace for the candidate. Next, a normalized Mahalanobis distance is introduced to measure the local outlier-ness of an object in the subspace projection. To comprehensively compare the proposed algorithm with several existing anomaly detection techniques, we constructed artificial datasets with various high-dimensional settings and found that the algorithm displayed superior accuracy. A further experiment on an industrial dataset demonstrated the applicability of the proposed algorithm in fault detection tasks and highlighted another of its merits, namely that it provides a preliminary interpretation of abnormality through feature ordering in relevant subspaces.
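
The two steps described in the abstract (angle-based choice of an axis-parallel subspace, then a normalized Mahalanobis distance in that subspace) can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: plain Euclidean k-nearest neighbours stand in for the "adjacent points", a fixed number of dimensions `n_dims` is retained instead of an angle threshold, and the distance is normalized by dividing by the number of retained dimensions. The function names `select_subspace` and `subspace_score` are hypothetical.

```python
import numpy as np

def select_subspace(X, idx, k=10, n_dims=2):
    """Pick the axes that form the smallest angle with the line from
    point `idx` to the center of its k nearest neighbours (assumed
    Euclidean neighbours; the abstract does not specify the rule)."""
    p = X[idx]
    order = np.argsort(np.linalg.norm(X - p, axis=1))
    neighbours = order[order != idx][:k]          # exclude the point itself
    direction = p - X[neighbours].mean(axis=0)    # the "first line"
    # |cos| of the angle between the line and each coordinate axis;
    # a large |cos| means a small angle with that axis.
    cos_abs = np.abs(direction) / max(np.linalg.norm(direction), 1e-12)
    dims = np.sort(np.argsort(cos_abs)[::-1][:n_dims])
    return dims, neighbours

def subspace_score(X, idx, k=10, n_dims=2):
    """Mahalanobis distance of point `idx` from its neighbours in the
    selected subspace, divided by the subspace size (assumed
    normalization)."""
    dims, neighbours = select_subspace(X, idx, k, n_dims)
    ref = X[np.ix_(neighbours, dims)]
    diff = X[idx, dims] - ref.mean(axis=0)
    cov = np.atleast_2d(np.cov(ref, rowvar=False))
    cov = cov + 1e-9 * np.eye(len(dims))          # guard against singularity
    d2 = diff @ np.linalg.solve(cov, diff)
    return float(np.sqrt(d2) / len(dims)), dims
```

Calling `subspace_score(X, i)` on an `(n_samples, n_features)` array returns a score together with the retained dimensions. Those dimensions are the kind of feature-level hint the abstract refers to as a preliminary interpretation of abnormality.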

Place, publisher, year, edition, pages
2015. Vol. 142, 482-497 p.
National Category
Other Civil Engineering
Research subject
Operation and Maintenance
Identifiers
URN: urn:nbn:se:ltu:diva-13556
DOI: 10.1016/j.ress.2015.05.025
Local ID: ccb88d21-0f57-4412-9035-a6b9f78de9c7
OAI: oai:DiVA.org:ltu-13556
DiVA: diva2:986509
Note
Validated; 2015; Level 2; 20150531 (liazha)
Available from: 2016-09-29 Created: 2016-09-29 Last updated: 2017-11-24 Bibliographically approved
In thesis
1. Big Data Analytics for Fault Detection and its Application in Maintenance
2016 (English). Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Big Data Analytics för Feldetektering och Applicering inom Underhåll
Abstract [en]

Big Data analytics has attracted intense interest recently for its attempt to extract information, knowledge and wisdom from Big Data. In industry, with the development of sensor technology and Information & Communication Technologies (ICT), reams of high-dimensional, streaming, and nonlinear data are being collected and curated to support decision-making. The detection of faults in these data is an important application in eMaintenance solutions, as it can facilitate maintenance decision-making. Early discovery of system faults may ensure the reliability and safety of industrial systems and reduce the risk of unplanned breakdowns.

Complexities in the data, including high dimensionality, fast-flowing data streams, and high nonlinearity, impose stringent challenges on fault detection applications. From the data modelling perspective, high dimensionality may cause the notorious “curse of dimensionality” and lead to deterioration in the accuracy of fault detection algorithms. Fast-flowing data streams require algorithms to give real-time or near real-time responses upon the arrival of new samples. High nonlinearity requires fault detection approaches to have sufficient expressive power and to avoid overfitting or underfitting problems.

Most existing fault detection approaches work in relatively low-dimensional spaces. Theoretical studies on high-dimensional fault detection mainly focus on detecting anomalies on subspace projections. However, these models are either arbitrary in selecting subspaces or computationally intensive. To meet the requirements of fast-flowing data streams, several strategies have been proposed to adapt existing models to an online mode so that they can be applied in stream data mining. Few studies, however, have simultaneously tackled the challenges associated with high dimensionality and data streams. Existing nonlinear fault detection approaches cannot provide satisfactory performance in terms of smoothness, effectiveness, robustness and interpretability. New approaches are needed to address this issue.

This research develops an Angle-based Subspace Anomaly Detection (ABSAD) approach to fault detection in high-dimensional data. The efficacy of the approach is demonstrated in analytical studies and numerical illustrations. Based on the sliding window strategy, the approach is extended to an online mode to detect faults in high-dimensional data streams. Experiments on synthetic datasets show the online extension can adapt to the time-varying behaviour of the monitored system and, hence, is applicable to dynamic fault detection. To deal with highly nonlinear data, the research proposes an Adaptive Kernel Density-based (Adaptive-KD) anomaly detection approach. Numerical illustrations show the approach’s superiority in terms of smoothness, effectiveness and robustness.
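
The sliding window strategy mentioned above can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the thesis's online ABSAD algorithm. The class name `SlidingWindowDetector`, the threshold, and the rule that only samples judged normal enter the window are hypothetical choices. The default score, `scaled_distance`, is a plain stand-in (root mean square of per-feature z-scores), not the ABSAD score.

```python
from collections import deque
import numpy as np

def scaled_distance(window, x):
    """Stand-in score (assumption, not ABSAD): root mean square of the
    per-feature z-scores of `x` against the current window."""
    mean = window.mean(axis=0)
    std = window.std(axis=0) + 1e-9               # avoid division by zero
    return float(np.sqrt(np.mean(((x - mean) / std) ** 2)))

class SlidingWindowDetector:
    """Scores each new sample against a fixed-size window of recent
    samples, so the reference data follows slow changes in the system."""

    def __init__(self, window_size=100, threshold=3.0, score_fn=scaled_distance):
        self.window = deque(maxlen=window_size)   # oldest sample drops out
        self.threshold = threshold
        self.score_fn = score_fn

    def update(self, x):
        """Return True if `x` is flagged as a fault."""
        x = np.asarray(x, dtype=float)
        if len(self.window) < self.window.maxlen:
            self.window.append(x)                 # warm-up: only collect
            return False
        score = self.score_fn(np.stack(list(self.window)), x)
        is_fault = score > self.threshold
        if not is_fault:
            self.window.append(x)                 # assumed rule: keep normal samples only
        return is_fault
```

Because `score_fn` is a parameter, a subspace-based score could be swapped in for the stand-in without changing the windowing logic.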

Place, publisher, year, edition, pages
Luleå, Sweden: Luleå University of Technology, 2016. 72 p.
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
Keyword
Big Data analytics, eMaintenance, fault detection, high-dimensional data, stream data mining, nonlinear data
National Category
Other Civil Engineering
Research subject
Operation and Maintenance
Identifiers
urn:nbn:se:ltu:diva-60423 (URN)
978-91-7583-769-7 (ISBN)
978-91-7583-770-3 (ISBN)
Public defence
2017-01-27, F1031, Luleå University of Technology, 971 87, Luleå, 10:00 (English)
Available from: 2016-11-15 Created: 2016-11-15 Last updated: 2017-11-24 Bibliographically approved

Open Access in DiVA

fulltext (1846 kB), 104 downloads
File information
File name: FULLTEXT01.pdf
File size: 1846 kB
Checksum (SHA-512): d1bdbb5ba0e8edcc6beb001eafb6540b149e8af8b80c484edcd050904bbffce478e4557bf2a002e72e7988dd1aed319b97e0925892f077b9939179e074107872
Type: fulltext
Mimetype: application/pdf

Other links

Publisher's full text: http://authors.elsevier.com/a/1RK3V_Lf6GwVmE

Search in DiVA

By author/editor
Zhang, Liangwei; Lin, Janet; Karim, Ramin
