Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Filtering Misleading Repair Log Labels to Improve Predictive Maintenance Models
Halmstad University, School of Information Technology.ORCID iD: 0000-0001-5395-5482
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-7796-5201
Halmstad University, School of Information Technology. RISE Research Institutes of Sweden, Kista, Sweden.ORCID iD: 0000-0003-3272-4145
2022 (English)In: Proceedings of the 7th European Conference of the Prognostics and Health Management Society 2022 / [ed] Phuc Do; Gabriel Michau; Cordelia Ezhilarasu, State College, PA: PHM Society , 2022, Vol. 7 (1), p. 110-117Conference paper, Published paper (Refereed)
Abstract [en]

One of the main challenges for predictive maintenance in real applications is the quality of the data, especially the labels. In this paper, we propose a methodology to filter out the misleading labels that harm the performance of Machine Learning models. Ideally, predictive maintenance would be based on the information of when a fault has occurred in a machine and what specific type of fault it was. Then, we could train machine learning models to identify the symptoms of such fault before it leads to a breakdown. However, in many industrial applications, this information is not available. Instead, we approximate it using a log of component replacements, usually coming from the sales or maintenance departments. The repair history provides reliable labels for fault prediction models only if the replaced component was indeed faulty, with symptoms captured by collected data, and it was going to lead to a breakdown.

However, very often, at least for complex equipment, this assumption does not hold. Models trained using unreliable labels will then, necessarily, fail. We demonstrate that filtering misleading labels leads to improved results. Our central claim is that the same fault, happening several times, should have similar symptoms in the data; thus, we can train a model to predict them. On the contrary, replacements of the same component that do not exhibit similar symptoms will be confusing and harm the ML models. Therefore, we aim to filter the maintenance operations, keeping only those that can be used to predict each other. Suppose we can train a successful model using the data before a component replacement to predict another component replacement. In that case, those maintenance operations must be motivated by the same, or a very similar, type of fault.

We test this approach on a real scenario using data from a fleet of sterilizers deployed in hospitals. The data includes sensor readings from the machines describing their operations and the service logs indicating the replacement of components when the manufacturing company performs the service. Since sterilizers are complex machines consisting of many components and systems interacting with each other, there is the possibility of faults happening simultaneously.

Place, publisher, year, edition, pages
State College, PA: PHM Society , 2022. Vol. 7 (1), p. 110-117
Series
Proceedings of the European Conference of the Prognostics and Health Management Society (PHME), E-ISSN 2325-016X
Keywords [en]
Predictive maintenance, misleading labels, Machine Learning
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-48120DOI: 10.36001/phme.2022.v7i1.3360ISBN: 978-1-936263-36-3 (print)OAI: oai:DiVA.org:hh-48120DiVA, id: diva2:1697963
Conference
7th European Conference of the Prognostics and Health Management (PHM) Society, Turin, Italy, July 6-8, 2022
Funder
Knowledge FoundationSwedish Research CouncilAvailable from: 2022-09-22 Created: 2022-09-22 Last updated: 2025-10-01Bibliographically approved
In thesis
1. Hierarchical Methods for Self-Monitoring Systems: Theory and Application
Open this publication in new window or tab >>Hierarchical Methods for Self-Monitoring Systems: Theory and Application
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Self-monitoring solutions first appeared to avoid catastrophic breakdowns in safety-critical mechanisms. The design behind these solutions relied heavily on the physical knowledge of the mechanism and its fault. They usually involved installing specialized sensors to monitor the state of the mechanism and statistical modeling of the recorded data. Mainly, these solutions focused on specific components of a machine and rarely considered more than one type of fault.

In our work, on the other hand, we focus on self-monitoring of complex machines, systems composed of multiple components performing heterogeneous tasks and interacting with each other: systems with many possible faults. Today, the data available to monitor these machines is vast but usually lacks the design and specificity to monitor each possible fault in the system accurately. Some faults will show distinctive symptoms in the data; some faults will not; more interestingly, there will be groups of faults with common symptoms in the recorded data.

The thesis in this manuscript is that we can exploit the similarities between faults to train machine learning models that can significantly improve the performance of self-monitoring solutions for complex systems that overlook these similarities. We choose to encode these similarity relationships into hierarchies of faults, which we use to train hierarchical supervised models. We use both real-life problems and standard benchmarks to prove the adequacy of our approach on tasks like fault diagnosis and fault prediction.

We also demonstrate that models trained on different hierarchies result in significantly different performances. We analyze what makes a good hierarchy and what are the best practices to develop methods to extract hierarchies of classes from the data. We advance the state-of-the-art by defining the concept of heterogeneity of decision boundaries and studying how it affects the performance of different class decompositions. 

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2022. p. 66
Series
Halmstad University Dissertations ; 93
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-48138 (URN)978-91-88749-98-7 (ISBN)978-91-88749-97-0 (ISBN)
Public defence
2022-10-14, Wigforssalen, Hus J (Visionen), Kristian IV:s väg 3, Halmstad, 10:00 (English)
Opponent
Supervisors
Available from: 2022-09-23 Created: 2022-09-23 Last updated: 2025-10-01Bibliographically approved

Open Access in DiVA

fulltext(565 kB)171 downloads
File information
File name FULLTEXT01.pdfFile size 565 kBChecksum SHA-512
47488b2b22fdd29bce79b1321aea4fb866bb2f8f1290e76a0c316055045303e720b88508fa7a0ea9a40aea5f5ca9497743eeaa9b12c47442620afaf7a6741e59
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Del Moral, PabloNowaczyk, SławomirPashami, Sepideh
By organisation
School of Information Technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 172 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 259 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf