Data-driven segmentation of long term condition monitoring data in the presence of heavy-tailed distributed noise with finite-variance
Data-driven segmentation of long term condition monitoring data in the presence of heavy-tailed distributed noise with finite-variance
Machinery condition prognosis systems use long-term historical data to predict the remaining useful life (RUL). One of the critical steps to reach this purpose is to segment long-term data into two or several degradation stages (healthy, unhealthy, critical stage). Finding a changing point between stages may be a crucial preliminary task for further prediction of degradation process. However, finding the accurate partition into two or more stages is a challenging task in actual application when noise inherent in the observed process exhibits non-Gaussian characteristics. In this paper, a framework for data-driven segmentation is presented for prognosis of machinery long-term data in presence of heavy-tailed distributed noise with finite variance. It is assumed that three different stages are inherent in degradation process and each segment of data follows a specific trend (constant, linear, exponential or polynomial). At first, data is divided into three parts. Trend functions are fitted to the data by using robust regression method, and cumulative error is calculated. This process is done iteratively for all possible partitions into three intervals to find the segmentation which minimizes the error. The framework has been tested via empirical analysis of estimators of the changing points obtained in Monte Carlo simulations. Also, discussed approaches are applied to the real data. In such measurement, data that are commonly available (in condition monitoring systems) is aggregated from the raw signal and sampled at long intervals. Finally, effectiveness of the segmentation results is assessed by comparing them with envelope frequency analysis of raw signal to confirm the fact that detected changing points coincide with start time of the fault in the machine or not.
Shiri, Hamid
7a4304e3-a4bc-4007-961b-29530af225fd
Zimroz, Paweł
30c91d8b-525d-410f-ae8d-51899faebb7a
Wodecki, Jacek
bed412ce-c860-4637-9aee-34685199239d
Wylomanska, Agnieszka
420eb98f-605e-486c-8d33-bd3c2a859be1
Zimroz, Radoslaw
d3d00d36-da1f-411b-8f02-22871182ff08
11 October 2023
Shiri, Hamid
7a4304e3-a4bc-4007-961b-29530af225fd
Zimroz, Paweł
30c91d8b-525d-410f-ae8d-51899faebb7a
Wodecki, Jacek
bed412ce-c860-4637-9aee-34685199239d
Wylomanska, Agnieszka
420eb98f-605e-486c-8d33-bd3c2a859be1
Zimroz, Radoslaw
d3d00d36-da1f-411b-8f02-22871182ff08
Shiri, Hamid, Zimroz, Paweł, Wodecki, Jacek, Wylomanska, Agnieszka and Zimroz, Radoslaw
(2023)
Data-driven segmentation of long term condition monitoring data in the presence of heavy-tailed distributed noise with finite-variance.
Mechanical Systems and Signal Processing, 205, [110833].
(doi:10.1016/j.ymssp.2023.110833).
Abstract
Machinery condition prognosis systems use long-term historical data to predict the remaining useful life (RUL). One of the critical steps to reach this purpose is to segment long-term data into two or several degradation stages (healthy, unhealthy, critical stage). Finding a changing point between stages may be a crucial preliminary task for further prediction of degradation process. However, finding the accurate partition into two or more stages is a challenging task in actual application when noise inherent in the observed process exhibits non-Gaussian characteristics. In this paper, a framework for data-driven segmentation is presented for prognosis of machinery long-term data in presence of heavy-tailed distributed noise with finite variance. It is assumed that three different stages are inherent in degradation process and each segment of data follows a specific trend (constant, linear, exponential or polynomial). At first, data is divided into three parts. Trend functions are fitted to the data by using robust regression method, and cumulative error is calculated. This process is done iteratively for all possible partitions into three intervals to find the segmentation which minimizes the error. The framework has been tested via empirical analysis of estimators of the changing points obtained in Monte Carlo simulations. Also, discussed approaches are applied to the real data. In such measurement, data that are commonly available (in condition monitoring systems) is aggregated from the raw signal and sampled at long intervals. Finally, effectiveness of the segmentation results is assessed by comparing them with envelope frequency analysis of raw signal to confirm the fact that detected changing points coincide with start time of the fault in the machine or not.
Text
1-s2.0-S0888327023007410-main
- Version of Record
More information
Accepted/In Press date: 28 September 2023
e-pub ahead of print date: 11 October 2023
Published date: 11 October 2023
Identifiers
Local EPrints ID: 503327
URI: http://eprints.soton.ac.uk/id/eprint/503327
ISSN: 0888-3270
PURE UUID: 9662af6c-5556-44b0-a682-2a66e1cf8f2f
Catalogue record
Date deposited: 29 Jul 2025 16:42
Last modified: 22 Aug 2025 02:49
Export record
Altmetrics
Contributors
Author:
Hamid Shiri
Author:
Paweł Zimroz
Author:
Jacek Wodecki
Author:
Agnieszka Wylomanska
Author:
Radoslaw Zimroz
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics