Articles | Volume 16, issue 2
https://doi.org/10.5194/ms-16-685-2025
https://doi.org/10.5194/ms-16-685-2025
Research article
 | 
04 Nov 2025
Research article |  | 04 Nov 2025

Data dynamic networks coupled with mechanical vibration energy for bearing defect identification

Lingwei Hou, Tianfeng Wang, Mukai Wang, Duhui Lu, Sicheng Li, and Faju Qiu
Abstract

Traditional machine defect identification methods have problems such as poor adaptability and low accuracy. As for solving the problems, a large number of improved machine defect identification techniques based on artificial intelligence have been developed; however, all of these methods focus on the neural networks and ignore the connections and influences of the data nodes. This paper proposes a feature-enhanced method based on a data dynamic network for bearing defect identifications. The research preprocesses the data by means of optimization algorithms and defines nodes and edges of data for the construction of a data dynamic network. In order to further improve the accuracy of the machine bearing defect identification, the data are also washed for the energy analysis to be coupled with the data dynamic network for feature enhancement. The training sets and test sets are defined with different data coupling techniques. A performance evaluation of the proposed method is carried out by means of the evaluation function for more effective detection of bearing defects.

Share
1 Introduction

As one of the key parts in mechanical equipment, the importance of bearings cannot be ignored, and they play a key role in motion control and support. Due to the low accuracy and poor adaptability of the traditional defect identification methods and the heavy reliance on manual experience, it is necessary to continue to develop intelligent defect identification methods based on machine learning. In the context of today's rapid development of artificial intelligence and machine learning technology, convolutional neural network technology has been widely used in the field of machinery defect identification. Traditional mechanical defect identification methods usually require a large amount of experience and professional knowledge, while convolutional neural networks can automatically identify the type and location of mechanical defects by learning and analyzing a large amount of mechanical operation data and, at the same time, generating feedback results. This method can not only improve the efficiency and accuracy of mechanical system defect identification but also largely reduce the cost and risk of manual identification.

Mechanical equipment defect identification technology is a scientific and technological means to ensure the stable operation of equipment and to maintain safe production. Using all kinds of sensors or intelligent sensing equipment, through effective work and cooperation, real-time monitoring, the collection of machinery and equipment running parameters, the use of a variety of signal processing and analysis methods for research, and accurate identification of the equipment and by doing a good job of protecting the corresponding work, we can greatly improve the safety and stability of machinery and equipment operation.

To improve defect identification accuracy and to understand the different reasons for and types of defects, many researchers have focused on this area. Wu (2020) conducted a detailed study on the rotor crack defect monitoring and identification technology for large-scale equipment such as turbine generators and formed a systematic and comprehensive rotor crack defect identification, analysis, and elimination method for turbine generators by establishing a relationship model related to different types of defects and two specific frequency components, as well as four specific working-condition parameters, which can effectively help to eliminate the rotor crack defects and provide an innovative and effective diagnostic method for the actual on-site defect identification. It can help effectively in troubleshooting rotor cracks and in providing an innovative and effective diagnostic method for on-site defect identification. Wang et al. (2013) designed a turbine rotor defect detection and diagnostic analysis method based on deep convolutional neural networks, which can realize end-to-end detection of large-scale equipment such as turbine rotor defects and can effectively classify different kinds of single simple defects and, at the same time, carry out multi-task collaborative detection and analysis of different degrees and locations, further applying the theory of deep learning to the actual identification of turbine rotor defects and directly mapping the multi-measurement point vibration data signals to the desired defect characteristics, which has the advantage of strong generalizability and high accuracy. The deep learning theory is further applied to the actual steam turbine rotor defect identification, and the vibration data signals from multiple measurement points are directly mapped to the desired defect characteristics, which has the advantages of strong generalizability and high accuracy. Zhong et al. (2022) further improved the diagnostic algorithm by applying the fuzzy-proximity algorithm to analyze and calculate the precise proximity between the data to be identified and the standard clustering centers and realized effective defect identification of the ground rotor imbalance under the working condition of variable speed. Zhang et al. (2022) combined the energy of the effective intrinsic mode function (IMF) component of the original signal and the energy entropy of the signal obtained after using the empirical modal decomposition method as new feature vectors, proposed an innovative feature vector combination, and optimized the back-propagation (BP) error of the back-propagation neural network using the DPSO algorithm to further improve the BP neural network by optimizing the threshold value of the neural network and the initial weight value. The BP neural network is further improved by optimizing the threshold and initial weights of the neural network. Jia et al. (2023) proposed an alternating evolution discontinuous Galerkin (AEDG) method, which provides a new means to solve the difficulties encountered in the traditional data-driven defect identification method when the actual operating conditions of the rotor mechanical system are variable through practical defect identification experiments and specific analysis and application. Chen et al. (2023) solved the problem of it being difficult to effectively identify the different touch-wear defects generated in the early and middle stages of using a turbine rotor through the vibration-signal-based defect detection and identification method by improving the reasonable method of obtaining the feature data set and proposed a new touch-wear defect identification method based on ensemble empirical mode decomposition with long short-term memory (EEMD-LSTM) in order for the turbine rotor to be researched and analyzed. Wu et al. (2023) proposed a marine eddy detection model, YOLOX-EDDY, based on YOLOX high-performance target detection, which is capable of automatically extracting feature parameters such as eddy center position, edge position, and scale type and can realize automated and intelligent processing for automatic eddy detection and feature information extraction.

In general, there are many kinds of mechanical equipment defect identification methods, which can be divided into two main categories: signal-processing-based defect identification methods and defect identification methods based on existing knowledge. Defect identification methods based on signal processing mainly include vibration identification, non-destructive testing, oil sample analysis, and temperature monitoring. Defect identification methods based on existing knowledge employ a combination of artificial intelligence; deep learning; and other computer technologies, mainly convolutional neural networks, expert systems, decision trees, support vector machines, and graphical neural networks. Compared with the traditional machinery and equipment defect identification methods, using the methods outlined above, rapidity, accuracy, stability, and robustness have been greatly improved.

Neural network technology is a nonlinear dynamic simulation study of biological neural network patterns. With deep learning and machine learning techniques gradually becoming the mainstream research direction in the field of artificial intelligence research, neural networks, especially convolutional neural networks, have been fully developed.

Zhang et al. (2020) provided a comprehensive review of graph-structure-based machine learning techniques by approaching them from the perspective of semi-supervised and unsupervised learning. Qiu et al. (2023) proposed a two-channel collaborative clustering analysis algorithm for heterogeneous information networks by designing a simple and effective two-channel encoder to efficiently aggregate the neighborhood information and use the collaborative clustering mechanism to cluster different types of nodes at the same time, which results in better performance than the widely used neural network encoder. Neural network technology, with its diverse research methods and advanced knowledge in many fields, provides new ideas and methods for bearing defect identification. Su et al. (2024) proposed a knowledge-informed deep network (KIDN) for robust rolling-bearing fault diagnosis, using prior knowledge-based features and data-driven learning. They developed a generalizability-based adaptive design strategy using a constrained Gaussian process (CGP) surrogate model to efficiently identify optimal KIDN architectures. Tang et al. (2024) provided a scaled-minimum unscented-Kalman-filter-aided deep belief network (DBN) (SUKF-DBN) for low-speed fault diagnosis in offshore wind turbines. This approach transforms multi-sensor time series into temporally preserved 2-D maps and augments DBN feature representation using a scaled-minimum UKF, dynamically adapting to noise conditions. To overcome the infeasibility of data-intensive deep learning for fault-rare mechanical systems, Ding et al. (2024) developed a metric-learning Siamese network (CASN) with channel attention. By mapping feature disparities between sample pairs via a shared encoder, CASN enables diagnosis from extremely limited samples while maintaining robustness in relation to noise and transmission distortion. Wen et al. (2024) proposed a Siamese neural network (SNN) framework with multi-source feature fusion for small-sample motor bearing diagnosis. A multi-stage training strategy prevents stagnation while extracting discriminative features from limited data. Fusion of multi-sensor features enhances robustness. You et al. (2025) addressed interpretability and reliability gaps in DL-based bearing diagnosis by proposing a sound vibration physical-information fusion-constraint-guided (PFCG) framework. This method fuses multi-physics data via a 15-degrees-of-freedom (DOF) nonlinear dynamics model and guides DL training through weighted physical consistency, uncertainty, and cross-entropy losses to enhance interpretability.

Regarding bearing fault diagnosis, effective feature extraction from raw vibration signals is a critical component. Section 2 commences this research by presenting rolling-bearing defect feature extraction methods, along with an overview of relevant bearing defect characteristics and theoretical foundations. Section 3 demonstrates the practical application of data augmentation techniques for robustness enhancement and presents experimental verification of three distinct feature extraction categories: energy features, recursive features, and dynamic network features. The final section summarizes this research.

2 Rolling-bearing defect feature extraction method

2.1 Recurrence quantities

In Figs. 1 and 2, since the modulating signal has a certain regularity and is a periodic signal, its recurrence plot also has a certain regularity to follow; while the defect signal is a non-periodic signal, its recurrence plot is chaotic (Eckmann et al., 1987; Wang et al., 2024). The accurate recognition of various features in the recurrence diagram requires certain experience; in order to be able to analyze the signal under study more accurately and scientifically, it is necessary to quantify the various features obtained in the recurrence diagram drawn.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f01

Figure 1FM signal with its recurrence plot (left: FM signal; right: recurrence plot).

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f02

Figure 2Defect signal with its recurrence plot (left: defect signal; right: recurrence plot).

Download

2.2 Recurrence quantification analysis

The nonlinear and non-smooth characteristics of rolling-bearing defect signals make the traditional defect feature extraction methods based on the smoothness of the signals limited, while the recurrence quantification analysis (RQA) method is an effective method that can quantify the recurrence phenomenon of the system shown in the recurrence diagrams on the basis of improving the recurrence diagrams (Zbilut and Webber, 2006). Recurrence quantification analysis is an effective method to quantify the recurrence phenomenon in recurrence diagrams. The recurrence diagram is a qualitative means of analyzing the complexity of a signal, while the RQA method is a quantitative method derived from the recurrence diagram. The nonlinear characteristics of the RQA method include the following: recurrence rate (RR), recurrence entropy (ENTR), layered rate (LAM), determination rate (DET), recurrence number (TT), and average diagonal length (L). The recurrence rate, recurrence entropy, and average diagonal length (L) of the recurrence diagram are the most important characteristics of the recurrence diagram. Further more, out of the above characteristics, recursion rate, recursion entropy, and recursion trend are the more commonly used measures.

Recurrence rate (RR) refers to the ratio of the number of points appearing in the recurrence plot to the number of points appearing in the complete distance matrix (Yu et al., 2025). The RR counts only the black points in the recurrence map and is a measure used to evaluate the density of recurrence points, representing the frequency of recurrence of phase points and the degree of clustering of trajectories in phase space.

(1) RR = 1 M 2 Σ i , j = 1 M R i j

In the above, Rij denotes the recurrence matrix, and M is the number of points on the horizontal or vertical axis in the recurrence plot.

Recursive entropy (ENTR) refers to the distribution of the length of diagonal structures in a recursive graph. It is a metric that can be used to measure the strength of the periodicity of a recursive graph.

(2) ENTR = - Σ l = l min M p l ln p l

In the above, p(l) represents the number of line segments of length l that are parallel to the main diagonal.

Recurrence trend (TREND) refers to the degree of the speed of change in the rate of recurrence as the main diagonal on the recurrence graph transitions to the other two corners, essentially reflecting the degree of mixing and non-stationarity of the signal.

(3) TREND = τ = 1 N ̃ τ - N ̃ 2 RR τ - RR τ τ = 1 N ̃ τ - N ̃ 2 2

Here, 〈RRτ denotes the size of the mean of the time series, and RRτ denotes the rate of recurrence of the different parts of the time series after averaging over Ñ. If the time measure changes in the sliding window, the size of the recurrence trend depends mainly on the size of the window, and so, for different sizes of the window, the resulting recurrence trend may be very different.

2.3 Data dynamic network

The correlation coefficient is an important indicator of the size of statistical relationships and the strength of relationships between different data. Data dynamic network characteristics refer to the artificially established network of connections among data, which can effectively strengthen the defect characteristics of the data set. By studying the instantaneous linear relationship between adjacent signals, based on the Pearson correlation coefficient in conjunction with the sliding time window, we can arrange the defect data into a time-dependent network, completing the analysis of the two metrics of correlation and phase synchronization and thus endowing the data with useful dynamic attributes (Vinck et al., 2011).

The overall correlation coefficient between two variables X and Y is defined as follows:

(4) ρ X , Y = cov ( X , Y ) σ X σ Y = E X - μ X Y - μ Y σ X σ Y ,

where σX and σY denote the corresponding standard deviations.

The Pearson correlation coefficient r is expressed as follows:

(5) r = i = 1 n X i - X Y i - Y i = 1 n X i - X 2 i = 1 n Y i - Y 2 .

For network connectivity, in the analysis of specific signals, phase synchronization is an important indication of the interaction between different nodes (Onias and Viol, 2013).

If two signal sequences are phase synchronized then the correlation between the nodes represented by these two signals is stronger. As shown in the Fig. 3, data dynamic network characteristics refer to the dynamic functional network matrix composed of different nodes and connected edges between nodes, with the connected edges reflecting the correlation and consistency between the nodes, thus enabling us to show the dynamic changes in the defects in specific parts of the rolling bearing.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f03

Figure 3Example of a network connection.

Download

2.4 Data sets

The data set used in this paper is the publicly available rolling-bearing failure data set from Case Western Reserve University (CWRU) (21). The entire experimental platform consists of a 1.5 kW motor, torque transducer, power meter, etc., as shown in Fig. 4. The data used in this paper are the drive end bearing data; the bearing is sealed on both sides and is manufactured by SKF, with deep groove ball bearings (model 6205). The experimental sampling frequency is 12 kHz, and the specific parameters are shown in Table 1. There are three types of defects detected in the experimental bearings, namely, inner-ring defects, outer-ring defects, and rolling-element defects. Together with the normal data, there are four final experimental data types selected. In this paper, for the inner-ring failure, outer-ring failure, and rolling-body failure, we selected eight relevant data files, with four data files being selected for normal data for subsequent defect extraction and training tests; the specific types are shown in Table 2.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f04

Figure 4Bearing failure experiment.

Download

Table 1Bearing models and parameters.

Download Print Version | Download XLSX

Table 2Data selected for this paper.

Download Print Version | Download XLSX

3 Experimental verification

3.1 Data set enhancement

This paper is based on the programming for each selected data file, applying the written algorithm for the sliding-time-window slicing process to enhance the data set; the repetition rate is 0.6 for a total of 100 slides to complete the initial processing of the data, as shown in Fig. 5.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f05

Figure 5Demonstration of sliding time window.

Download

3.2 Power spectral density analysis

The data obtained from the power spectral density analysis of the filtered processed data in this section can be used as part of the feature quantity for subsequent machine learning, with the power spectral density being shown in Figs. 6 to 9.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f06

Figure 6Power spectrum of the inner-ring-defect sample.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f07

Figure 7Power spectrum of the outer-ring-defect sample.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f08

Figure 8Power spectrum of rolling-body-failure samples.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f09

Figure 9Power spectrum of a normal sample.

Download

The power spectral density visualization is shown in Figs. 10 to 12. After visualization, the power spectral density analysis of different defect areas can intuitively show the magnitude of and difference in energy in different frequency ranges, which can be used as the input features of the subsequent convolutional neural network with good results.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f10

Figure 10Visualization of power spectral density of inner-ring defects.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f11

Figure 11Visualization of power spectral density of outer-ring defects.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f12

Figure 12Visualization of power spectral density of rolling-body defects.

Download

3.3 Recursive-feature extraction

Using the recurrence map and the RQA method for quantitative analysis, each file of the obtained features is 8 rows × 11 columns, in which the first 10 columns represent the power spectral densities in different frequency ranges, and these features can comprehensively reflect the dynamic characteristics and complexity of the data series. The new 11th column is the recurrence rate, which provides important information about the phase space reconstruction in the data and provides strong support for the subsequent data interpretation and model construction. The inner-circle-defect sample (no. 1) with the new recurrence rate is shown in Table 3.

Table 3Sample inner-circle failures with new recurrence rates. PSD: power spectral density.

Download Print Version | Download XLSX

The extracted features are visualized and processed as shown in Fig. 13.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f13

Figure 13Data recursive-feature visualization (first row: inner and outer ring; second row: roller and normal).

Download

3.4 Data dynamic network feature extraction

A new defect data set is obtained by applying data set enhancement techniques such as sliding-time-window slicing. Within this data set, there are eight groups of data for each different defect type; each group consists of 100 rows of data after filtering and amplitude–frequency analysis using the Pearson correlation coefficient analysis method mentioned in the above section, with subsequent calculation of the correlation coefficients of the data in two adjacent rows so as to generate the strengthened feature matrix and conversion of the data into an 8 × 10 network matrix with time window processing so as to obtain the dynamic. The dynamic network matrix is constructed based on the sliding-time-window method as shown in Fig. 14.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f14

Figure 14Constructing dynamic networks based on a sliding-time-window approach.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f15

Figure 15Visualization of dynamic network features of data (first row: inner and outer ring; second row: roller and normal).

Download

The extracted features are visualized and processed as shown in Fig. 15, from which it can be intuitively seen that the dynamic network features are more effective in feature visualization than energy feature analysis due to their dynamic properties. Through the comparison between different colors, the difference between dynamic network features extracted from different samples can be clearly seen.

The variation in the strength of the constructed network nodes over time is shown in Fig. 16, from which it can be concluded that there are some fluctuations in the strength of the nodes of each defect type over time, with relatively large fluctuations in the outer-ring defects.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f16

Figure 16Plot of average strength of network nodes over time.

Download

3.5 Convolutional neural network architecture

Convolutional neural networks (CNNs) have become a key data-processing tool in the defect identification research of mechanical equipment (Zhu, 2024). The purpose of this section is to analyze and study the practical application effect of the convolutional neural network model based on energy features, recursive features, and dynamic network features in bearing defect identification. This chapter will introduce the specific structure of the convolutional neural network and the construction process in detail, including the convolutional layer, nonlinear activation function, pooling operation, fully connected layer, and other major parts, and will also clarify their functions and interconnections and then introduce the forward-propagation algorithm; the back-propagation algorithm; and a series of evaluation indexes, including the accuracy rate, the loss function, and the confusion matrix, in order to conduct a further, reasonable evaluation for different models. The experimental results show that, compared with the different models, the algorithms are more accurate and more efficient.

The experimental results show that, compared with the convolutional neural network (CNN) model using only energy features, the model with new data recursive features and the model using data dynamic features have certain dynamic attributes due to their association with time; additionally, the convergence speed and diagnostic accuracy have been improved, which indicates that the data recursive features and the dynamic network features can effectively enhance the model's ability to recognize the defect signals, thus validating the robustness of this paper's method. Thus, the advantages of the method in this paper in terms of robustness and generalizability are verified.

3.5.1 The pooling layer

The pooling layer is a method to reduce the complexity of the model (LeCun et al., 2015). By reducing the resolution of the feature map, the parameters of the neural network can be effectively reduced, and, at the same time, the adaptability to the input data in displacement and scale changes is enhanced. In practice, there are two common pooling methods: maximum pooling and mean pooling, as shown in Fig. 17.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f17

Figure 17Example of pooling operation.

Download

3.5.2 Full-connectivity layer

A fully connected layer is one of the components of a classifier, which is characterized by the fact that every neuron in the current layer is connected to all neurons in the previous layer. While operations such as convolutional, pooling, and activation layers map the original data to the hidden feature space, the fully connected layer serves to map the resulting “distributed feature representation” to the sample labeling space. The equation is as follows:

(6) x k + 1 = Σ W k x k + b k + 1 ,

where xk+1 denotes the current neuron output result, Wk denotes the connection weight, and bk represents the size of the bias value.

3.6 Forward-propagation algorithm and back-propagation algorithm

The forward-propagation algorithm is the basis of convolutional neural network prediction, which can be calculated layer by layer until the final result is output according to the hierarchical structure of the network. The back-propagation algorithm is one of the key cores of the convolutional neural network training; through the calculation of the gradient of the loss function relative to the parameters of each layer, it can realize the updating of the network parameters and, ultimately, can achieve the purpose of minimizing the difference between the prediction results and the real value.

3.6.1 Forward-propagation algorithm

The forward-propagation algorithm is one of the common processes used to perform neural network operations. The information is transmitted layer by layer by combining with weights and biases to finally obtain the output, where the activity state of each layer is determined by the corresponding activation function of that layer, as shown in Fig. 18. After completing the output, the forward propagation will quantitatively analyze the error size according to some evaluation metrics. The specific calculations are as follows:

(7) a j l = σ k ω j k l a k l - 1 + b j l .

The matrix is of the form

(8) a l = σ w l a l - 1 + b l .

Here, bjl denotes the bias value of the neuron j in the layer l, ajl denotes the activation value of the neuron j in the layer l, and w denotes the weights of two neurons in two adjacent layers.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f18

Figure 18Forward-propagation demonstration.

Download

3.6.2 Back-propagation algorithm

The back-propagation algorithm (Zhou, 2016), also known as the BP algorithm, addresses the need for an error back-propagation operation, as shown in Fig. 19.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f19

Figure 19Symbolic representation in back-propagation networks and algorithms.

Download

For a training example (xk,yk), we use the following equation:

(9) y ^ j k = f ( β j - θ j ) .

The input into the hidden-layer neuron h is calculated as follows:

(10) α h = i = 1 d ν i h x i .

The input into the output neuron j is calculated as follows:

(11) β j = h = 1 q ω h j b h .

The mean squared error is calculated as follows:

(12) E k = 1 2 j = 1 l y ^ j k - y j k 2 .

Given the error Ek, assuming a learning rate η, the weight updates are computed as follows:

(13) Δ w h j = - η E k ω h j .

Using the chain rule, we can compute the partial derivatives:

(14) E k ω h j = E k y ^ j k y ^ j k β j β j w h j .

Let

(15) g j = - E k y ^ j k y ^ j k β j = y ^ j k ( 1 - y ^ j k ) ( y j k - y ^ j k ) .

Then

(16) Δ w h j = - η E k ω n j = η g j b h .

Similarly,

(17) Δ θ j = - η g j .

3.7 Assessment of indicators

Performance metrics are a measure of how much generalizability a model has. During convolutional neural network training, evaluation metrics such as loss function, accuracy, confusion matrix, and receiver operating characteristic (ROC) curve are commonly used to evaluate the model for the next step of testing and optimization (Zhou, 2016). Through these evaluation metrics, we can quantitatively assess the robustness and generalization capabilities of the model, specifically its ability to maintain consistent performance when confronted with new test data.

3.7.1 Loss function

Loss refers to the difference between the predicted value and the true value when applying machine learning. There are two main types of loss functions commonly used in the task of performing classification: the cross-entropy loss (CEL) and the mean squared error (MSE).

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f20

Figure 20Curves of loss and accuracy (left: loss; right: accuracy).

Download

The cross-entropy loss is calculated as follows:

(18) L = 1 n i = 1 n - j = 1 c y i log p i ,

where n represents the training batch size, C represents the number of classes, y represents the sample labeling distribution, and p represents the model prediction distribution.

The mean square error loss is calculated as follows:

(19) MSE = i = 1 n y i - y j p 2 n .

3.7.2 Accuracy

Precision reflects the accuracy of the model created for prediction. Accuracy and error rate are opposite concepts of each other: the higher the accuracy, the better the performance of the designed model is proved. The specific accuracy and error rate are calculated as outlined below.

  • Accuracy.

    (20) acc = 1 m i = 1 m f x i = y i .

    In the above, yi denotes the true label corresponding to the ith sample in the test set, and f(xi) denotes its predicted label.

  • Error rate.

    (21) E = 1 m i = 1 m f x i y i = 1 - acc .

3.7.3 Confusion matrix

The confusion matrix can visualize the quality of machine learning model evaluation and thus determine how well the model performs. An example of the confusion matrix is shown in Table 4.

Table 4Confusion matrix example.

Download Print Version | Download XLSX

3.8 Data dynamic network characterization

Since the ReLU function is constant at 0 in the negative half-axis of the coordinate axis, the derivative is also constant at 0, resulting in the disappearance of the gradient in this region, a property known as unilateral suppression, which helps the model to learn a more efficient feature representation. Since the ReLU function has very good properties, this function is used as the activation function in the construction of the convolutional neural network in this chapter.

Table 5Structure parameters of the convolutional neural network model based on data energy characteristics.

Download Print Version | Download XLSX

The structural parameters of the convolutional neural network model based on energy analysis are shown in Table 5.

3.9 Synthesis based on dynamic network characterization of data

The structural parameters of the convolutional neural network model based on the dynamic network characteristics of the data are shown in Table 6.

Table 6Structure parameters of the convolutional neural network model based on data dynamic network characterization.

Download Print Version | Download XLSX

In the input into the convolutional neural network, the loss, accuracy, and confusion matrix obtained after 100 epochs of testing are depicted in Figs. 20 and 21. In Fig. 20a, the blue line represents the loss curve during training, and the orange line represents the loss curve during testing.

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f21

Figure 21Confusion matrix.

Download

https://ms.copernicus.org/articles/16/685/2025/ms-16-685-2025-f22

Figure 22Confusion matrix generated for the new test set.

Download

As the iteration time increases, the loss value decreases sharply. When the epochs reach 20, the loss value gets to around 1 %, and the good match of the loss curves for training and testing validates the good accuracy of the proposed methods and procedures for bearing defect identification. In Fig. 20b, the accuracy curve during iteration is provided. The accuracy is 95 % an epoch of 30 and 98 % at an epoch of 40. Around 100 % accuracy is obtained at epoch of 52. The significance of the proposed methods and procedures for bearing defect identification can be clearly proved and observed. The obtained results serve as a good proof of concept for the practical bearing defect identification and can be implemented in actual mechanical engineering.

The confusion matrix obtained from a new test set is depicted in Fig. 22. In the confusion matrixes shown in Figs. 21 and 22, the horizontal axis represents the predicted value; the vertical axis represents the true value; and labels 0, 1, 2, and 3, respectively, represent four defect types: normal, inner-ring defect, outer-ring defect, and rolling-element defect. In Fig. 22a, out of 180 test samples, 177 were accurately predicted, achieving an accuracy of 98.333 %. In Fig. 22b, 119 out of 120 test samples were correctly identified, resulting in an accuracy of 99.167 %. These high accuracy rates indicate that the model performs exceptionally well and can be reliably used in real-world bearing defect identification to precisely identify various types of bearing defects.

3.10 Model comparison

A comparison of model performance between the two feature extraction methods is presented in Table 7. The convolutional neural network model utilizing recurrence features achieved an average accuracy of 97.748 %. In contrast, the model employing dynamic network features attained a significantly higher average accuracy of 98.591 %. These findings indicate that the dynamic network, capturing the inherent dynamics of the system, provides an optimal feature representation. This offers a superior data preprocessing strategy for practical bearing fault diagnosis applications.

Table 7Comparison of two models.

Download Print Version | Download XLSX

4 Conclusions

In this study, we performed an effective bearing defect detection using the proposed feature enhancements of data dynamic networks. The research is summarized as follows.

In this paper, we proposed a data dynamic network for data enhancement for bearing defect identifications. Nodes and edges of data are constructed for the establishment of a data dynamic network. Together with the feature fusion techniques, the energy, recurrence rates, and amplitude–frequency spectrums are reconstructed in the time-delayed phase space for preparation of bearing defect identification. The accuracy of bearing defect identification reaches up to 98.5 %.

A convolutional neural network (CNN) model based on data recursive features and dynamic network features is designed. The effectiveness in bearing defect identification is verified through experiments. The generalizability and robustness of the intelligent neural network are improved. A more effective defect detection algorithm is explored.

The preliminary results of this paper contribute new ideas to the field of bearing defect identification. We hope that the paper can provide inspiration and promote the development of intelligent defect identifications of bearing defects.

Data availability

The data sets generated and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Author contributions

LH wrote the paper draft. TW designed the machine learning algorithm. MW supervised the research. DL did the data cutting and filtering. SL built the data dynamic network. FQ was responsible for the data drawing and schematics.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Financial support

This work is supported by the Science and Technology Plan Project of State Administration for Market Regulation of China (grant no. 2023MK199), the Key R&D and Transformation Plan Project of Qinghai Province (grant no. 2023-QY-215), the Science and Technology Program of CSEI (grant no. 2021XKTD013), and the National Key Research and Development Program of China (grant no. 2023YFC3010400).

Review statement

This paper was edited by Pengyuan Zhao and reviewed by two anonymous referees.

References

Chen, S. N., Li, L. P., Zhang, S. H., and Yiqing, Z.: EEMD-LSTM-Based Steam Turbine Rotor Rubbing defect identification Model and Its Engineering Application, Journal of Thermal Power Engineering, 38, 159–168, https://doi.org/10.16146/j.cnki.rndlgc.2023.08.020, 2023 (in Chinese with English abstract). 

CWRU data: https://engineering.case.edu/bearingdatacenter/download-data-file, last access: 17 January 2025. 

Ding, P., Xu, Y., Qin, P., and Sun, M.: A novel deep learning approach for intelligent bearing fault diagnosis under extremely small samples, Applied Intelligence 54, 5306–5316, https://doi.org/10.1007/s10489-024-05429-7, 2024. 

Eckmann, J. P., Kamphorst, S. O., and Ruelle, D.: Recurrence Plots of Dynamical Systems, Europhys. Lett., 4, 973–977, https://doi.org/10.1209/0295-5075/4/9/004, 1987. 

Jia, S. X., Sun, D. Y., Mao, G., and Li, Y. B.: Cross-Condition defect identification Method of Rotor System Based on Adversarial Entropy, Journal of Mechanical Engineering, 59, 110–120, https://doi.org/10.3901/JME.2023.15.110, 2023 (in Chinese with English abstract). 

LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521, 436–444, https://doi.org/10.1038/nature14539, 2015. 

Onias, H., Viol, A., and Palhano-Fontes, F.: Brain complex network analysis by means of resting state fMRI and graph analysis: will it be helpful in clinical epilepsy, Epilepsy & Behavior, 38, 71–80, https://doi.org/10.1016/j.yebeh.2013.11.019, 2013. 

Qiu, L. S., Fang, Z. Q., and Chen, L.: Dual-Channel Collaborative Clustering Algorithm for Heterogeneous Information Networks, Chinese Journal of Computers, 46, 2416–2430, 2023 (in Chinese with English abstract). 

Su, Y., Shi, L., Zhou, K., Bai, G., and Wang, Z.: Knowledge-informed deep networks for robust fault diagnosis of rolling bearings, Reliability Engineering & System Safety, 244, 109863, https://doi.org/10.1016/j.ress.2023.109863, 2024. 

Tang, H., Zhang, K., Wang, B., Zu, X., Li, Y., Feng, W., Jiang, X., Chen, P., and Li, Q.: Early bearing fault diagnosis for imbalanced data in offshore wind turbine using improved deep learning based on scaled minimum unscented Kalman filter, Ocean Engineering, 300, 117392, https://doi.org/10.1016/j.oceaneng.2024.117392, 2024. 

Vinck, M., Oostenveld, R., and Van Wingerden, M.: An improved index of phase-synchronization for electrophysiological data in the presence of volume-conduction, noise and sample-size bias, Neuroimage, 55, 1548–1565, https://doi.org/10.1016/j.neuroimage.2011.01.055, 2011. 

Wang, C. Y., Zheng, Z. L., Liu, T. Y., Yonghui, X., and Di, Z.: Research on Unbalance and Misalignment defect Detection Method of Steam Turbine Rotor Based on Convolutional Neural Network, Proceedings of the CSEE, 41, 2417–2427, https://doi.org/10.13334/j.0258-8013.pcsee.201353, 2013 (in Chinese with English abstract).  

Wang, M., Wang, T., Lu, D., and Cui, S.: Intelligent identification of Bearing Failures Based on Recurrence Quantification and Energy Difference, Appl. Sci., 14, 9643, https://doi.org/10.3390/app14219643, 2024. 

Wen, C., Xue, Y., Liu, W., Chen, G., and Liu, X.: Bearing fault diagnosis via fusing small samples and training multi-state Siamese neural networks, Neurocomputing, 576, 127355, https://doi.org/10.1016/j.neucom.2024.127355, 2024. 

Wu, J. Q., Chen, G., Ma, C. Y., and Ying W.: Automatic Detection and Feature Extraction of Oceanic Vortices in SAR Images Using Deep Learning, Journal of Ship Science and Technology, 45, 170–173, https://doi.org/10.3404/j.issn.1672-7649.2023.06.033, 2023 (in Chinese with English abstract). 

Wu, Y. H.: Research on Rotor Crack defect identification Technology of Large Steam Turbine Generator, Shanghai Jiao Tong University, 50–66, https://doi.org/10.27307/d.cnki.gsjtu.2020.001467, 2020 (in Chinese with English abstract). 

You, K., Wang, P., Huang, P., and Gu, Y.: A sound-vibration physical-information fusion constraint-guided deep learning method for rolling bearing fault diagnosis, Reliability Engineering & System Safety, 253, 110556, https://doi.org/10.1016/j.ress.2024.110556, 2025. 

Yu, W., Xu, Y., and Wang, T.: Intelligent classification of the abnormal features through time-delayed reconstruction of phase states, Journal of Applied Nonlinear Dynamics, 14, 859–874, https://doi.org/10.5890/JAND.2025.12.008, 2025. 

Zbilut, J. P. and Webber, C. L.: Recurrence Quantification Analysis, Wiley Encyclopedia of Biomedical Engineering, John Wiley & Sons, Inc., https://doi.org/10.1002/9780471740360.ebs1355, 2006. 

Zhang, Y. L., Qi, C., and Cheng, Y. P.: Rotor defect identification Based on DPSO-BP, Machine Tool & Hydraulics, 50, 194–199, https://doi.org/10.3969/j.issn.1001-3881, 2022 (in Chinese with English abstract). 

Zhang, Z. W., Cui, P., and Zhu, W. W.: Deep Learning on Graphs: A Survey, IEEE, https://doi.org/10.1109/TKDE.2020.2981333, 2020. 

Zhong, Z. X., Ma, L. Y., Cai, Z. H., Yijian D., and Jinhua D.: Unbalance defect identification Method of Rotor under Variable Speed Conditions Based on VMD-MPE and FCM Clustering, Journal of Vibration and Shock, 41, 290–298, https://doi.org/10.13465/j.cnki.jvs.2022.14.037, 2022 (in Chinese with English abstract). 

Zhou, Z.: Machine Learning. China: Tsinghua University Press, 101–106, ISBN 9787302423287, 2016 (in Chinese). 

Zhu, Z.: Research on Domain Adaptive Algorithm for Mechanical defect identification Based on Convolutional Neural Network, Harbin Institute of Technology, 10–12, https://doi.org/10.27061/d.cnki.ghgdu.2019.000730, 2024 (in Chinese). 

Download
Short summary
In this paper, we propose a data dynamic network for data enhancement for bearing defect identifications. Nodes and edges of data are constructed for the establishment of a data dynamic network. Together with the feature fusion techniques, the energy, recurrence rates, and amplitude–frequency spectra are reconstructed in the time-delayed phase space for preparation of bearing defect identification. The accuracy for bearing defect identification reaches up to 98.5 %.
Share