 Original Article
 Open access
 Published:
RealTime Intelligent Diagnosis of Cofrequency Vibration Faults in Rotating Machinery Based on LightweightConvolutional Neural Networks
Chinese Journal of Mechanical Engineering volumeÂ 37, ArticleÂ number:Â 41 (2024)
Abstract
The cofrequency vibration fault is one of the common faults in the operation of rotating equipment, and realizing the realtime diagnosis of the cofrequency vibration fault is of great significance for monitoring the health state and carrying out vibration suppression of the equipment. In engineering scenarios, cofrequency vibration faults are highlighted by rotational frequency and are difficult to identify, and existing intelligent methods require more hardware conditions and are exclusively timeconsuming. Therefore, Lightweightconvolutional neural networks (LWCNN) algorithm is proposed in this paper to achieve realtime fault diagnosis. The critical parameters are discussed and verified by simulated and experimental signals for the sliding window data augmentation method. Based on LWCNN and data augmentation, the realtime intelligent diagnosis of cofrequency is realized. Moreover, a realtime detection method of fault diagnosis algorithm is proposed for data acquisition to fault diagnosis. It is verified by experiments that the LWCNN and sliding window methods are used with high accuracy and realtime performance.
1 Introduction
Cofrequency vibration, which is dominated by the rotational vibration component, is the most common form of vibration defects in rotating machinery. Cofrequency vibration accounts for approximately 70% of rotating equipment vibration failures, according to statistical data. When a severe vibration defect arises and the vibration signal is measured to be dominated by cofrequency components, balancing is often performed onsite or the rotor is sent to the factory. If the underlying source of a vibration defect is not identified, the vibration fault cannot be eradicated after reassembly, which not only wastes production time but is also inefficient. In addition, incorrect assembly might introduce new flaws, which further compounds the problem. With the growth of intelligent equipment, selfdiagnosis of the equipment has become a necessity [1, 2]. Selfrecovery regulatory systems such as active balancing and automatic balancing algorithms and shafting alignment algorithm to suppress and locate excessive vibration require realtime fault diagnostics [3, 4].
A variety of faults will increase the cofrequency vibration, such as imbalance, misalignment, and looseness. In engineering contexts, it is simple to attribute cofrequency vibration faults to unbalance, making it easy to identify the fault state but challenging to identify the fault category. Methods for rotating equipment problem diagnosis can be split into three categories: Vibration model, signal processing, and intelligent algorithm [5]. The methods based on vibration models, such as employing the finite element method to establish the dynamics equations of the rotorbearingfoundation systems [6, 7], and identifying faults by calculating the residual forces of faults [8], but the kinetic parameters of various equipment vary greatly and are partially random. Fault diagnosis methods based on signal processing, such as the empirical mode decomposition (EMD) algorithm, decomposing vibration signals into intrinsic mode functions (IMFs) [9], and ensemble empirical mode decomposition (EEMD) [10], need to be improved in different application scenarios [11,12,13], and the process of extracting IMFs is timeconsuming in loop iteration.
Combining the intelligent algorithm's adaptability and processing precision with the signal processing approach makes the algorithm more applicable. Support vector machine (SVM) and convolutional neural network (CNN) are two intelligent defect diagnosis algorithms. Linear SVM is a type of binary classification model whose fundamental model is a linear classifier with the biggest interval in the feature space, which has the advantages of broad applicability and quick training. Pei et al. [14] introduced a joint optimization of SVM and nondominated sorting genetic algorithm II for the target, and the performance of the jointly optimized classifier was at least 99.9%. Ahmad et al. [15] proposed two new factors for axis misalignment that occurs during overload and employed SVM for fault identification, which has a high degree of accuracy in practice. In the fields of image processing and fault diagnostics, the traditional CNN has found widespread application. Both Wang et al. [16] and Li et al. [17] detected motor bearing vibration fault signals under changing loads using CNN as a classifier. Nowadays, gearbox and bearing fault diagnosis comprise the majority of rotating equipment fault diagnosis [11, 18, 19], and these fault signal characteristics are accompanied by shock signals.
The majority of fault diagnosis studies utilizing deep learning approaches focus on fault signals with impact characteristics, whereas realtime diagnostic investigations for cofrequency faults have not been published domestically or internationally. The cofrequency faults of rotating machinery, such as unbalance, misalignment, and looseness, are frequently characterized by a single spectral characteristic, vibration without shock, and vibration energy values that are close to one another under the same working condition, making them difficult to differentiate using conventional spectrograms. As explained in Section 2.1, the reasons of misalignment fault are different, but the misalignment fault is evident from the vibration mechanism [8]. In the meantime, there is always more than one problem particular to distinct rotor system damping, stiffness, and installation conditions, thus the effect of the spectrum graph is obscure. The classification method employs machine learning and deep learning techniques to extract the signal characteristics of these cofrequency defects from a highdimensional space or a deep network. In engineering settings, the majority of the data collected throughout the collection process is normal data, while the fault data is rather tiny, necessitating the need for logical data augmentation. The initial vibration signal in the time domain is intercepted by sliding window processing and can be utilized immediately for classifier. Significant sliding window data augmentation method parameters are explored and statistically analyzed in this article.
Lightweightconvolutional neural networks (LWCNN) is proposed in this paper, which minimizes the number of layers and parameters to accomplish realtime diagnosis. The CNN structure consists primarily of input and output layers as well as an intermediate convolutionalpooling layer. The LWCNN suggested in this study, however, reduces the number of convolutional pooling layers while maintaining accuracy.
This paper is organized as follows: In Section 2, a summary of the mechanism of cofrequency faults in rotating machinery is made; meanwhile, the principle of the cofrequency fault diagnosis algorithm LWCNN and SVM is introduced. In Section 3, the method of realtime diagnosis of cofrequency faults and its evaluation method are proposed, as well as the method of data preprocessing. In Section 4, developed experiments are used to validate the performance of the realtime intelligent diagnosis approach based on the LWCNN. Finally, the conclusions are in Section 5.
2 Rotating Machinery Failure Mechanism and Intelligent Algorithms
The common rotating equipments such as pumps, compressors, fans, etc., are composed of four parts: rotor system, bearing support, seals and other accessories. The rotor is supported by the bearing in the process of movement, the impeller interact with fluid materials, the motor rotor is affected by electromagnetic field and coupling. In short, the rotorbearing system is affected by centrifugal force, gravity, magnetic force, and interaction between rigid bodies and fluid. The causes of the cofrequency vibration are complex and often result from the comprehensive action of many factors. The common causes are as shown in Figure 1.
The cofrequency vibration of rotating machinery is not always caused by rotor unbalance but also may be due to poor alignment, bearing problems, improper clearance of bearing shell, or other reasons. The cofrequency vibration excitation force mainly comes from the radial rotation vector caused by rotor shaft system, bearing failure, interaction between the rotor and stator or transmission parts, shell vibration and resonance, etc. The locations of the rotor system unbalance, misalignment and looseness fault are shown in Figure 2.
2.1 Comparison Faults Mechanism and Actual Faults Signal
Unbalance: Typical fault of radial rotation vector caused by the rotor itself is unbalance fault. When the rotor has an unbalance fault, i.e., the center of mass and rotation do not coincide, the dynamic unbalance caused by centrifugal force always has obvious response in each rotation. The intense response at the rotor bending critical speed, which is also related to the unbalance position. The mass eccentric m causes the bearing to suffer the same rotational periodic \(\omega\) force, as shown in Eq. (1), where e is the eccentricity.
Misalignment: Rotating machinery needs couplings to transmit forces and moments. The axis often has a certain misalignment, when the rotor is running under loads. In general, rotors always have center and angular misalignment, and couplings can overcome misalignment to a lesser extent. However, if the misalignment exceeds a certain level, the performance of the bearing will be affected.
Any point in the coupling moves around the center of the shaft from a to b by an angle \(\omega\mathrm{^{\prime}} t\), the offset is \(\Delta y\).
Then, the trajectory of O'Â is expressed as follows and is shown in Figure 3.
Expression of exciting force:
In the case of only central misalignment, the radial vibration frequency caused by the misaligned excited rotor is twice the rotational frequency, with a large of harmonic components.
Looseness: It is a kind of fault caused by the existence of gaps or insufficient rigidity of the connection at the joint surface of the rotor support system, resulting in low mechanical damping and excessive vibration in operation. Looseness, unbalance, and misalignment faults can all lead to cofrequency vibrations in the rotor system. Loose rotor support parts make the connection stiffness and mechanical damping decrease, assuming that the gap is \(\Delta\), the mass converted to the rotor disc is recorded as \(C_{0}\), the rotor's equation of motion is as follows:
where x is the displacement of disc mass, \(k\) is the rotor support stiffness, and \(M\) is the disc mass.
Harmonic analysis of the above equation shows that the rotor support system is nonlinear, the vibration response contains the cofrequency signal, and 2X,Â 3\({\mathbf{X}}\) and other high and low harmonics, in which X is the rotational frequency.
However, although all the above analyses provide a reference direction for the cofrequency fault diagnosis, the actual collected cofrequency fault signals are unsatisfactory. As shown in Figure 4, the high amplitude frequency is basically concentrated in power frequency and 2X. The amplitude changes of different measurement points under the same fault and the amplitude changes of the same measurement point under different faults are irregular.
2.2 Intelligent Diagnosis Algorithm
CNN and SVM are classical classification methods in the fault diagnosis field. Different convolutional structures, convolutional kernels number, convolutional kernel steps, the size of fully connected layers, and different normalization methods, etc., form different CNN. SVM combines other probabilistic conditions [20] or, after combining multiple SVMs, forms different improved SVM [11, 21]. The following briefly describes CNN and SVM principles and signal processing.
2.2.1 CNN and LWCNN
CNN, a kind of neural network containing convolution computation, gets feature maps by convolution window sliding. After that, the feature map used to represent input data features will be obtained, then the pooling calculation is carried out, which calculate through a sliding window without weight.
The 2D convolution is defined as follows:
where z is the result of the convolution of x on \(k(u,v)\), i and j are the two directions respectively. For an l convolutional layer, given the input feature map size is \((m \times m)\), the convolutional kernel size is \((n \times n)\), and the output of the layer is:
Using average pooling method, the weight of each convolution kernel is \(\lambda^{l + 1}\), and the output of the pooling layer with convolution adds the bias as shown in Eq.Â (7):
The pooling layer has two functions, one is to allow further feature extraction to reduce the number of weight parameters, and the other is to make the input to the network with translation without distortion. Key differences between CNNs and traditional neural networks are reflected in local connectivity, shared weights, and spatial pooling [22]. These differences also allow CNN to have better translation invariance, fewer parameters, and better generalization performance of the model compared to previous neural networks.
CNN was widely used in computer vision and natural language recognition and are also prevalent in fault diagnosis. In recent years, CNN has made great progress in its development, among which AlexNet, VGGNet, GoogLeNet, and ResNet [23,24,25,26] are all representative deep networks based on CNN. These methods have hundreds of layers, billions of parameters and require a long training cycle and expensive hardware, thus not acceptable for researchers who want to know test results quickly. The fewer the layers of neural networks, the simpler the structure, the fewer the training parameters, and the shorter the training time, the shorter the prediction time of the trained network, so the lightweight neural network should be considered.
The CNN model mainly consists of convolutional, pooling, and fully connected layers, and Figure 5 shows a LWCNN model with five hidden layers after the vibration signal is processed. As shown in Figure 4, the 5 hidden layers of CNN are the convolutional layer Conv 1 (first hidden layer), Pooling layer 1 (second hidden layer), convolutional layer Conv 2 (third hidden layer), Pooling layer 2 (fourth hidden layer) and fully connected (FC) layer (fifth hidden layer).
2.2.2 SVM
Assume that the input space and feature space of SVM are two different spaces, where the input space is an Euclidean space or discrete set and the feature space is an Euclidean space or Hilbert space. Assume that the elements of the two spaces, linear separable SVM and linear SVM, are in onetoone correspondence, and map the inputs of the input space to the feature vectors of the feature space.
Rotor fault features are basically nonlinear, and a nonlinear SVM, i.e., a ticked SVM, is required to achieve fault type classification. Assuming that there exists a mapping relationship \(\varphi\) from the input space to the Hilbert feature space, the kernel function satisfies Eq. (8):
The input space corresponds to the Hilbert space, i.e., Î—, so that the original highdimensional hypersurface in the input space is transformed into a hyperplane in the feature space, thus achieving nonlinear distribution classification [27]. For SVM with kernel functions, the process is essentially the same as for linear SVM, and the presence of the mapping relationship makes the hyperplane equation, as shown in Eq. (9):
Similar to linear SVM, the optimization model Eq. (10) at this time can be found:
where \(\omega\) isÂ the margin, as shown in Figure 6, \(\varphi\) is the linear expression and b is the linear expression intercept. To obtain the dual problem under nonlinearity, the inner product of \(x_{i}\) and \(x_{j}\) can be equal to the value in the original sample space by the kernel function \(K\left( {x_{i} ,x_{j} } \right)\), which greatly reduces the computational difficulty, and the dual problem is shown in Eq. (11):
After solving \(\alpha\) by the above equation, \(\omega\) and \(b\) can be found subsequently. The solution leads to the hyperplane equation:
From the above calculation process, the appropriate form of \(\varphi (x)\) is crucial for the kernel function \(K\). It is known from the kernel theorem that any kernel function implicitly defines a reproducing kernel Hilbert feature space. The choice of kernel function causes the biggest difference in SVM. The commonly used kernel functions are linear kernel function, polynomial kernel function, Gaussian kernel function, Laplacian kernel, and Sigmoid kernel function, among which the RBF kernel function is the most commonly used kernel function with strong adaptability and the low number of parameters. With the development of machine learning, SVM as a common classification method has been used extensively, and for a more detailed derivation of formulas in SVM in Ref. [28].
3 RealTime Diagnosis Method of CoFrequency Faults
3.1 Basic Principle
As mentioned above, the actual cofrequency vibration faults do not have obvious characteristics in time and frequency domains, and the types of faults are often not directly determined by spectrum maps. However, deep learning methods are able to extract deep features and implicit features of faults, and these features do not give a specific physical meaning interpretation because the feature extraction process in deep learning methods is a blackbox process. The cofrequency fault diagnosis method using the deep learning method is shown in Figure 7. When the raw data is preprocessed, the bad points in the measurement process are removed, and the data is collated into a suitable format and sent to the model for training.
The traditional offline fault diagnosis often relies on specialized knowledge to extract signal or mechanism features. However, these feature extractions often require an iterative processfor each diagnosis process. In contrast, realtime fault algorithms are more timeconsuming during training, but the diagnosis is very fast after the training is completed and is fully realtime.
3.2 Data Preprocessing
After normalizing the continuous vibration signals \(S_{1} ,S_{2} , \cdots ,S_{n}\), the data augmentation process is performed on \(S_{i}\). The rotor speed is n, the sampling frequency during the experiment is Fs, and the row and each sampleâ€™s row and column size is \(N_{f}\). The sample size \(N_{f}^{2}\) is the nearest positive integer of sampling points per full turn at theÂ rotate speed \(n\).
The window slides on the timedomain signals \(S_{i}\), and the feature can be ignored when the window slides to the end of the last occurrence of insufficient points. Different move steps and after several iterations can generate a large amount of data to train the model, as shown in Figure 8. In the process of sliding the window to generate data, the size of sliding window step and window size are two important parameters, and the sliding window step is expressed as the ratio of window step and window size (RWW). The correlation coefficient (CC) of the timedomain signal is used to quantitatively characterize the correlation of the data before and after the data augmentation. If the autocorrelation coefficient is close to 1 and the interrelationship is low, the data after data augmentation can be used. From statistical knowledge, the relationship between the magnitude of the correlation coefficient and the strength of the correlation is shown in Table 1.
Two different simulated signals are considered to vary mainly in terms of frequency, amplitude, initial phase and noise intensity. The simulated Signal \(i\) is characterized by the following equation:
where \(d_{i}\) is the offset and \(n_{i}\) denotes the signal noise, which is expressed by different signaltonoise ratios (SNR) in the simulation experiment. Two sets of simulated signals, Sig. 1 and Sig. 2, are set up with the parameters shown in Table 2, and time domain plots are shown in Figure 9. The results of the correlation analysis done for the signals under different RWWs are shown in Figure 10. From Figure 10, the autocorrelation coefficients of the two signals under different RWW are above 0.8, and the mutual correlation number is about 0.5, which is medium to low correlation. Therefore, the requirements for the data can be satisfied under different RWWs.
3.3 RealTime Performance Evaluation Method
In order to verify the applicability of the proposed algorithm to online fault diagnosis, the realtime performance of the algorithm needs to be tested. The opensource environment of Python, TensorFlow, and other modules are updated and iterated rapidly to facilitate further improvement of algorithms. In this paper, the algorithm is divided into the following parts: normalization of the original data, transformation into standard data, and the classification process in the classifier.
Aiming to test the realtime performance of the algorithm, the number of samples needs to be changed. There are 3 ways to increase the sample points: a) Add one point each time in the sample features based on a certain number from the original data points; b) add one sample feature size (L_{2}) each time based on a certain number from the original data points; c) add the number of sampling points (L_{1}) in each sampling time. The fault diagnosis time within each sampling time is tested to ensure that the algorithm time is much less than the sampling time. Figure 11 illustrates the above 3 methods.
4 Experimental Verification
Specific experiments are used to verify the above process. The experimental procedure is shown in Figure 12.
4.1 Experiment Preparation
The rotor test bench shown in Figure 13 mainly verifies this experimental verification. The test bench is driven by a motor, and the firstorder critical speed of the actual test bench rotor is about 2700 r/min. Both ends of the rotor are supported by sliding bearings, and various faults are simulated on the experimental bench. A simple diagram of the test bench is shown in Figure 13(b), which clearly shows the location of the fault and sensor position. The experimental data was collected by BH7000. Misalignment fault is simulated and measured by adjusting a micrometer screw at the coupling position; unbalance fault is to add a fixed mass of counterweight block to the 0 phase; loose fault is to loosen the screw on the bearing support side of the rear end, the test bench fault simulation is shown in Figure 14.
The actual faults often do not occur in isolation, and the same type of failure also has different causes of failure; in most cases, one fault is dominant, and others coexist. Hence the dynamic balance and alignment of the test rig should be completed before fault data collection, regarded as experimental data only if the test bench vibration value is in the acceptable range. After each fault simulation test, the fault is removed instead of repeating the dynamic balancing and alignment experiments on the test bench, and the trial continues with the following fault. By this operation, each major fault contains more than one faults.
Misalignment fault point is set close to the coupling position because the vibration intensity at the free end is greater when the rotor vibrates, also in loose fault experiment. For unbalance fault, the same unbalance is added to the middle of the shaft position to provoke greater vibration. To ensure that the unbalance fault in the experiment is not significantly different from other faults, the unbalance fault is applied at the motor side. The parameters of each fault in this experiment are shown in Table 3 and experiment is conducted at a constant speed of 1800 r/min (slightly below the firstorder critical speed). The sampling frequency Fs and sampling points Ns are 25.6 kHz and 16384, respectively. The amount of data collected is shown in Table 4.
4.2 Experimental Data Processing
After processing the data sampled from different faults, different measurement points, and the same time point, the time domain waveforms are shown in Figure 15, where CH1âˆ’CH4 respectively represent sensors 1ha, 1va, 2ha, 2va. The time domain waveforms are filtered by sampling moving average, filtering higher frequency. The time domain waveform from filtered data are shown in Figure 16. The frequency domain plot is shown in Section 2.1. Based on the time domain and frequency spectrum, the 3 faults are highly similar.
As mentioned in the previous section, the sliding window method is adopted for data argumentation, the window size in this experiment is 900. The training and test sets use using 3/4 and 1/2 RWW, and the validation set uses using 1/3 RWW for one of the measurement points, so the size of the resulting data set is shown in Table 5. The autocorrelation coefficients and intercorrelation numbers under different RWWs are shown in Figures 17 and 18.
From Figures 17 and 18, it can be concluded that the augmented data with different faults are welldistinguished in 2D space. The similarity between different faults is weak or even extremely weak, while the similarity between the same faults is in medium and high similarity. In Figure 17, the autocorrelation coefficient of the 2va measurement point is always slightly lower than the other measurement points. This is because, compared to the other measurement points, the 2va is further away from the fault position, the fault signal transmission path is more complex and the fault vibration signal is attenuated, resulting in a smaller signal autocorrelation coefficient. The high correlation coefficient values between the unbalance fault and the loose fault in Figure 18(b) are mainly because both have similar fault magnitude and characteristics. Therefore, the data set after data augmentation can be used for further diagnosis.
4.3 Fault Diagnosis Verification
The experiments were performed on Python 3.7.10, in a 64bit Window environment with a RTX 2080Ti GPU. The CNN is performed on Keras 2.3.1, TensorFlowgpu2.2.0 environment, and the SVM uses scikitlearn0.24.1. The SVM and two different CNN structure parameters and specific parameter values are shown in Tables 6 and 7.
In the actual fault diagnosis process, the amount of fault data is often small, and the suddenness of the fault requires the diagnosis method to be lightweight. The algorithm needs to be able to maintain a high correct rate even with little data. In order to solve this practical problem, the following work was done in this experiment during the fault diagnosis. As can be seen from Table 7, compared to the multiCNN, the LWCNN reduces one layer of convolutional pooling and keeps the other parameter structure unchanged. The final total parameters are reduced from 3.37 million to 1.3 million.

a)
The difference between the two CNNs is that the MultiCNN has more network layers and parameters. In order to compare the different layers of the CNN network, two CNN are designed respectively as LWCNN and multiCNN. Thereinto, multiCNN has more hidden layers and more parameters.

b)
The algorithm does complete training, testing, and validation at a gradual and uniform increase of the data volume from 25% to 100% of the total data volume.

c)
To avoid accidental error, the algorithms must be repeated 10 times on the same data.
For more quantitatively describing the classification performance of different algorithms under different data volumes, the F1 value, a common classification evaluation parameter in machine learning, is used for a more comprehensive evaluation. For a specific binary classification problem, the samples are labeled with positive or negative labels (1 or 0), and the algorithm outputs a predicted label after training. Compared with the actual label, the labeled results have four cases, as shown in Figure 19, which are recorded as true positive (TP), false positive (FP), false negative (FN), false negative (FP), and true negative (TN). The accuracy of the algorithm's classification (Eq. (15)) and the checkall rate (Eq. (17)) are often two contradictory quantities, and the summed average F1 score of the two (Eq. (18)) is used to do the evaluation, as shown in Eqs. (15), (16), (17), (18):
The performance of each method on the training, test, and validation sets is shown in Table 8, and the specific performance on the test set is shown in Figures 20, 21, 22, 23. In Table 8, the overall accuracy of SVM is above 85%. The accuracy performance of SVM improves by about 10% as the data volume increases, and the F1 value also improves by about 10% as the data volume increases. In contrast, the performance of the CNN network is stable with increased data volume. At 25% data volume, the accuracy of LWCNN is above 95%, most of which are at 99% level with excellent performance, and the F1 score is above 0.98 on average. By analyzing Figures 20, 21, 22, 23, the fault diagnosis accuracy of CNN under the four data amounts is basically above 95%, which has excellent performance. However, in SVM, the imbalance and misalignment diagnosis accuracy are below 85% with 100% data volume or less. Regardingparameter number, the LWCNN has 1.3 million, and the multilayer CNN has 3.37 million in comparison.
After processing the time domain signal with the sliding window method, the different cofrequency vibration faults have low correlation values with each other. The signal processing stage not only improves the ability to discriminate between different vibration fault forms but also achieves the purpose of data augmentation. According to the data analysis in Section 4.3, it is obvious that CNN performs better when the data volume is small, with a better F1 score and better generalization performance. For CNN, although LWCNN has one less convolutional layer and pooling layer than multiCNN, the performance is no worse than multiCNN. Therefore, the data processing method time domain truncation method produces data features with better adaptability to different faults and can achieve better results using lightweight neural networks.
4.4 RealTime Analysis of Diagnostic Methods
As described in Section 3.3, the feature size in this experiment is 900, the sampling time is 0.64 s, and the sampling points is 16384. Therefore, the 3 methods are tested separately in tensorflow environment as shown in Figure 24.
In fact, the sample is increased using the 3 methods before mentioned and repeat 10 times. The final result is shown in Figure 24. For method (a), because the window size and the moving step are fixed in the data processing stage, the amount of standard data entering the classifier does not change even though the number of points increases in a certain range; for method (b), 900 points are added because the size of the reference window is set, and not much standard data is added after data processing. The data amount added is small for methods (a) and (b). The time change is not visible due to the short time of the algorithm itself, which is regarded as a normal phenomenon because of the short time and the small amount of sample change. Method (c) is considered in one sampling time, which conforms to the actual fault diagnosis process. From Figure 24, the average processing time is about 0.01 s, which can fully meet the realtime requirements of fault diagnosis compared with the actual sampling time of 0.64 s.
As it was considered unlikely that a high performance GPU would be provided in a real application, the running time of the algorithm was measured on the CPU (4core i51135G7) using method (c), as shown in Figure 25. The average processing time per data set on the CPU is 0.05 s, which still meets the algorithm's realtime requirements. Of course, the running time of the algorithm is also affected by other factors such as programming language, program structure, computer hardware, and computer environment. Hence, the measured time varies from time to time. Even if the above variations exist, it does not change our conclusion that the algorithm is realtime.
5 Conclusions
Cofrequency faults account for a large proportion of the actual factory production process. In order to solve the problem that the characteristics of the cofrequency vibration signal are not obvious and the diagnosis is difficult, a realtime fault extraction method based on LWCNN is proposed in this paper. The following conclusions verified by experiments are as follows:

(1)
A complete fault diagnosis process is proposed for the three most common cofrequency faults in rotor faults: Imbalance, misalignment, and looseness, and correlation coefficients are proposed for the data enhancement part of it to quantitatively assess the correlation before and after data enhancement.

(2)
Using CNN and SVM for fault diagnosis with different data volumes, the classification performance of CNN is better than SVM. The average classification accuracy of LWCNN is above 95%, and using LWCNN can also reduce the training parameters and speed up the training speed.

(3)
The diagnosis time of the LWCNNbased realtime fault diagnosis method is tested using the measured fault diagnosis time method, and the results show that the time is 0.01 s on the highperformance GPU and 0.05 s on the CPU, both of which can meet the realtime requirements.
This is a public attempt to use data datadriven method to diagnose rotor cofrequency faults in real time. The deep learning method used in this paper is a relatively simple algorithm, and many more cuttingedge and esoteric algorithms are yet to be further applied in this direction to finally achieve highperformance, fast, and accurate diagnosis of rotor cofrequency faults.
Availability of Data And Materials
The datasets supporting the conclusions of this article are included within the article.
References
J J Gao. Artificial selfrecovery and machinery selfrecovery regulation system. Chinese Journal of Mechanical Engineering, 2018: 8394.
R Liu, B Yang, E Zio, et al. Artificial intelligence for fault diagnosis of rotating machinery: A review. Mechanical Systems and Signal Processing, 2018, 108: 3347.
X Pan, H T He, H Q Wu. Optimal design of novel electromagneticring active balancing actuator with radial excitation. Chinese Journal of Mechanical Engineering, 2021, 34: 9.
G Lai, J Liu, F Zeng. Application of multiobjective genetic algorithm in ship shafting alignment optimization. 2017 10th International Symposium on Computational Intelligence and Design (ISCID), Hangzhou, China, December 910, 2017: 275278.
X Ji, Y Ren, H Tang, et al. DSmTbased threelayer method using multiclassifier to detect faults in hydraulic systems. Mechanical Systems and Signal Processing, 2021, 153: 107513.
W M Wang, J J Gao, Z N Jiang. Theoretical and experimental study on the faul selfrecovering system of high speed turbo machinery. Journal of Vibration and Shock, 2006, 25(4): 3033.
M Lal. Modeling and estimation of speed dependent bearing and coupling misalignment faults in a turbine generator system. Mechanical Systems and Signal Processing, 2021, 151: 107365.
A K Jalan, A R Mohanty. Model based fault diagnosis of a rotorâ€“bearing system for misalignment and unbalance under steadystate condition. Journal of Sound and Vibration, 2009, 327(35): 604622.
N E Huang, Z Shen, S R Long, et al. The empirical mode decomposition and the Hilbert spectrum for nonlinear and nonstationary time series analysis. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 1998, 454(1917): 903995.
Z Wu, N E Huang. Ensemble empirical mode decomposition: A noiseassisted data analysis method. Advances in Adaptive Data Analysis, 2011, 1(1): 141.
S Zhou, S Qian, W Chang, et al. A novel bearing multifault diagnosis approach based on weighted permutation entropy and an improved SVM ensemble classifier. Sensors (Basel), 2018, 18(6): 1934.
H C Wu, J Zhou, C H Xie, et al. Twodimensional time series sample entropy algorithm: Applications to rotor axis orbit feature identification. Mechanical Systems and Signal Processing, 2021, 147: 107123.
A Tabrizi, L Garibaldi, A Fasana, et al. Early damage detection of roller bearings using wavelet packet decomposition, ensemble empirical mode decomposition and support vector machine. Meccanica, 2015, 50(3): 865874.
M Pei, H Li, H Yu. Degradation state identification for hydraulic pumps based on multiscale ternary dynamic analysis, NSGAII and SVM. Measurement Science Review, 2021, 21(3): 8292.
B Ahmad, B K Mishra, M Ghufran, et al. Intelligent predictive maintenance model for rolling components of a machine based on speed and vibration. 2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), Republic of Korea, Jeju Island, April 2023, 2021: 459464.
X B Wang, L Luo, L Tang, et al. Automatic representation and detection of fault bearings in inwheel motors under variable load conditions. Advanced Engineering Informatics, 2021, 49: 101321.
C Li, K Yang, H Tang, et al. Fault diagnosis for rolling bearings of a freight train under limited fault data: fewshot learning method. Journal of Transportation Engineering, Part A: Systems, 2021, 147(8): 04021041.
M Cerrada, R V SÃ¡nchez, C Li, et al. A review on datadriven fault severity assessment in rolling bearings. Mechanical Systems and Signal Processing, 2018, 99(15): 169196.
B Liu, S Riemenschneider, Y Xu. Gearbox fault diagnosis using empirical mode decomposition and Hilbert spectrum. Mechanical Systems and Signal Processing, 2006, 20(3): 718734.
J H Zhong, P K Wong, Z X Yang. Fault diagnosis of rotating machinery based on multiple probabilistic classifiers. Mechanical Systems and Signal Processing, 2018, 108: 99114.
J D MartÃnezMorales, E R PalaciosHernÃ¡ndez, D U CamposDelgado. Multiplefault diagnosis in induction motors through support vector machine classification at variable operating conditions. Electrical Engineering, 2016, 100: 5973.
I Goodfellow, Y Bengio, A Courville. Deep learning. Massachusetts: MIT press, 2016.
A Krizhevsky, I Sutskever, G Hinton. Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 2012, 25: 10971105.
K Simonyan, A Zisserman. Very deep convolutional networks for largescale image recognition, 2014, arXiv: 1409.1556.
C Szegedy, W Liu, Y Jia, et al. Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Massachusetts, USA, June 712, 2015: 19.
K He, X Zhang, S Ren, et al. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, June 26July 1, 2016: 770778.
Z Yin, J Hou. Recent advances on SVM based fault diagnosis and process monitoring in complicated industrial processes. Neurocomputing, 2016, 174(22): 643650.
A Widodo, B S Yang. Support vector machine in machine condition monitoring and fault diagnosis. Mechanical Systems And Signal Processing, 2007, 21(6): 25602574.
Funding
Supported by National Natural Science Foundation of China (Grant Nos. 51875031, 52242507), Beijing Municipal Natural Science Foundation of China (Grant No. 3212010) and Beijing Municipal Youth Backbone Personal Project of China (Grant No. 2017000020124 G018).
Author information
Authors and Affiliations
Contributions
XP and ZJ were in charge of the whole trial; XZ and XP wrote the manuscript and analysed the data; GB and JN assisted with sampling and laboratory analyses. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics Approval and Consent to participate
Not applicable
Consent for Publication
Not applicable
Competing Interests
The authors declare no competing financial interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pan, X., Zhang, X., Jiang, Z. et al. RealTime Intelligent Diagnosis of Cofrequency Vibration Faults in Rotating Machinery Based on LightweightConvolutional Neural Networks. Chin. J. Mech. Eng. 37, 41 (2024). https://doi.org/10.1186/s10033024010219
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1186/s10033024010219