Bearing Fault Diagnosis Based on Image Information Fusion and Vision Transformer Transfer Learning Model

Zhang, Zichen; Li, Jing; Cai, Chaozhi; Ren, Jianhua; Xue, Yingfang

doi:10.3390/app14072706

Applied Sciences

2024

DOI: 10.3390/app14072706

|View full text |Cite

Bearing Fault Diagnosis Based on Image Information Fusion and Vision Transformer Transfer Learning Model

Zichen Zhang,

Jing Li,

Chaozhi Cai

et al.

Abstract: In order to improve the accuracy of bearing fault diagnosis under a small sample, variable load, and noise conditions, a new fault diagnosis method based on an image information fusion and Vision Transformer (ViT) transfer learning model is proposed in this paper. Firstly, the method applies continuous wavelet transform (CWT), Gramian angular summation field (GASF), and Gramian angular difference field (GADF) to the time series data, and generates three grayscale images. Then, the generated three grayscale ima… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article2

Relationship

Self Cite0

Independent2

Authors

Journals

Cited by 2 publications

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

TSViT: A Time Series Vision Transformer for Fault Diagnosis of Rotating Machinery

Zhang,

Zhou,

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

Efficient and accurate fault diagnosis of rotating machinery is extremely important. Fault diagnosis methods using vibration signals based on convolutional neural networks (CNNs) have become increasingly mature. They often struggle with capturing the temporal dynamics of vibration signals. To overcome this, the application of Transformer-based Vision Transformer (ViT) methods to fault diagnosis is gaining attraction. Nonetheless, these methods typically require extensive preprocessing, which increases computational complexity, potentially reducing the efficiency of the diagnosis process. Addressing this gap, this paper presents the Time Series Vision Transformer (TSViT), tailored for effective fault diagnosis. The TSViT incorporates a convolutional layer to extract local features from vibration signals alongside a transformer encoder to discern long-term temporal patterns. A thorough experimental comparison of three diverse datasets demonstrates the TSViT’s effectiveness and adaptability. Moreover, the paper delves into the influence of hyperparameter tuning on the model’s performance, computational demand, and parameter count. Remarkably, the TSViT achieves an unprecedented 100% average accuracy on two of the test sets and 99.99% on the other, showcasing its exceptional fault diagnosis capabilities for rotating machinery. The implementation of this model will bring significant economic benefits.

show abstract

TSViT: A Time Series Vision Transformer for Fault Diagnosis of Rotating Machinery

Zhang,

Zhou,

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

show abstract

A Wind Power Combination Forecasting Method Based on GASF Image Representation and UniFormer

Guo,

Xu,

Zhao

et al. 2024

JMSE

View full text Add to dashboard Cite

In the field of wind power prediction, traditional methods typically rely on one-dimensional time-series data for feature extraction and prediction. In this study, we propose an innovative short-term wind power forecasting approach using a “visual” 2D image prediction method that effectively utilizes spatial pattern information in time-series data by combining wind power series and related environmental features into a 2D GASF image. Firstly, the wind power data are decomposed using the ICEEMDAN algorithm optimized by the BWO (Beluga Whale Optimization) algorithm, extracting the submodal IMF (Intrinsic Mode Function) components with different frequencies. Then, modal reconstruction is performed on the basis of the permutation entropy value of the IMF components, selecting meteorological features highly correlated with reconstructed components through Spearman correlation analysis for data splicing and superposition before converting them into GASF images. Finally, the GASF images are input into the UniFormer model for wind power sequence prediction. By leveraging wind power data predictions from a coastal wind farm in East China and Sotavento in Spain, this study demonstrates the significant benefits and potential applications of this methodology for precise wind power forecasting. This research combines the advantages of image feature extraction and time-series prediction to offer novel perspectives and tools for predicting renewable energy sources such as wind power.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Bearing Fault Diagnosis Based on Image Information Fusion and Vision Transformer Transfer Learning Model

Cited by 2 publications

References 37 publications

TSViT: A Time Series Vision Transformer for Fault Diagnosis of Rotating Machinery

TSViT: A Time Series Vision Transformer for Fault Diagnosis of Rotating Machinery

A Wind Power Combination Forecasting Method Based on GASF Image Representation and UniFormer

Contact Info

Product

Resources

About