Multimodal Bearing Health Management Dataset

⚡️ Download

Because of GitHub's file size limits, please download the data files from Hugging Face.

📚 Introduction

The MBHM dataset is the first multimodal dataset designed for the study of bearing health management. It has two parts: vibration signals and a health management corpus. The vibration signals and condition information are derived from 9 publicly available datasets. Because it covers thousands of working conditions, the dataset is more challenging for identification models and better reflects real-world usage scenarios.

In the dataset, vibration signals from different datasets have been converted to the same length (24000) by Discrete Cosine Normalization (DCN). For more information about the implementation of DCN, please refer to the paper or code.
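
To illustrate the idea of bringing signals of different lengths to a common length of 24000, here is a minimal sketch of one plausible DCT-based approach: transform the signal, truncate or zero-pad the coefficients, transform back, then scale to the range [-1, 1]. This is an assumption based on the name of the method, not the authors' implementation. The function name `dcn_sketch`, the orthonormal DCT-II, and the peak normalization are all illustrative choices; the paper and code remain the reference for the actual DCN steps.

```python
import numpy as np
from scipy.fft import dct, idct

TARGET_LENGTH = 24000  # unified signal length stated in this README


def dcn_sketch(signal, target_length=TARGET_LENGTH):
    """Hypothetical DCT-based length unification (not the authors' code)."""
    x = np.asarray(signal, dtype=np.float64)

    # Move to the DCT domain (orthonormal DCT-II is an assumed choice).
    coeffs = dct(x, type=2, norm="ortho")

    # Truncate or zero-pad the coefficient vector to the target length.
    resized = np.zeros(target_length, dtype=np.float64)
    keep = min(coeffs.size, target_length)
    resized[:keep] = coeffs[:keep]

    # Back to the time domain, now with exactly target_length samples.
    y = idct(resized, type=2, norm="ortho")

    # Scale so the largest absolute value is 1 (assumed normalization).
    peak = np.max(np.abs(y))
    return y / peak if peak > 0 else y
```

Under these assumptions, a 1000-sample signal and a 48000-sample signal both come out with shape `(24000,)`, so they can be stacked into one array.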

💻 Demo

We provide a demo script that shows how to load the MBHM dataset and print the data shapes. Please check the demo for more details.
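
Before running the demo, it can help to see what the HDF5 data file contains. The sketch below walks an HDF5 file with `h5py` and collects the shape and dtype of every dataset in it. It assumes nothing about the internal layout or key names of the file; the helper name `list_hdf5_datasets` is hypothetical and is not part of the provided demo.

```python
import h5py


def list_hdf5_datasets(path):
    """Return {dataset_name: (shape, dtype_string)} for every dataset in the file."""
    found = {}

    def visit(name, obj):
        # Groups are skipped; only datasets carry a shape and dtype.
        if isinstance(obj, h5py.Dataset):
            found[name] = (obj.shape, str(obj.dtype))

    # Open read-only so the data file cannot be modified by accident.
    with h5py.File(path, "r") as f:
        f.visititems(visit)
    return found
```

Calling `list_hdf5_datasets` on the path of the downloaded data file returns a dictionary that can be printed entry by entry. Only metadata is read, so this should stay fast even for a file of several gigabytes.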

📖 Citation

Please cite the following paper if you use this dataset in your research:

@article{pengBearLLMPriorKnowledgeEnhanced2025,
  title = {{{BearLLM}}: {{A Prior Knowledge-Enhanced Bearing Health Management Framework}} with {{Unified Vibration Signal Representation}}},
  author = {Peng, Haotian and Liu, Jiawei and Du, Jinsong and Gao, Jie and Wang, Wei},
  year = {2025},
  month = apr,
  journal = {Proceedings of the AAAI Conference on Artificial Intelligence},
  volume = {39},
  number = {19},
  pages = {19866--19874},
  issn = {2374-3468},
  doi = {10.1609/aaai.v39i19.34188},
  urldate = {2025-04-11},
}