File size: 3,351 Bytes
70ba70b 72b46ac f55609b 72b46ac c82d4fe 72b46ac f55609b 72b46ac 04c1c4d 72b46ac f55609b 72b46ac 04c1c4d 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac f55609b 72b46ac cdf33db 858bbed 638b360 7b61f83 638b360 7b61f83 638b360 72b46ac |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 |
---
license: mit
---
# Intro
The Chinese Traditional Instrument Sound Model is a cutting-edge research outcome based on the Chinese Traditional Instrument Sound Dataset. This model covers recordings of over 200 types of Chinese traditional musical instruments, modified instruments, and ethnic minority instruments. Notably, it includes some instruments that are rarely seen among the general public in China. By conducting in-depth analysis and learning of the sound characteristics of these instruments, the model can accurately identify and classify the sounds of various instruments, providing strong technical support for music information retrieval, musical instrument research, and the preservation and inheritance of ethnic music.
## Demo (inference code)
<https://huggingface.co/spaces/ccmusic-database/CTIS>
## Usage
```python
from huggingface_hub import snapshot_download
model_dir = snapshot_download("ccmusic-database/CTIS")
```
## Maintenance
```bash
GIT_LFS_SKIP_SMUDGE=1 git clone git@hf.co:ccmusic-database/CTIS
cd CTIS
```
## Results
| Backbone | Size(M) | Mel | CQT | Chroma |
| :-----------: | :-----: | :---------: | :-------------------------: | :---------: |
| vit_l_32 | 306.5 | 0.936 | 0.921 | **_0.845_** |
| swin_t | 28.3 | **_0.956_** | **_0.940_** | 0.759 |
| | | | | |
| regnet_y_32gf | 145 | **_0.973_** | [**_0.980_**](#best-result) | 0.848 |
| vgg19_bn | 143.7 | 0.966 | 0.965 | **_0.852_** |
| alexnet | 61.1 | 0.936 | 0.921 | 0.661 |
| resnet101 | 44.5 | 0.953 | 0.949 | 0.782 |
| inception_v3 | 27.2 | 0.860 | 0.855 | 0.664 |
### Best result
<table>
<tr>
<th>Loss curve</th>
<td><img src="https://www.modelscope.cn/models/ccmusic-database/CTIS/resolve/master/regnet_y_32gf_cqt_2024-12-02_15-05-57/loss.jpg"></td>
</tr>
<tr>
<th>Training and validation accuracy</th>
<td><img src="https://www.modelscope.cn/models/ccmusic-database/CTIS/resolve/master/regnet_y_32gf_cqt_2024-12-02_15-05-57/acc.jpg"></td>
</tr>
<tr>
<th>Confusion matrix</th>
<td><img src="https://www.modelscope.cn/models/ccmusic-database/CTIS/resolve/master/regnet_y_32gf_cqt_2024-12-02_15-05-57/mat.jpg"></td>
</tr>
</table>
## Dataset
<https://huggingface.co/datasets/ccmusic-database/CTIS>
## Mirror
<https://www.modelscope.cn/models/ccmusic-database/CTIS>
## Evaluation
<https://github.com/monetjoe/ccmusic_eval>
## Cite
```bibtex
@article{Zhou-2025,
author = {Monan Zhou and Shenyang Xu and Zhaorui Liu and Zhaowen Wang and Feng Yu and Wei Li and Baoqiang Han},
title = {CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research},
journal = {Transactions of the International Society for Music Information Retrieval},
volume = {8},
number = {1},
pages = {22--38},
month = {Mar},
year = {2025},
url = {https://doi.org/10.5334/tismir.194},
doi = {10.5334/tismir.194}
}
``` |