admin
commited on
Commit
·
f55609b
1
Parent(s):
72b46ac
upd md
Browse files
README.md
CHANGED
|
@@ -2,64 +2,62 @@
|
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
|
| 5 |
-
# Intro
|
| 6 |
The Chinese Traditional Instrument Sound Model is a cutting-edge research outcome based on the Chinese Traditional Instrument Sound Dataset. This model covers recordings of over 200 types of Chinese traditional musical instruments, modified instruments, and ethnic minority instruments. Notably, it includes some instruments that are rarely seen among the general public in China. By conducting in-depth analysis and learning of the sound characteristics of these instruments, the model can accurately identify and classify the sounds of various instruments, providing strong technical support for music information retrieval, musical instrument research, and the preservation and inheritance of ethnic music.
|
| 7 |
|
| 8 |
-
|
| 9 |
-
|
| 10 |
-
## Demo 在线演示
|
| 11 |
<https://huggingface.co/spaces/ccmusic-database/CTIS>
|
| 12 |
|
| 13 |
-
## Usage
|
| 14 |
```python
|
| 15 |
from modelscope import snapshot_download
|
| 16 |
model_dir = snapshot_download("ccmusic-database/CTIS")
|
| 17 |
```
|
| 18 |
|
| 19 |
-
## Maintenance
|
| 20 |
```bash
|
| 21 |
git clone git@hf.co:ccmusic-database/CTIS
|
| 22 |
cd CTIS
|
| 23 |
```
|
| 24 |
|
| 25 |
-
## Results
|
| 26 |
-
| Backbone | Size(M) | Mel |
|
| 27 |
-
| :-----------: | :-----: | :---------: |
|
| 28 |
-
| vit_l_32 | 306.5 | 0.936 |
|
| 29 |
-
| swin_t | 28.3 | **_0.956_** |
|
| 30 |
-
| | | |
|
| 31 |
-
| regnet_y_32gf | 145 | **_0.973_** | [**_0.980_**](#best-result
|
| 32 |
-
| vgg19_bn | 143.7 | 0.966 |
|
| 33 |
-
| alexnet | 61.1 | 0.936 |
|
| 34 |
-
| resnet101 | 44.5 | 0.953 |
|
| 35 |
-
| inception_v3 | 27.2 | 0.860 |
|
| 36 |
|
| 37 |
-
### Best result
|
| 38 |
<table>
|
| 39 |
<tr>
|
| 40 |
<th>Loss curve</th>
|
| 41 |
-
<td><img src="https://www.modelscope.cn/
|
| 42 |
</tr>
|
| 43 |
<tr>
|
| 44 |
<th>Training and validation accuracy</th>
|
| 45 |
-
<td><img src="https://www.modelscope.cn/
|
| 46 |
</tr>
|
| 47 |
<tr>
|
| 48 |
<th>Confusion matrix</th>
|
| 49 |
-
<td><img src="https://www.modelscope.cn/
|
| 50 |
</tr>
|
| 51 |
</table>
|
| 52 |
|
| 53 |
-
## Dataset
|
| 54 |
<https://huggingface.co/datasets/ccmusic-database/CTIS>
|
| 55 |
|
| 56 |
-
## Mirror
|
| 57 |
<https://www.modelscope.cn/models/ccmusic-database/CTIS>
|
| 58 |
|
| 59 |
-
## Evaluation
|
| 60 |
<https://github.com/monetjoe/ccmusic_eval>
|
| 61 |
|
| 62 |
-
## Cite
|
| 63 |
```bibtex
|
| 64 |
@dataset{zhaorui_liu_2021_5676893,
|
| 65 |
author = {Monan Zhou, Shenyang Xu, Zhaorui Liu, Zhaowen Wang, Feng Yu, Wei Li and Baoqiang Han},
|
|
|
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
|
| 5 |
+
# Intro
|
| 6 |
The Chinese Traditional Instrument Sound Model is a cutting-edge research outcome based on the Chinese Traditional Instrument Sound Dataset. This model covers recordings of over 200 types of Chinese traditional musical instruments, modified instruments, and ethnic minority instruments. Notably, it includes some instruments that are rarely seen among the general public in China. By conducting in-depth analysis and learning of the sound characteristics of these instruments, the model can accurately identify and classify the sounds of various instruments, providing strong technical support for music information retrieval, musical instrument research, and the preservation and inheritance of ethnic music.
|
| 7 |
|
| 8 |
+
## Demo
|
|
|
|
|
|
|
| 9 |
<https://huggingface.co/spaces/ccmusic-database/CTIS>
|
| 10 |
|
| 11 |
+
## Usage
|
| 12 |
```python
|
| 13 |
from modelscope import snapshot_download
|
| 14 |
model_dir = snapshot_download("ccmusic-database/CTIS")
|
| 15 |
```
|
| 16 |
|
| 17 |
+
## Maintenance
|
| 18 |
```bash
|
| 19 |
git clone git@hf.co:ccmusic-database/CTIS
|
| 20 |
cd CTIS
|
| 21 |
```
|
| 22 |
|
| 23 |
+
## Results
|
| 24 |
+
| Backbone | Size(M) | Mel | CQT | Chroma |
|
| 25 |
+
| :-----------: | :-----: | :---------: | :-------------------------: | :---------: |
|
| 26 |
+
| vit_l_32 | 306.5 | 0.936 | 0.921 | **_0.845_** |
|
| 27 |
+
| swin_t | 28.3 | **_0.956_** | **_0.940_** | 0.759 |
|
| 28 |
+
| | | | | |
|
| 29 |
+
| regnet_y_32gf | 145 | **_0.973_** | [**_0.980_**](#best-result) | 0.848 |
|
| 30 |
+
| vgg19_bn | 143.7 | 0.966 | 0.965 | **_0.852_** |
|
| 31 |
+
| alexnet | 61.1 | 0.936 | 0.921 | 0.661 |
|
| 32 |
+
| resnet101 | 44.5 | 0.953 | 0.949 | 0.782 |
|
| 33 |
+
| inception_v3 | 27.2 | 0.860 | 0.855 | 0.664 |
|
| 34 |
|
| 35 |
+
### Best result
|
| 36 |
<table>
|
| 37 |
<tr>
|
| 38 |
<th>Loss curve</th>
|
| 39 |
+
<td><img src="https://www.modelscope.cn/models/ccmusic-database/CTIS/resolve/master/regnet_y_32gf_cqt_2024-12-02_15-05-57/loss.jpg"></td>
|
| 40 |
</tr>
|
| 41 |
<tr>
|
| 42 |
<th>Training and validation accuracy</th>
|
| 43 |
+
<td><img src="https://www.modelscope.cn/models/ccmusic-database/CTIS/resolve/master/regnet_y_32gf_cqt_2024-12-02_15-05-57/acc.jpg"></td>
|
| 44 |
</tr>
|
| 45 |
<tr>
|
| 46 |
<th>Confusion matrix</th>
|
| 47 |
+
<td><img src="https://www.modelscope.cn/models/ccmusic-database/CTIS/resolve/master/regnet_y_32gf_cqt_2024-12-02_15-05-57/mat.jpg"></td>
|
| 48 |
</tr>
|
| 49 |
</table>
|
| 50 |
|
| 51 |
+
## Dataset
|
| 52 |
<https://huggingface.co/datasets/ccmusic-database/CTIS>
|
| 53 |
|
| 54 |
+
## Mirror
|
| 55 |
<https://www.modelscope.cn/models/ccmusic-database/CTIS>
|
| 56 |
|
| 57 |
+
## Evaluation
|
| 58 |
<https://github.com/monetjoe/ccmusic_eval>
|
| 59 |
|
| 60 |
+
## Cite
|
| 61 |
```bibtex
|
| 62 |
@dataset{zhaorui_liu_2021_5676893,
|
| 63 |
author = {Monan Zhou, Shenyang Xu, Zhaorui Liu, Zhaowen Wang, Feng Yu, Wei Li and Baoqiang Han},
|