---
license: mit
datasets:
- ccmusic-database/erhu_playing_tech
language:
- en
metrics:
- accuracy
pipeline_tag: audio-classification
tags:
- music
- art
---

The Erhu Performance Technique Recognition Model is a deep-learning-based audio analysis tool that automatically distinguishes different techniques in erhu performance. By analyzing the acoustic characteristics of erhu music in depth, the model recognizes 11 basic playing techniques, including split bow, pad bow, overtone, continuous bow, glissando, big glissando, strike bow, pizzicato, throw bow, staccato bow, tremolo, and vibrato. Through time-frequency conversion, feature extraction, and pattern recognition, the model accurately categorizes the complex techniques of erhu performance, providing efficient technical support for music information retrieval, music education, and research on the art of erhu playing. The model not only enriches research in the field of music acoustics but also opens a new path for the preservation and innovation of traditional music.

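The time-frequency conversion step mentioned above can be sketched in plain NumPy (a minimal illustration only, not the model's actual front end; the FFT size and hop length here are arbitrary choices):

```python
import numpy as np

def log_spectrogram(audio, n_fft=1024, hop=256):
    """Time-frequency conversion: frame the signal, window, FFT, log-compress."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(audio) - n_fft) // hop
    frames = np.stack([audio[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    mag = np.abs(np.fft.rfft(frames, axis=1))  # (n_frames, n_fft // 2 + 1)
    return np.log1p(mag).T                     # (freq_bins, n_frames)

# example: a one-second 440 Hz tone at 22.05 kHz
sr = 22050
t = np.arange(sr) / sr
spec = log_spectrogram(np.sin(2 * np.pi * 440 * t))
print(spec.shape)  # (513, 83)
```

The resulting 2-D array is the kind of feature map that downstream pattern-recognition stages classify.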
## Demo
<https://huggingface.co/spaces/ccmusic-database/erhu-playing-tech>

## Usage
```python
from modelscope import snapshot_download
model_dir = snapshot_download('ccmusic-database/erhu_playing_tech')
```

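`snapshot_download` returns the local directory of the downloaded snapshot. To locate the weight files inside it programmatically, a small helper like the following works (the helper and its extension list are illustrative, not part of this repo):

```python
from pathlib import Path

def find_weight_files(model_dir):
    """Return the names of typical model-weight files inside a downloaded snapshot."""
    exts = {".bin", ".pt", ".pth", ".safetensors"}
    return sorted(p.name for p in Path(model_dir).rglob("*") if p.suffix in exts)

# e.g. find_weight_files(model_dir) after snapshot_download(...)
```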
## Maintenance
```bash
GIT_LFS_SKIP_SMUDGE=1 git clone git@hf.co:ccmusic-database/erhu_playing_tech
cd erhu_playing_tech
```

## Results
A demo result of Swin-S fine-tuned on Mel spectrograms:
<style>
  #pianos td {
    vertical-align: middle !important;
    text-align: center;
  }
  #pianos th {
    text-align: center;
  }
</style>
<table id="pianos">
  <tr>
    <th>Loss curve</th>
    <td><img src="./loss.jpg"></td>
  </tr>
  <tr>
    <th>Training and validation accuracy</th>
    <td><img src="./acc.jpg"></td>
  </tr>
  <tr>
    <th>Confusion matrix</th>
    <td><img src="./mat.jpg"></td>
  </tr>
</table>

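Swin-S, like other vision backbones, expects fixed-size image input, so a Mel spectrogram is typically normalized, resized, and replicated to three channels before classification. A rough NumPy sketch of that preprocessing, assuming the standard 224×224 Swin input resolution (an illustration, not the exact pipeline behind these results):

```python
import numpy as np

def spec_to_image(spec, size=224):
    """Min-max normalize a spectrogram and nearest-neighbor resize to (3, size, size)."""
    spec = (spec - spec.min()) / (spec.max() - spec.min() + 1e-8)
    rows = (np.arange(size) * spec.shape[0] / size).astype(int)
    cols = (np.arange(size) * spec.shape[1] / size).astype(int)
    img = spec[np.ix_(rows, cols)]                 # (size, size)
    return np.repeat(img[None, :, :], 3, axis=0)   # (3, size, size), values in [0, 1]

img = spec_to_image(np.random.rand(513, 83))
print(img.shape)  # (3, 224, 224)
```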
## Dataset
<https://huggingface.co/datasets/ccmusic-database/erhu_playing_tech>

## Mirror
<https://www.modelscope.cn/models/ccmusic-database/erhu_playing_tech>

## Evaluation
<https://github.com/monetjoe/ccmusic_eval>

## Cite
```bibtex
@dataset{zhaorui_liu_2021_5676893,
  author    = {Monan Zhou and Shenyang Xu and Zhaorui Liu and Zhaowen Wang and Feng Yu and Wei Li and Baoqiang Han},
  title     = {CCMusic: an Open and Diverse Database for Chinese and General Music Information Retrieval Research},
  month     = {mar},
  year      = {2024},
  publisher = {HuggingFace},
  version   = {1.2},
  url       = {https://huggingface.co/ccmusic-database}
}
```