unilight
/

sheet-models

Model card Files Files and versions

unilight commited on Sep 26, 2025

Commit

8676e76

·

verified ·

1 Parent(s): c18bc7a

Update README.md

Files changed (1) hide show

README.md +27 -7

README.md CHANGED Viewed

@@ -6,13 +6,15 @@ license: mit
 This model card describes the models implemented in the [SHEET](https://github.com/unilight/sheet) toolkit trained using the training sets in MOS-Bench and benchmarked using the test sets in MOS-Bench.
 ## Model Details
 - **Developed by:** Wen-Chin Huang
 - **Model type:** SSL-MOS or AlignNet
 - **License:** MIT
 - **Repository:** [SHEET](https://github.com/unilight/sheet)
-- **Paper:** to be uploaded
 - **Demo :** https://huggingface.co/spaces/unilight/sheet-demo
 ## Uses
@@ -39,21 +41,39 @@ Please refer to the [`egs` folder in the sheet repo](https://github.com/unilight
 #### Metrics
-Please refer to the MOS-Bench paper (to be uploaded) for details.
 ### Results
-Please refer to the MOS-Bench paper (to be uploaded) for details.
 ## Citation
 **BibTeX:**
-To be updated.
-<!-- ## Glossary [optional] -->
-<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
 ## Model Card Contact

 This model card describes the models implemented in the [SHEET](https://github.com/unilight/sheet) toolkit trained using the training sets in MOS-Bench and benchmarked using the test sets in MOS-Bench.
+The task is subjective speech quality assessment (SSQA), which aims to predict the perceptual quality score of speech.
 ## Model Details
 - **Developed by:** Wen-Chin Huang
 - **Model type:** SSL-MOS or AlignNet
 - **License:** MIT
 - **Repository:** [SHEET](https://github.com/unilight/sheet)
+- **Paper:** [[SHEET](https://arxiv.org/abs/2505.15061)] [[MOS-Bench (arXiv; 2024)](https://arxiv.org/abs/2411.03715)]
 - **Demo :** https://huggingface.co/spaces/unilight/sheet-demo
 ## Uses
 #### Metrics
+Commonly used metrics for SQA are MSE, LCC, SRCC and KTAU. A code snippet for calculating them can be found here: https://gist.github.com/unilight/883726c94640cca1f4d4068e29c3d20f
+Please refer to the [MOS-Bench (arXiv; 2024)](https://arxiv.org/abs/2411.03715) paper for details.
 ### Results
+Please refer to the [MOS-Bench (arXiv; 2024)](https://arxiv.org/abs/2411.03715) paper for details.
 ## Citation
 **BibTeX:**
+```
+@inproceedings{sheet,
+  title     = {{SHEET: A Multi-purpose Open-source Speech Human Evaluation Estimation Toolkit}},
+  author    = {Wen-Chin Huang and Erica Cooper and Tomoki Toda},
+  year      = {2025},
+  booktitle = {{Proc. Interspeech}},
+  pages     = {2355--2359},
+}
+@article{huang2024,
+      title={MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models},
+      author={Wen-Chin Huang and Erica Cooper and Tomoki Toda},
+      year={2024},
+      eprint={2411.03715},
+      archivePrefix={arXiv},
+      primaryClass={cs.SD},
+      url={https://arxiv.org/abs/2411.03715},
+}
+```
 ## Model Card Contact