NVMOS / README.md
maimai11's picture
Release NVMOS SPEAR-L9 scorer
f1cffe5 verified
|
Raw
History Blame Contribute Delete
598 Bytes
---
license: apache-2.0
tags:
- speech
- audio
- mos
- non-verbal-vocalization
- speech-quality-assessment
---
# NVMOS SPEAR-L9 Scorer
This repository hosts the released NVMOS downstream scorer checkpoint for non-verbal vocalization quality assessment.
Files:
- `nvmos_spear_l9.pt`: PyTorch state dict for the text-query cross-attention scorer.
- `config.json`: inference configuration, including upstream encoder model IDs and scorer dimensions.
- `training_run_config.json`: training-time configuration record.
The full inference code is available at https://github.com/yongaifadian1/NVMOS.