NVMOS / README.md
maimai11's picture
Release NVMOS SPEAR-L9 scorer
f1cffe5 verified
|
Raw
History Blame Contribute Delete
598 Bytes
metadata
license: apache-2.0
tags:
  - speech
  - audio
  - mos
  - non-verbal-vocalization
  - speech-quality-assessment

NVMOS SPEAR-L9 Scorer

This repository hosts the released NVMOS downstream scorer checkpoint for non-verbal vocalization quality assessment.

Files:

  • nvmos_spear_l9.pt: PyTorch state dict for the text-query cross-attention scorer.
  • config.json: inference configuration, including upstream encoder model IDs and scorer dimensions.
  • training_run_config.json: training-time configuration record.

The full inference code is available at https://github.com/yongaifadian1/NVMOS.