ASLP-lab
/

FM-Speech

Audio Classification

Model card Files Files and versions

ASLP-lab commited on 7 days ago

Commit

026f52a

·

verified ·

1 Parent(s): cffb5e1

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ Leveraging the multi-dimensional fine-grained annotations produced by our pipeli
 > 🎙️ **Input:** Raw Speech Audio &emsp; ➔ &emsp; 📊 **Output:** 14-Dimension Fine-Grained Speech Attributes (Structured JSON)
-To overcome modality gaps and text-conditioned hallucinations, FM-Speech is trained using a **Progressive Curriculum Fine-Tuning** framework, decoupling complex auditory comprehension into three incremental stages: Warm-up (MCQ/QA) $\rightarrow$ Capability Ramp-up $\rightarrow$ Final Alignment (Full JSON).
 ### 🚀 Usage & Environment Setup

 > 🎙️ **Input:** Raw Speech Audio &emsp; ➔ &emsp; 📊 **Output:** 14-Dimension Fine-Grained Speech Attributes (Structured JSON)
+To overcome modality gaps and text-conditioned hallucinations, FM-Speech is trained using a **Progressive Curriculum Fine-Tuning** framework, decoupling complex auditory comprehension into three incremental stages: Warm-up (MCQ/QA) --> Capability Ramp-up --> Final Alignment (Full JSON).
 ### 🚀 Usage & Environment Setup