Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,7 @@ Leveraging the multi-dimensional fine-grained annotations produced by our pipeli
|
|
| 9 |
|
| 10 |
> ποΈ **Input:** Raw Speech Audio   β   π **Output:** 14-Dimension Fine-Grained Speech Attributes (Structured JSON)
|
| 11 |
|
| 12 |
-
To overcome modality gaps and text-conditioned hallucinations, FM-Speech is trained using a **Progressive Curriculum Fine-Tuning** framework, decoupling complex auditory comprehension into three incremental stages: Warm-up (MCQ/QA)
|
| 13 |
|
| 14 |
### π Usage & Environment Setup
|
| 15 |
|
|
|
|
| 9 |
|
| 10 |
> ποΈ **Input:** Raw Speech Audio   β   π **Output:** 14-Dimension Fine-Grained Speech Attributes (Structured JSON)
|
| 11 |
|
| 12 |
+
To overcome modality gaps and text-conditioned hallucinations, FM-Speech is trained using a **Progressive Curriculum Fine-Tuning** framework, decoupling complex auditory comprehension into three incremental stages: Warm-up (MCQ/QA) --> Capability Ramp-up --> Final Alignment (Full JSON).
|
| 13 |
|
| 14 |
### π Usage & Environment Setup
|
| 15 |
|