Update README.md
Browse files
README.md
CHANGED
|
@@ -12,17 +12,11 @@ language:
|
|
| 12 |
- nb
|
| 13 |
---
|
| 14 |
|
| 15 |
-
# Prat-9B
|
| 16 |
|
| 17 |
-
A Norwegian (Bokmal) text-to-speech model fine-tuned for the
|
| 18 |
-
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
This model was trained using a progressive 3-stage fine-tuning approach:
|
| 22 |
-
|
| 23 |
-
1. **Stage 1**: Initial Norwegian (Bokmal) training on Mozilla Common Voice
|
| 24 |
-
2. **Stage 2**: Continued training on broader Norwegian data
|
| 25 |
-
3. **Stage 3**: Dialect-specific fine-tuning for Ostnorsk/Oslo dialect
|
| 26 |
|
| 27 |
## Usage
|
| 28 |
|
|
@@ -41,7 +35,9 @@ outputs = model.generate(**inputs)
|
|
| 41 |
|
| 42 |
## Base Model
|
| 43 |
|
| 44 |
-
This model is
|
|
|
|
|
|
|
| 45 |
|
| 46 |
## Acknowledgments
|
| 47 |
|
|
|
|
| 12 |
- nb
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# Prat-9B (preview)
|
| 16 |
|
| 17 |
+
A Norwegian (Bokmal) text-to-speech model fine-tuned for the Østnorsk/Oslo dialect.
|
| 18 |
+
This model is currently in preview, You can expect things like weird artefacts,
|
| 19 |
+
But generally, per our testing, it outperforms VibeVoice 7B per our unscientific qualitative eval.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
|
| 21 |
## Usage
|
| 22 |
|
|
|
|
| 35 |
|
| 36 |
## Base Model
|
| 37 |
|
| 38 |
+
This model is based on [VibeVoice-7B](https://huggingface.co/vibevoice/VibeVoice-7B).
|
| 39 |
+
Note that despite the name, VibeVoice-7B is actually a 9B parameter model.
|
| 40 |
+
The 7B only refers to the size of the llm backbone based on Qwen2.5 7B
|
| 41 |
|
| 42 |
## Acknowledgments
|
| 43 |
|