Update README.md
Browse files
README.md
CHANGED
|
@@ -1,7 +1,7 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
language:
|
| 4 |
-
-
|
| 5 |
- en
|
| 6 |
tags:
|
| 7 |
- text-to-speech
|
|
@@ -12,19 +12,19 @@ tags:
|
|
| 12 |
- bitsandbytes
|
| 13 |
- 4bit
|
| 14 |
- quantized
|
| 15 |
-
base_model: aoi-ot/VibeVoice-Large
|
| 16 |
datasets:
|
| 17 |
- heiertech/vibevoice-norwegian-mcv
|
| 18 |
pipeline_tag: text-to-speech
|
| 19 |
---
|
| 20 |
|
| 21 |
-
#
|
| 22 |
|
| 23 |
-
A 4-bit quantized version of
|
| 24 |
|
| 25 |
## Model Description
|
| 26 |
|
| 27 |
-
This model is a bitsandbytes 4-bit (NF4) quantized version of [heiertech/
|
|
|
|
| 28 |
|
| 29 |
### Quantization Details
|
| 30 |
|
|
@@ -91,7 +91,7 @@ bnb_config = BitsAndBytesConfig(
|
|
| 91 |
)
|
| 92 |
|
| 93 |
model = VibeVoiceForConditionalGenerationInference.from_pretrained(
|
| 94 |
-
"heiertech/
|
| 95 |
quantization_config=bnb_config,
|
| 96 |
device_map="auto",
|
| 97 |
torch_dtype=torch.bfloat16,
|
|
@@ -115,14 +115,4 @@ with torch.no_grad():
|
|
| 115 |
)
|
| 116 |
|
| 117 |
audio = outputs.speech_outputs[0] # 24kHz audio
|
| 118 |
-
```
|
| 119 |
-
|
| 120 |
-
## Related Models
|
| 121 |
-
|
| 122 |
-
- [heiertech/vibevoice-7b-nob](https://huggingface.co/heiertech/vibevoice-7b-nob) - LoRA adapter
|
| 123 |
-
- [heiertech/vibevoice-7b-nob-lora-merged](https://huggingface.co/heiertech/vibevoice-7b-nob-lora-merged) - Full bf16 merged model
|
| 124 |
-
- [aoi-ot/VibeVoice-Large](https://huggingface.co/aoi-ot/VibeVoice-Large) - Original base model
|
| 125 |
-
|
| 126 |
-
## License
|
| 127 |
-
|
| 128 |
-
Apache 2.0
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
language:
|
| 4 |
+
- 'no'
|
| 5 |
- en
|
| 6 |
tags:
|
| 7 |
- text-to-speech
|
|
|
|
| 12 |
- bitsandbytes
|
| 13 |
- 4bit
|
| 14 |
- quantized
|
|
|
|
| 15 |
datasets:
|
| 16 |
- heiertech/vibevoice-norwegian-mcv
|
| 17 |
pipeline_tag: text-to-speech
|
| 18 |
---
|
| 19 |
|
| 20 |
+
# Prat-9b-nob (4-bit Quantized)
|
| 21 |
|
| 22 |
+
A 4-bit quantized version of Prat-9b-nob fine-tuned for Norwegian text-to-speech synthesis.
|
| 23 |
|
| 24 |
## Model Description
|
| 25 |
|
| 26 |
+
This model is a bitsandbytes 4-bit (NF4) quantized version of [heiertech/Prat-9b-nob](https://huggingface.co/heiertech/Prat-9b-nob),
|
| 27 |
+
which was fine-tuned from [vibevoice/VibeVoice-7b](https://huggingface.co/aoi-ot/VibeVoice-Large) on Norwegian speech data.
|
| 28 |
|
| 29 |
### Quantization Details
|
| 30 |
|
|
|
|
| 91 |
)
|
| 92 |
|
| 93 |
model = VibeVoiceForConditionalGenerationInference.from_pretrained(
|
| 94 |
+
"heiertech/Prat-9b-nob-bnb-4bit",
|
| 95 |
quantization_config=bnb_config,
|
| 96 |
device_map="auto",
|
| 97 |
torch_dtype=torch.bfloat16,
|
|
|
|
| 115 |
)
|
| 116 |
|
| 117 |
audio = outputs.speech_outputs[0] # 24kHz audio
|
| 118 |
+
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|