File size: 3,656 Bytes
27d53c0 1f0f8be addbaf0 531073e bd79386 bb2be69 dec0721 d105531 dec0721 bb2be69 dee43bc bb2be69 34320bf bb2be69 34320bf bb2be69 34320bf bb2be69 34320bf bb2be69 34320bf dec0721 d105531 dec0721 9726b10 d105531 9726b10 d105531 9726b10 d105531 9726b10 bb2be69 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 |
---
language:
- en
base_model:
- sesame/csm-1b
- senstella/csm-expressiva-1b
- meta-llama/Llama-3.2-1B
- Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct
- fixie-ai/ultravox-v0_5-llama-3_2-1b
pipeline_tag: text-to-speech
---
**The model supports multilingual transcription, but voice output is only in English or English-like languages.**
Models:
CSM: [sesame/csm-1b](https://huggingface.co/sesame/csm-1b)
CSM-EXPRESSIVA(WHISPERING & NO VC): [senstella/csm-expressiva-1b](https://huggingface.co/senstella/csm-expressiva-1b)
LLAMA: [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B)
LLAMA-VIKHR: [Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct](https://huggingface.co/Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct)
LLAMA-ULTRAVOX: [fixie-ai/ultravox-v0_5-llama-3_2-1b](https://huggingface.co/fixie-ai/ultravox-v0_5-llama-3_2-1b)
### CSM:
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm/examples/conversational_a.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm/examples/conversational_b.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm/examples/read_speech_a.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm/examples/read_speech_b.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm/examples/read_speech_c.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm/examples/read_speech_d.wav?download=true" type="audio/mpeg">
</audio>
### CSM-EXPRESSIVA(WHISPERING & NO VC):
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/csm-expressiva/examples/demo.wav?download=true" type="audio/mpeg">
</audio>
### LLAMA:
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama/real-examples/audio.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama/real-examples/audio_1(VC).wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama/real-examples/audio_2(VC).wav?download=true" type="audio/mpeg">
</audio>
### LLAMA-VIKHR:
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama-vikhr/real-examples/audio.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama-vikhr/real-examples/audio_1(VC).wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama-vikhr/real-examples/audio_2(VC).wav?download=true" type="audio/mpeg">
</audio>
### LLAMA-ULTRAVOX:
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama-ultravox/real-examples/audio.wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama-ultravox/real-examples/audio_1(VC).wav?download=true" type="audio/mpeg">
</audio>
<audio controls>
<source src="https://huggingface.co/Derur/csm-models/resolve/main/llama-ultravox/real-examples/audio_2(VC).wav?download=true" type="audio/mpeg">
</audio> |