herimor/voxtream2

@@ -1,18 +1,23 @@
 ---
-license: cc-by-4.0
 datasets:
 - amphion/Emilia-Dataset
 - nvidia/hifitts-2
 language:
 - en
 pipeline_tag: text-to-speech
 tags:
 - text-to-speech
 ---
 # Model Card for VoXtream2
-VoXtream2 is a zero-shot full-stream TTS model with dynamic speaking-rate control that can be updated mid-utterance on the fly.
 ### Key features
@@ -22,10 +27,10 @@ VoXtream2 is a zero-shot full-stream TTS model with dynamic speaking-rate contro
 ### Model Sources
-- **Repository:** [repo](https://github.com/herimor/voxtream)
-- **Paper:** [paper](https://arxiv.org/pdf/2603.13518)
-- **Demo Page:** [demo page](https://herimor.github.io/voxtream2)
-- **Live Demo:** [live demo](https://huggingface.co/spaces/herimor/voxtream2)
 ## Get started
@@ -84,7 +89,7 @@ The model was trained on [Emilia](https://huggingface.co/datasets/amphion/Emilia
 ## Citation
-```
 @inproceedings{torgashov2026voxtream,
   title={Vo{X}tream: Full-Stream Text-to-Speech with Extremely Low Latency},
   author={Torgashov, Nikita and Henter, Gustav Eje and Skantze, Gabriel},

 ---
 datasets:
 - amphion/Emilia-Dataset
 - nvidia/hifitts-2
 language:
 - en
+license: cc-by-4.0
 pipeline_tag: text-to-speech
+library_name: voxtream
 tags:
 - text-to-speech
+- zero-shot
+- streaming
 ---
 # Model Card for VoXtream2
+VoXtream2 is a zero-shot full-stream TTS model with dynamic speaking-rate control that can be updated mid-utterance on the fly. It was introduced in the paper [VoXtream2: Full-stream TTS with dynamic speaking rate control](https://huggingface.co/papers/2603.13518).
+**Developed by:** Nikita Torgashov, Gustav Eje Henter, Gabriel Skantze
 ### Key features
 ### Model Sources
+- **Repository:** [https://github.com/herimor/voxtream](https://github.com/herimor/voxtream)
+- **Paper:** [https://huggingface.co/papers/2603.13518](https://huggingface.co/papers/2603.13518)
+- **Demo Page:** [https://herimor.github.io/voxtream2](https://herimor.github.io/voxtream2)
+- **Live Demo:** [https://huggingface.co/spaces/herimor/voxtream2](https://huggingface.co/spaces/herimor/voxtream2)
 ## Get started
 ## Citation
+```bibtex
 @inproceedings{torgashov2026voxtream,
   title={Vo{X}tream: Full-Stream Text-to-Speech with Extremely Low Latency},
   author={Torgashov, Nikita and Henter, Gustav Eje and Skantze, Gabriel},