fluxions
/

vui

speech-synthesis

Model card Files Files and versions

harrycb commited on Jun 5, 2025

Commit

7fed7ad

·

verified ·

1 Parent(s): e7282ac

Update README.md

Files changed (1) hide show

README.md +36 -3

README.md CHANGED Viewed

@@ -1,3 +1,36 @@
----
-license: mit
----

+---
+license: mit
+language:
+- en
+pipeline_tag: text-to-speech
+---
+# vui
+Small Conversational speech models that can run on device
+# Installation
+```sh
+uv pip install -e .
+```
+# Demo
+```sh
+python demo.py
+````
+# Models
+Vui.BASE is base checkpoint trained on 40k hours of audio conversations
+Vui.ABRAHAM is a single speaker model that can reply with context awareness.
+Vui.COHOST is checkpoint with two speakers that can talk to each other.
+# Voice Cloning
+You can clone with the base model quite well but it's not perfect as hasn't seen that much audio / wasn't trained for long
+# FAQ
+1) Was developed with on two 4090's https://x.com/harrycblum/status/1752698806184063153
+2) Hallucinations: yes the model does hallucinate, but this is the best I could do with limited resources! :(