vui / README.md

Update library tag for better download tracking and code snippets! (#1)

df6f32c verified 8 months ago

874 Bytes

	---
	license: mit
	language:
	- en
	pipeline_tag: text-to-speech
	library_name: vui

	---
	# vui

	[DEMO](https://fluxions.ai)

	https://github.com/fluxions-ai/vui

	Small Conversational speech models that can run on device

	# Installation

	```sh
	uv pip install -e .
	```

	# Demo

	```sh
	python demo.py
	````

	# Models

	Vui.BASE is base checkpoint trained on 40k hours of audio conversations
	Vui.ABRAHAM is a single speaker model that can reply with context awareness.
	Vui.COHOST is checkpoint with two speakers that can talk to each other.

	# Voice Cloning

	You can clone with the base model quite well but it's not perfect as hasn't seen that much audio / wasn't trained for long

	# FAQ

	1) Was developed with on two 4090's https://x.com/harrycblum/status/1752698806184063153
	2) Hallucinations: yes the model does hallucinate, but this is the best I could do with limited resources! :(