metadata
title: SPITITOUT
emoji: 🔥
colorFrom: red
colorTo: green
sdk: docker
app_port: 7860
SPITITOUT
This version is set up for a Hugging Face Docker Space. It does not use Gemini or any external model API; the Space loads local Hugging Face models for chat, speech recognition, and speech synthesis.
Models
- Text on CPU Space:
Qwen/Qwen3-1.7B-GGUFwith llama.cpp / GGUF quantization - Text on GPU Space:
Qwen/Qwen3-4B-Instruct-2507with Transformers, or vLLM if you add a dedicated vLLM server - Speech to text:
openai/whisper-tiny - Text to speech:
hexgrad/Kokoro-82Mthrough thekokoroPython package, Mandarin voicezf_xiaobei
For a CPU-only Space, keep LLM_BACKEND=llamacpp. To switch back to Transformers, set LLM_BACKEND=transformers and update TEXT_MODEL.
Run locally
npm install
npm run build
pip install -r requirements.txt
python app.py
Open http://localhost:7860.
Deploy to Hugging Face Spaces
Create a Docker Space, then push this folder. The included Dockerfile builds the React frontend and serves it from the FastAPI backend.