	---
	title: Station Vision TTS
	emoji: 🚉
	colorFrom: blue
	colorTo: green
	sdk: gradio
	sdk_version: 4.44.1
	app_file: app.py
	pinned: false
	---

	# Station Vision TTS Space

	Remote Gradio demo for the Japanese station-announcement pipeline:

	1. `LiquidAI/LFM2.5-VL-450M-Extract` extracts compact JSON from an image.
	2. `LiquidAI/LFM2-350M` turns the JSON into one short Japanese announcement.
	3. `3bitquantizers/lfm25-audio-jp-station-yamanote-merged-v1-all-langs-static` synthesizes live audio.
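
The three stages above can be sketched as a simple chain. This is only an illustration of the data flow, not the code in `app.py`: the function names, the stub stages, and the JSON field names (`station`, `line`) are all hypothetical stand-ins for the real model calls.

```python
import json
from typing import Callable


def run_pipeline(
    image_path: str,
    extract: Callable[[str], str],
    write_announcement: Callable[[dict], str],
    synthesize: Callable[[str], bytes],
) -> tuple[dict, str, bytes]:
    """Chain the stages: image -> JSON -> announcement text -> audio."""
    raw = extract(image_path)          # stage 1: vision model returns a JSON string
    fields = json.loads(raw)           # fail early if the JSON is malformed
    text = write_announcement(fields)  # stage 2: one short Japanese announcement
    audio = synthesize(text)           # stage 3: synthesized audio bytes
    return fields, text, audio


# Hypothetical stub stages so the sketch runs without any model weights.
def fake_extract(image_path: str) -> str:
    return json.dumps({"station": "渋谷", "line": "山手線"})


def fake_write(fields: dict) -> str:
    return f"まもなく、{fields['station']}です。"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")  # placeholder for real waveform data


fields, text, audio = run_pipeline("sign.jpg", fake_extract, fake_write, fake_tts)
print(text)
```

Passing the stages in as callables keeps each model swappable and lets the chain be tested without a GPU.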

	## Deploy

	Create a GPU-backed Gradio Space under `3bitquantizers`, then copy these files to the Space root:

	```bash
	cp app.py requirements.txt README.md /path/to/space/
	cp ../image_to_station_audio.py /path/to/space/
	```

	Set Space secrets:

	- `HF_TOKEN`: a token with read access to the private `3bitquantizers` model repos.
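
Space secrets are exposed to the app as environment variables, so a missing token can be caught at startup instead of surfacing later as a download failure. A minimal sketch, assuming a hypothetical helper named `read_hf_token` (not necessarily present in `app.py`):

```python
import os
from typing import Mapping


def read_hf_token(env: Mapping[str, str] = os.environ) -> str:
    """Return the HF_TOKEN secret, or raise with an actionable message."""
    token = env.get("HF_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "HF_TOKEN is not set; add it as a Space secret with read access "
            "to the private model repos."
        )
    return token


# Usage at startup (commented out so the sketch runs without the secret):
# token = read_hf_token()
```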

	Optional Space variables:

	- `VL_MODEL_ID`
	- `PROMPT_MODEL_ID`
	- `TTS_MODEL_ID`
	- `MAX_TTS_TOKENS`
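
One plausible way to read these variables is shown below. It assumes that each `*_MODEL_ID` variable overrides the model for the matching pipeline stage and falls back to the model IDs listed above. The `Settings` class, `load_settings`, and the fallback of `512` for `MAX_TTS_TOKENS` are hypothetical; check `app.py` for the real defaults.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    vl_model_id: str
    prompt_model_id: str
    tts_model_id: str
    max_tts_tokens: int


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read optional overrides, falling back to the README's model IDs."""
    return Settings(
        vl_model_id=env.get("VL_MODEL_ID", "LiquidAI/LFM2.5-VL-450M-Extract"),
        prompt_model_id=env.get("PROMPT_MODEL_ID", "LiquidAI/LFM2-350M"),
        tts_model_id=env.get(
            "TTS_MODEL_ID",
            "3bitquantizers/lfm25-audio-jp-station-yamanote-merged-v1-all-langs-static",
        ),
        # 512 is a placeholder default, not a value taken from the app.
        max_tts_tokens=int(env.get("MAX_TTS_TOKENS", "512")),
    )


settings = load_settings({"MAX_TTS_TOKENS": "128"})
print(settings.max_tts_tokens)
```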

	Use a GPU tier with 24 GB of memory or larger for the first smoke test. The app keeps models warm and queues requests one at a time.
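
The warm-model, one-request-at-a-time behavior can be illustrated with the standard library alone. This is a sketch of the idea, not the app's implementation: in a Gradio app the built-in request queue would normally handle serialization, and `get_model`, `handle_request`, and the dictionary standing in for a loaded model are hypothetical.

```python
import threading
from functools import lru_cache

_request_lock = threading.Lock()
load_count = 0  # counts how many times a model is actually loaded


@lru_cache(maxsize=None)
def get_model(model_id: str) -> dict:
    """Load a model once and keep it cached (warm) for later requests."""
    global load_count
    load_count += 1
    return {"id": model_id}  # placeholder for a real loaded model


def handle_request(model_id: str, payload: str) -> str:
    """Serve one request at a time, reusing the cached model."""
    with _request_lock:
        model = get_model(model_id)
        return f"{model['id']}:{payload}"


first = handle_request("tts", "a")
second = handle_request("tts", "b")
print(first, second, load_count)
```

Only the first request pays the load cost; later requests reuse the cached object, which is why the first smoke test is the slowest.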