---
title: Station Vision TTS
emoji: 🚉
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
---

# Station Vision TTS Space

Remote Gradio demo for the Japanese station-announcement pipeline:

1. `LiquidAI/LFM2.5-VL-450M-Extract` extracts compact JSON from an image.
2. `LiquidAI/LFM2-350M` turns the JSON into one short Japanese announcement.
3. `3bitquantizers/lfm25-audio-jp-station-yamanote-merged-v1-all-langs-static` synthesizes live audio.

## Deploy

Create a GPU-backed Gradio Space under `3bitquantizers`, then copy these files to the Space root:

```bash
cp app.py requirements.txt README.md /path/to/space/
cp ../image_to_station_audio.py /path/to/space/
```

Set Space secrets:

- `HF_TOKEN`: read access to private `3bitquantizers` model repos.

Optional Space variables:

- `VL_MODEL_ID`
- `PROMPT_MODEL_ID`
- `TTS_MODEL_ID`
- `MAX_TTS_TOKENS`

Use a 24 GB GPU tier or larger for the first smoke test. The app keeps models warm and queues requests one at a time.
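
As a rough illustration of how the secret and the optional variables listed above could be consumed, here is a minimal sketch of a config loader. It is not taken from `app.py`: `SpaceConfig` and `load_config` are hypothetical names, the model-ID defaults are simply the three models named in the pipeline list, and the `MAX_TTS_TOKENS` default of 512 is an assumed placeholder because this README does not state the real value.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Defaults for the model IDs mirror the pipeline list in this README.
DEFAULT_VL_MODEL_ID = "LiquidAI/LFM2.5-VL-450M-Extract"
DEFAULT_PROMPT_MODEL_ID = "LiquidAI/LFM2-350M"
DEFAULT_TTS_MODEL_ID = (
    "3bitquantizers/lfm25-audio-jp-station-yamanote-merged-v1-all-langs-static"
)
# Assumption: the real default is not documented; 512 is a placeholder.
DEFAULT_MAX_TTS_TOKENS = 512


@dataclass(frozen=True)
class SpaceConfig:
    """Hypothetical container for the Space's settings."""

    vl_model_id: str
    prompt_model_id: str
    tts_model_id: str
    max_tts_tokens: int
    hf_token: Optional[str]


def load_config(env: Optional[Mapping[str, str]] = None) -> SpaceConfig:
    """Read settings from `env` (defaults to os.environ).

    Unset or blank variables fall back to the defaults above.
    """
    env = os.environ if env is None else env

    def pick(name: str, default: str) -> str:
        value = env.get(name, "").strip()
        return value or default

    return SpaceConfig(
        vl_model_id=pick("VL_MODEL_ID", DEFAULT_VL_MODEL_ID),
        prompt_model_id=pick("PROMPT_MODEL_ID", DEFAULT_PROMPT_MODEL_ID),
        tts_model_id=pick("TTS_MODEL_ID", DEFAULT_TTS_MODEL_ID),
        max_tts_tokens=int(pick("MAX_TTS_TOKENS", str(DEFAULT_MAX_TTS_TOKENS))),
        # HF_TOKEN is a secret; None means only public repos are reachable.
        hf_token=env.get("HF_TOKEN") or None,
    )
```

Calling `load_config()` once at startup and passing the result to the model loaders would keep the environment lookups in one place, so that swapping a model only requires changing a Space variable and restarting the Space.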