---
title: Image Describer
emoji: 🏃
colorFrom: purple
colorTo: red
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
---

# Image Describer (vit-gpt2) - Hugging Face Space

This Space runs `nlpconnect/vit-gpt2-image-captioning` with an optional T5 rewriter.

- CPU-friendly and returns model outputs uncensored (no filter implemented).
- Optional prompt box to bias the paraphrase step.

To run locally:

1. Create a Python 3.8+ venv and install requirements: `pip install -r requirements.txt`
2. Run: `python app.py`

Notes:

- For production/throughput on CPU, reduce `max_length`/`num_beams` or convert to ONNX + quantize.
- This Space uses the public HF model weights; no HF tokens are required to run the demo.
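
The note about reducing `max_length`/`num_beams` can be sketched as below. This is a minimal illustration, not the contents of `app.py`: the `GenConfig` class, the `QUALITY` and `FAST_CPU` presets, their numeric values, and the helper names `load_captioner` and `caption` are all assumptions made for the example. It assumes `transformers`, `torch`, and `Pillow` are installed.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class GenConfig:
    """Decoding settings forwarded to model.generate() (hypothetical helper)."""
    max_length: int = 16
    num_beams: int = 4


# Illustrative presets; the numbers are assumptions, not values from app.py.
QUALITY = GenConfig(max_length=16, num_beams=4)   # beam search
FAST_CPU = GenConfig(max_length=12, num_beams=1)  # greedy decoding, shorter captions


def load_captioner(model_id="nlpconnect/vit-gpt2-image-captioning"):
    """Load the public captioning checkpoint (no HF token needed)."""
    # Imported lazily so the presets above can be inspected without heavy deps.
    from transformers import (
        AutoTokenizer,
        ViTImageProcessor,
        VisionEncoderDecoderModel,
    )

    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    processor = ViTImageProcessor.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model.eval()
    return model, processor, tokenizer


def caption(image, model, processor, tokenizer, config=FAST_CPU):
    """Return a caption for a PIL image using the given decoding settings."""
    import torch

    inputs = processor(images=image.convert("RGB"), return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(inputs.pixel_values, **asdict(config))
    return tokenizer.decode(output_ids[0], skip_special_tokens=True).strip()


if __name__ == "__main__":
    import sys

    from PIL import Image

    model, processor, tokenizer = load_captioner()
    print(caption(Image.open(sys.argv[1]), model, processor, tokenizer))
```

Running the script with an image path prints one caption. Passing `config=QUALITY` switches back to beam search, which is typically slower on CPU but may give more fluent captions.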