---
title: Image Describer
emoji: 🃏
colorFrom: purple
colorTo: red
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
---

# Image Describer (vit-gpt2) - Hugging Face Space

This Space captions images with `nlpconnect/vit-gpt2-image-captioning` and can optionally pass the caption through a T5 rewriter.
- Runs on CPU and returns model outputs unfiltered (no content filter is implemented).
- An optional prompt box lets you bias the paraphrase step.
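
The caption-then-rewrite flow described above can be sketched roughly as follows. This is a minimal sketch, not the code in `app.py`: the function names, the `paraphrase:` input format, and the default `max_length`/`num_beams` values are assumptions made for illustration, and the rewriter is left as a plain callable because the T5 checkpoint is not named here. The `transformers` classes are the ones the model's public card uses.

```python
from typing import Callable, Optional

MODEL_ID = "nlpconnect/vit-gpt2-image-captioning"


def build_rewrite_input(caption: str, prompt: str = "") -> str:
    """Combine the raw caption with the optional user prompt.

    The "paraphrase:" format is hypothetical; a real rewriter may expect
    a different input layout.
    """
    caption = caption.strip()
    prompt = prompt.strip()
    if not prompt:
        return f"paraphrase: {caption}"
    return f"paraphrase ({prompt}): {caption}"


def caption_image(image_path: str, max_length: int = 16, num_beams: int = 4) -> str:
    """Caption one image on CPU with the vit-gpt2 checkpoint."""
    # Heavy imports stay local so the helper above works without them installed.
    import torch
    from PIL import Image
    from transformers import AutoTokenizer, ViTImageProcessor, VisionEncoderDecoderModel

    model = VisionEncoderDecoderModel.from_pretrained(MODEL_ID)
    processor = ViTImageProcessor.from_pretrained(MODEL_ID)
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values
    with torch.no_grad():
        output_ids = model.generate(pixel_values, max_length=max_length, num_beams=num_beams)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True).strip()


def describe(
    image_path: str,
    rewriter: Optional[Callable[[str], str]] = None,
    prompt: str = "",
) -> str:
    """Return the raw caption, or the rewritten one when a rewriter is given."""
    caption = caption_image(image_path)
    if rewriter is None:
        return caption
    return rewriter(build_rewrite_input(caption, prompt))
```

Calling `describe("photo.jpg")` returns the raw caption; passing a `rewriter` callable (for example, a thin wrapper around a T5 text-to-text model) adds the paraphrase step. Loading the model inside `caption_image` keeps the sketch short; a real app would load it once at startup.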

To run locally:
1. Create a Python 3.8+ venv and install requirements: `pip install -r requirements.txt`
2. Run: `python app.py`

Notes:
- For higher throughput on CPU, reduce `max_length` or `num_beams`, or convert the model to ONNX and quantize it.
- This Space uses the public HF model weights; no HF tokens are required to run the demo.
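
To make the `max_length`/`num_beams` trade-off concrete, here is a rough sketch of two generation presets and a back-of-the-envelope cost comparison. The preset values and helper names are assumptions for illustration, not this Space's settings. The cost model is only an upper bound: beam search scores up to `num_beams` hypotheses at each of up to `max_length` decoding steps, and real latency also depends on the image encoder and on early stopping.

```python
# Two illustrative presets; the numbers are assumptions, not the Space's settings.
QUALITY = {"max_length": 16, "num_beams": 4}
FAST = {"max_length": 12, "num_beams": 1}  # greedy decoding, shorter captions


def beam_token_budget(max_length: int, num_beams: int) -> int:
    """Upper bound on hypothesis-tokens scored during one generate() call."""
    if max_length < 1 or num_beams < 1:
        raise ValueError("max_length and num_beams must be positive")
    return max_length * num_beams


def relative_cost(preset: dict, baseline: dict) -> float:
    """Decoder work of `preset` as a fraction of `baseline` (rough estimate)."""
    return beam_token_budget(**preset) / beam_token_budget(**baseline)


if __name__ == "__main__":
    # Fraction of the QUALITY decoding budget that FAST would use at most.
    print(relative_cost(FAST, QUALITY))
```

Either preset can be passed straight to generation, e.g. `model.generate(pixel_values, **FAST)`.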