metadata
title: Image2Wiki
emoji: 🖼️
colorFrom: blue
colorTo: green
sdk: docker
pinned: false
Image2Wiki
Image2Wiki is a service that generates a Wikipedia-style article from an uploaded image. It uses a fine-tuned VisionEncoderDecoder model (tuman/vit-rugpt2-image-captioning with a LoRA adapter) to produce structured text: a title, a lead, sections, and paragraphs.
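To make the structured output concrete, here is a minimal sketch of how a generated article (title, lead, sections, paragraphs) could be represented and rendered. The `Section` and `Article` classes and the `render_html` helper are hypothetical names chosen for illustration; the actual data model and templates in `app/` may differ.

```python
from dataclasses import dataclass, field
from html import escape


@dataclass
class Section:
    """One article section: a heading plus its paragraphs (hypothetical shape)."""
    heading: str
    paragraphs: list[str] = field(default_factory=list)


@dataclass
class Article:
    """A generated article: title, lead text, and ordered sections (hypothetical shape)."""
    title: str
    lead: str
    sections: list[Section] = field(default_factory=list)


def render_html(article: Article) -> str:
    """Render the article as a simple HTML fragment, escaping all text."""
    parts = [
        f"<h1>{escape(article.title)}</h1>",
        f"<p>{escape(article.lead)}</p>",
    ]
    for section in article.sections:
        parts.append(f"<h2>{escape(section.heading)}</h2>")
        parts.extend(f"<p>{escape(p)}</p>" for p in section.paragraphs)
    return "\n".join(parts)


if __name__ == "__main__":
    demo = Article(
        title="Lighthouse",
        lead="A lighthouse is a tower that emits light to guide ships.",
        sections=[Section("History", ["Early lighthouses used open fires."])],
    )
    print(render_html(demo))
```

Escaping every field matters here because the text comes from a generative model and should not be trusted as markup.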
Features
- FastAPI-based web service
- Wikipedia-like UI for generated articles
- Fine-tuned model for structured article generation
Setup
- Install dependencies:
pip install -r requirements.txt
- Run the service:
uvicorn app.main:app --port 8013 --reload
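Once the service is running, a quick reachability check can confirm that it started. The sketch below uses only the standard library; `is_service_up` is a hypothetical helper, and it assumes the app listens on uvicorn's default host (127.0.0.1) at port 8013 and answers requests to the root path `/`, which the project may or may not define.

```python
import urllib.error
import urllib.request


def is_service_up(url: str = "http://127.0.0.1:8013/", timeout: float = 3.0) -> bool:
    """Return True if an HTTP server responds at `url`, False if it is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered with an error status (e.g. 404), so it is running.
        return True
    except OSError:
        # Connection refused, timeout, DNS failure, and similar network errors.
        return False


if __name__ == "__main__":
    print("service reachable:", is_service_up())
```

An HTTP error status is treated as "up" because it still proves that a server is listening; only connection-level failures count as "down".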
Project Structure
- app/ - FastAPI application and UI templates
- adapted_best_embed2/ - Fine-tuned LoRA adapter weights
- collect_data.py & collect_data_async.py - Scripts for collecting training data
- finetune.ipynb - Notebook used for fine-tuning the model