supib4132
/

RAGEXplo

Model card Files Files and versions

RAGEXplo / README.md

supib4132's picture

Update README.md

cd469b0 verified 10 months ago

|

history blame contribute delete

447 Bytes

🏛️ RAG Image Captioning with Landmark Location

This model generates captions for monument/landmark images using a retrieval-augmented generation approach.

How it works:

Uses CLIP to extract image embeddings.
Retrieves top-k similar captions via FAISS.
Generates a detailed caption with name and location using T5.

Example

Input: 🏰 Image of the Taj Mahal
Output: " is a white marble mausoleum located in Agra, India."