Utkarsh430's picture
Update README.md
c719f2f verified

A newer version of the Gradio SDK is available: 6.9.0

Upgrade
metadata
title: Ai Image Caption Generator
emoji: 
colorFrom: pink
colorTo: indigo
sdk: gradio
sdk_version: 6.3.0
app_file: app.py
pinned: false
license: mit
short_description: image to caption generation using image processing and NLP

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference AI Image Caption Generator

Overview

The AI Image Caption Generator is an end-to-end deep learning application that automatically generates meaningful natural-language descriptions for images. It demonstrates the practical use of vision–language models, covering the complete machine learning workflow from model inference to web deployment.

The application is deployed on Hugging Face Spaces and provides a simple web interface for real-time image caption generation.

Live Demo: https://huggingface.co/spaces/Utkarsh430/ai-image-caption-generator

Key Features

1.Generates natural language captions for images 2.Uses a state-of-the-art vision–language transformer model 3.Supports common image formats such as JPEG and PNG 4.Optimized inference using lazy model loading 5.Clean and interactive web interface using Gradio 6.Fully deployed on Hugging Face Spaces (CPU) 7.Automatic deployment on every commit

Model Details

Model Name: Salesforce/blip-image-captioning-base

Task: Image-to-Text Generation

Architecture: Vision–Language Transformer

Framework: PyTorch

BLIP (Bootstrapped Language-Image Pretraining) is trained on large-scale image–text datasets and is capable of understanding visual context to produce accurate and descriptive captions.

Technology Stack

Category Tools / Libraries
Programming Language Python
Deep Learning Framework PyTorch
Vision & NLP Hugging Face Transformers
Image Processing Pillow
Web Interface Gradio
Deployment Platform Hugging Face Spaces