Spaces:

Utkarsh430
/

ai-image-caption-generator

Sleeping

App Files Files Community

Utkarsh430 commited on Jan 20

Commit

c719f2f

verified ·

1 Parent(s): 1f034ba

Update README.md

Browse files

Files changed (1) hide show

README.md +45 -0

README.md CHANGED Viewed

@@ -12,3 +12,48 @@ short_description: image to caption generation using image processing and NLP
 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+AI Image Caption Generator
+Overview
+The AI Image Caption Generator is an end-to-end deep learning application that automatically generates meaningful natural-language descriptions for images.
+It demonstrates the practical use of vision–language models, covering the complete machine learning workflow from model inference to web deployment.
+The application is deployed on Hugging Face Spaces and provides a simple web interface for real-time image caption generation.
+Live Demo:
+https://huggingface.co/spaces/Utkarsh430/ai-image-caption-generator
+Key Features
+1.Generates natural language captions for images
+2.Uses a state-of-the-art vision–language transformer model
+3.Supports common image formats such as JPEG and PNG
+4.Optimized inference using lazy model loading
+5.Clean and interactive web interface using Gradio
+6.Fully deployed on Hugging Face Spaces (CPU)
+7.Automatic deployment on every commit
+Model Details
+Model Name: Salesforce/blip-image-captioning-base
+Task: Image-to-Text Generation
+Architecture: Vision–Language Transformer
+Framework: PyTorch
+BLIP (Bootstrapped Language-Image Pretraining) is trained on large-scale image–text datasets and is capable of understanding visual context to produce accurate and descriptive captions.
+Technology Stack
+| Category                | Tools / Libraries         |
+| ----------------------- | ------------------------- |
+| Programming Language    | Python                    |
+| Deep Learning Framework | PyTorch                   |
+| Vision & NLP            | Hugging Face Transformers |
+| Image Processing        | Pillow                    |
+| Web Interface           | Gradio                    |
+| Deployment Platform     | Hugging Face Spaces       |