Utkarsh430 committed on
Commit c719f2f · verified · 1 Parent(s): 1f034ba

Update README.md

  1. README.md +45 -0
README.md CHANGED
@@ -12,3 +12,48 @@ short_description: image to caption generation using image processing and NLP
  ---

  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+ AI Image Caption Generator
+
+ Overview
+
+ The AI Image Caption Generator is an end-to-end deep learning application that generates natural-language descriptions of images.
+ It demonstrates a practical use of vision–language models, covering the workflow from model inference to web deployment.
+
+ The application is deployed on Hugging Face Spaces and provides a simple web interface for real-time image caption generation.
+
+ Live Demo:
+ https://huggingface.co/spaces/Utkarsh430/ai-image-caption-generator
+
+
+ Key Features
+
+ 1. Generates natural-language captions for images
+ 2. Uses the BLIP vision–language transformer model (Salesforce/blip-image-captioning-base)
+ 3. Supports common image formats such as JPEG and PNG
+ 4. Lazy model loading: the model is loaded on first use rather than at startup
+ 5. Clean, interactive web interface built with Gradio
+ 6. Runs on Hugging Face Spaces CPU hardware
+ 7. Redeploys automatically on every commit
+
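The lazy model loading mentioned in the feature list could look roughly like the sketch below. It is an illustration under assumptions, not the Space's actual code: `Lazy`, `load_blip`, and `blip` are hypothetical names, and the README does not say how the loading is really implemented.

```python
import threading
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


class Lazy(Generic[T]):
    """Hypothetical helper: run `loader` once, on the first call to get()."""

    def __init__(self, loader: Callable[[], T]) -> None:
        self._loader = loader
        self._value: Optional[T] = None
        self._loaded = False
        self._lock = threading.Lock()

    def get(self) -> T:
        # Double-checked locking so concurrent requests load the model once.
        if not self._loaded:
            with self._lock:
                if not self._loaded:
                    self._value = self._loader()
                    self._loaded = True
        return self._value


def load_blip():
    """Assumed loader for the model named in this README."""
    # Imported here so that importing this module stays cheap.
    from transformers import BlipForConditionalGeneration, BlipProcessor

    model_id = "Salesforce/blip-image-captioning-base"
    processor = BlipProcessor.from_pretrained(model_id)
    model = BlipForConditionalGeneration.from_pretrained(model_id)
    model.eval()
    return processor, model


# Nothing is downloaded or loaded until blip.get() is first called.
blip = Lazy(load_blip)
```

With this shape, the first caption request pays the loading cost and later requests reuse the cached processor and model.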
+ Model Details
+
+ Model Name: Salesforce/blip-image-captioning-base
+
+ Task: Image-to-Text Generation
+
+ Architecture: Vision–Language Transformer
+
+ Framework: PyTorch
+
+ BLIP (Bootstrapping Language-Image Pre-training) is pretrained on large-scale image–text datasets and uses the visual context of an image to produce descriptive captions.
+
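A minimal sketch of how a caption might be produced with this model through Hugging Face Transformers. The function names `generate_caption` and `caption_file` and the `max_new_tokens=30` limit are assumptions chosen for illustration; the Space's own inference code is not shown in this README.

```python
def generate_caption(image, processor, model, max_new_tokens=30):
    """Return a caption for a PIL image, given a BLIP processor and model."""
    # Convert to RGB first: PNG files may carry an alpha channel or a palette.
    inputs = processor(images=image.convert("RGB"), return_tensors="pt")
    # max_new_tokens is an assumed cap on caption length, not a documented value.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True).strip()


def caption_file(path, model_id="Salesforce/blip-image-captioning-base"):
    """Load the model and caption the image stored at `path`."""
    # Heavy imports are kept inside the function so the module imports quickly.
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    processor = BlipProcessor.from_pretrained(model_id)
    model = BlipForConditionalGeneration.from_pretrained(model_id)
    with Image.open(path) as image:
        return generate_caption(image, processor, model)
```

Calling `caption_file("photo.jpg")` (a placeholder path) downloads the model weights on first use and returns the generated caption as a string.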
+ Technology Stack
+
+ | Category | Tools / Libraries |
+ | ----------------------- | ------------------------- |
+ | Programming Language | Python |
+ | Deep Learning Framework | PyTorch |
+ | Vision & NLP | Hugging Face Transformers |
+ | Image Processing | Pillow |
+ | Web Interface | Gradio |
+ | Deployment Platform | Hugging Face Spaces |
+
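To show how the pieces in the table might fit together, here is a hedged sketch of a Gradio front end. `make_handler` and `build_demo` are hypothetical names, and `captioner` stands for any function that maps a PIL image to a caption string; the actual `app.py` of the Space may be organized differently.

```python
def make_handler(captioner):
    """Wrap a captioning function so that an empty upload yields a message."""

    def handler(image):
        # Gradio passes None when the user submits without an image.
        if image is None:
            return "Please upload an image."
        return captioner(image)

    return handler


def build_demo(captioner):
    """Build the web interface around `captioner` (PIL image -> str)."""
    import gradio as gr  # imported lazily; only needed when serving the UI

    return gr.Interface(
        fn=make_handler(captioner),
        inputs=gr.Image(type="pil", label="Image"),
        outputs=gr.Textbox(label="Caption"),
        title="AI Image Caption Generator",
    )


# An app entry point would typically end with something like:
#   build_demo(my_caption_function).launch()
```

Keeping the `None` check in a plain function makes the input handling easy to test without starting the web server.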