Abs6187 committed on
Commit cffdb30 · verified · 1 Parent(s): 3ae85fa

Update README.md

Files changed (1)
  1. README.md +50 -44
README.md CHANGED
@@ -1,45 +1,51 @@
- # 🍌 Nano Banana: Dynamic Image Creation
-
- **Powered by Gemini 2.5 Flash Image Preview (Nano Banana)**
-
- Transform images with words, blend realities, and create dynamic visual content using Google's state-of-the-art Nano Banana model.
-
- ## 🚀 Live Demo
-
- Access the application: [Hugging Face Space](your-space-url-here)
-
- ## ✨ Core Features
-
- - **Word-Based Editing**: Transform images using natural language prompts
- - **Reality Blending**: Seamlessly fuse different visual elements
- - **Dynamic Creation**: Real-time image transformations
- - **Multiple Modes**: Complete, Edit, and Blend operations
- - **Style Control**: Realistic, Futuristic, and Artistic outputs
-
- ## 🛠️ Optional Enhancements
-
- - **Structure Detection**: YOLO-based object detection
- - **Voice Narration**: ElevenLabs audio descriptions
-
- ## 🎯 Competition Submission
-
- Built for the Google Nano Banana Competition showcasing:
- - Gemini 2.5 Flash Image Preview as the primary model
- - Advanced image editing capabilities
- - Dynamic visual storytelling
- - Natural language photo editing
-
- ## 📝 Setup
-
- Set environment variables:
- ```
- GEMINI_API_KEY=your_gemini_key
- ELEVENLABS_API_KEY=your_elevenlabs_key (optional)
- ```
-
- ## 🏆 Technical Highlights
-
- - **Primary**: Gemini 2.5 Flash Image (Nano Banana)
- - **Optional**: YOLO detection, ElevenLabs voice
- - **Framework**: Gradio for interactive UI
  - **Deployment**: Optimized for Hugging Face Spaces
+ ---
+ sdk: gradio
+ thumbnail: >-
+   https://cdn-uploads.huggingface.co/production/uploads/651bfdb7164539754da51704/ON47OW1gYhYS2IWADMzlu.png
+ short_description: AI-Powered Completion of Unfinished Constructions
+ ---
+ # 🍌 Nano Banana: Dynamic Image Creation
+
+ **Powered by Gemini 2.5 Flash Image Preview (Nano Banana)**
+
+ Transform images with words, blend realities, and create dynamic visual content using Google's state-of-the-art Nano Banana model.
+
+ ## 🚀 Live Demo
+
+ Access the application: [Hugging Face Space](your-space-url-here)
+
+ ## ✨ Core Features
+
+ - **Word-Based Editing**: Transform images using natural language prompts
+ - **Reality Blending**: Seamlessly fuse different visual elements
+ - **Dynamic Creation**: Real-time image transformations
+ - **Multiple Modes**: Complete, Edit, and Blend operations
+ - **Style Control**: Realistic, Futuristic, and Artistic outputs
+
+ ## 🛠️ Optional Enhancements
+
+ - **Structure Detection**: YOLO-based object detection
+ - **Voice Narration**: ElevenLabs audio descriptions
+
+ ## 🎯 Competition Submission
+
+ Built for the Google Nano Banana Competition showcasing:
+ - Gemini 2.5 Flash Image Preview as the primary model
+ - Advanced image editing capabilities
+ - Dynamic visual storytelling
+ - Natural language photo editing
+
+ ## 📝 Setup
+
+ Set environment variables:
+ ```
+ GEMINI_API_KEY=your_gemini_key
+ ELEVENLABS_API_KEY=your_elevenlabs_key (optional)
+ ```
+
+ ## 🏆 Technical Highlights
+
+ - **Primary**: Gemini 2.5 Flash Image (Nano Banana)
+ - **Optional**: YOLO detection, ElevenLabs voice
+ - **Framework**: Gradio for interactive UI
  - **Deployment**: Optimized for Hugging Face Spaces
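
The README's Setup section names two environment variables, one required and one marked optional. As a minimal sketch of how an application might read them at startup: the helper name `load_api_keys` and the returned dictionary keys are hypothetical, and nothing here is taken from the Space's actual code.

```python
import os
from typing import Mapping, Optional


def load_api_keys(env: Optional[Mapping[str, str]] = None) -> dict:
    """Hypothetical helper: read the two variables named in the Setup section."""
    # Default to the real process environment; a plain dict can be passed for testing.
    env = os.environ if env is None else env

    # The Gemini key is treated as required, so a missing value fails fast.
    gemini_key = env.get("GEMINI_API_KEY", "").strip()
    if not gemini_key:
        raise RuntimeError("GEMINI_API_KEY is not set")

    # The README marks the ElevenLabs key as optional, so a missing value
    # only switches the voice feature off instead of raising.
    eleven_key = env.get("ELEVENLABS_API_KEY", "").strip() or None

    return {
        "gemini_api_key": gemini_key,
        "elevenlabs_api_key": eleven_key,
        "voice_enabled": eleven_key is not None,
    }
```

Calling `load_api_keys()` with no argument reads from `os.environ`. On Hugging Face Spaces, secrets configured in the Space settings are exposed to the app as environment variables, so the same lookup would apply there.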