# 🍌 Nano Banana: Dynamic Image Creation
**Powered by Gemini 2.5 Flash Image Preview (Nano Banana)**
Transform images with words, blend realities, and create dynamic visual content using Google's state-of-the-art Nano Banana model.
## 🚀 Live Demo
Access the application: [Hugging Face Space](your-space-url-here)
## ✨ Core Features
- **Word-Based Editing**: Transform images using natural language prompts
- **Reality Blending**: Seamlessly fuse different visual elements
- **Dynamic Creation**: Real-time image transformations
- **Multiple Modes**: Complete, Edit, and Blend operations
- **Style Control**: Realistic, Futuristic, and Artistic outputs
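As an illustration of how the modes and styles listed above could be combined into a single text prompt, here is a minimal sketch. The `Mode` and `Style` enums, the template wording, and the `build_prompt` helper are all hypothetical names introduced for this example; they are not taken from the application's source.

```python
from enum import Enum


class Mode(Enum):
    # Hypothetical identifiers for the three modes named in this README.
    COMPLETE = "complete"
    EDIT = "edit"
    BLEND = "blend"


class Style(Enum):
    # Hypothetical identifiers for the three output styles named in this README.
    REALISTIC = "realistic"
    FUTURISTIC = "futuristic"
    ARTISTIC = "artistic"


# Assumed prompt wording; the real application may phrase these differently.
_MODE_TEMPLATES = {
    Mode.COMPLETE: "Complete the image: {instruction}",
    Mode.EDIT: "Edit the image: {instruction}",
    Mode.BLEND: "Blend the provided images: {instruction}",
}


def build_prompt(mode: Mode, style: Style, instruction: str) -> str:
    """Combine a mode, a style, and a user instruction into one prompt string."""
    cleaned = instruction.strip().rstrip(".")
    if not cleaned:
        raise ValueError("instruction must not be empty")
    base = _MODE_TEMPLATES[mode].format(instruction=cleaned)
    return f"{base}. Style: {style.value}."


if __name__ == "__main__":
    print(build_prompt(Mode.EDIT, Style.ARTISTIC, "add a red hat"))
```

Keeping the mode and style as enums rather than free-form strings makes it easy to populate UI dropdowns and to reject unsupported values before any model call is made.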
## 🛠️ Optional Enhancements
- **Structure Detection**: YOLO-based object detection
- **Voice Narration**: ElevenLabs audio descriptions
## 🎯 Competition Submission
Built for the Google Nano Banana Competition, showcasing:
- Gemini 2.5 Flash Image Preview as the primary model
- Advanced image editing capabilities
- Dynamic visual storytelling
- Natural language photo editing
## 📝 Setup
Set environment variables:
```
GEMINI_API_KEY=your_gemini_key
# Optional: only needed for voice narration
ELEVENLABS_API_KEY=your_elevenlabs_key
```
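The variables above could be read at startup roughly as follows. This is a sketch under assumptions: the `AppConfig` class and `load_config` function are hypothetical names, not part of the application's code. It treats `GEMINI_API_KEY` as required and `ELEVENLABS_API_KEY` as optional, as described in this section.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class AppConfig:
    # Hypothetical container for the two keys named in this README.
    gemini_api_key: str
    elevenlabs_api_key: Optional[str] = None

    @property
    def voice_enabled(self) -> bool:
        # Voice narration is only available when the optional key is present.
        return self.elevenlabs_api_key is not None


def load_config(env: Mapping[str, str] = os.environ) -> AppConfig:
    """Read API keys from the environment, failing early if the required one is missing."""
    gemini_key = env.get("GEMINI_API_KEY", "").strip()
    if not gemini_key:
        raise RuntimeError("GEMINI_API_KEY is not set")
    # An empty or missing optional key is normalized to None.
    elevenlabs_key = env.get("ELEVENLABS_API_KEY", "").strip() or None
    return AppConfig(gemini_api_key=gemini_key, elevenlabs_api_key=elevenlabs_key)
```

Failing fast on the missing required key surfaces configuration problems at launch rather than on the first image request, while the optional key simply toggles the narration feature.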
## 🏆 Technical Highlights
- **Primary**: Gemini 2.5 Flash Image Preview (Nano Banana)
- **Optional**: YOLO detection, ElevenLabs voice
- **Framework**: Gradio for interactive UI
- **Deployment**: Optimized for Hugging Face Spaces