Spaces:
Sleeping
Sleeping
| # π Nano Banana: Dynamic Image Creation | |
| **Powered by Gemini 2.5 Flash Image Preview (Nano Banana)** | |
| Transform images with words, blend realities, and create dynamic visual content using Google's state-of-the-art Nano Banana model. | |
| ## π Live Demo | |
| Access the application: [Hugging Face Space](your-space-url-here) | |
| ## β¨ Core Features | |
| - **Word-Based Editing**: Transform images using natural language prompts | |
| - **Reality Blending**: Seamlessly fuse different visual elements | |
| - **Dynamic Creation**: Real-time image transformations | |
| - **Multiple Modes**: Complete, Edit, and Blend operations | |
| - **Style Control**: Realistic, Futuristic, and Artistic outputs | |
| ## π οΈ Optional Enhancements | |
| - **Structure Detection**: YOLO-based object detection | |
| - **Voice Narration**: ElevenLabs audio descriptions | |
| ## π― Competition Submission | |
| Built for the Google Nano Banana Competition showcasing: | |
| - Gemini 2.5 Flash Image Preview as the primary model | |
| - Advanced image editing capabilities | |
| - Dynamic visual storytelling | |
| - Natural language photo editing | |
| ## π Setup | |
| Set environment variables: | |
| ``` | |
| GEMINI_API_KEY=your_gemini_key | |
| ELEVENLABS_API_KEY=your_elevenlabs_key (optional) | |
| ``` | |
| ## π Technical Highlights | |
| - **Primary**: Gemini 2.5 Flash Image (Nano Banana) | |
| - **Optional**: YOLO detection, ElevenLabs voice | |
| - **Framework**: Gradio for interactive UI | |
| - **Deployment**: Optimized for Hugging Face Spaces |