Spaces:
Sleeping
Sleeping
π Nano Banana: Dynamic Image Creation
Powered by Gemini 2.5 Flash Image Preview (Nano Banana)
Transform images with words, blend realities, and create dynamic visual content using Google's state-of-the-art Nano Banana model.
π Live Demo
Access the application: Hugging Face Space
β¨ Core Features
- Word-Based Editing: Transform images using natural language prompts
- Reality Blending: Seamlessly fuse different visual elements
- Dynamic Creation: Real-time image transformations
- Multiple Modes: Complete, Edit, and Blend operations
- Style Control: Realistic, Futuristic, and Artistic outputs
π οΈ Optional Enhancements
- Structure Detection: YOLO-based object detection
- Voice Narration: ElevenLabs audio descriptions
π― Competition Submission
Built for the Google Nano Banana Competition showcasing:
- Gemini 2.5 Flash Image Preview as the primary model
- Advanced image editing capabilities
- Dynamic visual storytelling
- Natural language photo editing
π Setup
Set environment variables:
GEMINI_API_KEY=your_gemini_key
ELEVENLABS_API_KEY=your_elevenlabs_key (optional)
π Technical Highlights
- Primary: Gemini 2.5 Flash Image (Nano Banana)
- Optional: YOLO detection, ElevenLabs voice
- Framework: Gradio for interactive UI
- Deployment: Optimized for Hugging Face Spaces