🍌 Nano Banana: Dynamic Image Creation

Powered by Gemini 2.5 Flash Image Preview (Nano Banana)

Transform images with words, blend realities, and create dynamic visual content using Google's state-of-the-art Nano Banana model.

🚀 Live Demo

Access the application: Hugging Face Space

✨ Core Features

  • Word-Based Editing: Transform images using natural language prompts
  • Reality Blending: Seamlessly fuse different visual elements
  • Dynamic Creation: Real-time image transformations
  • Multiple Modes: Complete, Edit, and Blend operations
  • Style Control: Realistic, Futuristic, and Artistic outputs
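As an illustration of how a mode and a style from the list above might be folded into a single text prompt, here is a minimal sketch. The `Mode` and `Style` enums, the template wording, and `build_prompt` are all hypothetical names chosen for this example; the application's actual prompt construction is not documented in this README.

```python
from enum import Enum


class Mode(Enum):
    COMPLETE = "complete"
    EDIT = "edit"
    BLEND = "blend"


class Style(Enum):
    REALISTIC = "realistic"
    FUTURISTIC = "futuristic"
    ARTISTIC = "artistic"


# Hypothetical wording per mode (an assumption, not the app's real prompts).
_MODE_TEMPLATES = {
    Mode.COMPLETE: "Complete the image: {instruction}.",
    Mode.EDIT: "Edit the image: {instruction}.",
    Mode.BLEND: "Blend the provided images: {instruction}.",
}


def build_prompt(mode: Mode, style: Style, instruction: str) -> str:
    """Combine the user's instruction with the chosen mode and style."""
    # Normalise whitespace and a trailing period so the template adds exactly one.
    text = instruction.strip().rstrip(".")
    if not text:
        raise ValueError("instruction must not be empty")
    head = _MODE_TEMPLATES[mode].format(instruction=text)
    return f"{head} Style: {style.value}."
```

For example, `build_prompt(Mode.EDIT, Style.ARTISTIC, "add a red hat")` returns `"Edit the image: add a red hat. Style: artistic."`. Keeping modes and styles as enums means a UI dropdown can only ever pass one of the supported values.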

πŸ› οΈ Optional Enhancements

  • Structure Detection: YOLO-based object detection
  • Voice Narration: ElevenLabs audio descriptions

🎯 Competition Submission

Built for the Google Nano Banana Competition showcasing:

  • Gemini 2.5 Flash Image Preview as the primary model
  • Advanced image editing capabilities
  • Dynamic visual storytelling
  • Natural language photo editing

πŸ“ Setup

Set environment variables (ELEVENLABS_API_KEY is optional and only needed for voice narration):

GEMINI_API_KEY=your_gemini_key
ELEVENLABS_API_KEY=your_elevenlabs_key
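A minimal sketch of how an app could read these two variables at startup, treating the Gemini key as required and the ElevenLabs key as optional. The `ApiKeys` type and `load_api_keys` function are hypothetical names for this example and are not taken from the project's code.

```python
import os
from typing import Mapping, NamedTuple, Optional


class ApiKeys(NamedTuple):
    gemini: str
    elevenlabs: Optional[str]  # None means voice narration stays disabled


def load_api_keys(env: Mapping[str, str] = os.environ) -> ApiKeys:
    """Read the keys named above; only GEMINI_API_KEY is mandatory."""
    gemini = env.get("GEMINI_API_KEY", "").strip()
    if not gemini:
        raise RuntimeError("GEMINI_API_KEY is not set")
    # A missing or blank ElevenLabs key is treated as "feature off".
    elevenlabs = env.get("ELEVENLABS_API_KEY", "").strip() or None
    return ApiKeys(gemini=gemini, elevenlabs=elevenlabs)
```

Calling `load_api_keys()` with no arguments reads from the process environment; passing a plain dictionary instead makes the function easy to exercise in tests. Failing fast on a missing Gemini key surfaces a misconfigured deployment at startup instead of on the first image request.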

πŸ† Technical Highlights

  • Primary: Gemini 2.5 Flash Image Preview (Nano Banana)
  • Optional: YOLO detection, ElevenLabs voice
  • Framework: Gradio for interactive UI
  • Deployment: Optimized for Hugging Face Spaces