Spaces: leave-everything/nano

leave-everything committed on Nov 22, 2025

Commit 9944771 (verified) · 1 parent: 1e1f199

Upload 4 files

Files changed (4)

.gitignore +35 -13
README.md +128 -0
app.py +187 -0
requirements.txt +3 -20

.gitignore CHANGED

@@ -1,19 +1,31 @@
-# Environment variables
-.env
-.env.local
-# Generated images
-generated_images/
 # Python
 __pycache__/
 *.py[cod]
 *$py.class
 *.so
-# Virtual environment
 venv/
-env/
 ENV/
 # IDE
@@ -21,14 +33,24 @@ ENV/
 .idea/
 *.swp
 *.swo
-# OS
 .DS_Store
-Thumbs.db
 # Logs
 *.log
 # Temporary files
 *.tmp
-temp/

 # Python
 __pycache__/
 *.py[cod]
 *$py.class
 *.so
+.Python
+env/
+venv/
+ENV/
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+# Virtual Environment
+.venv/
 venv/
 ENV/
 # IDE
 .idea/
 *.swp
 *.swo
+*~
 .DS_Store
+# Gradio
+gradio_cached_examples/
+flagged/
+# API Keys and Secrets
+.env
+.env.local
+*.key
+secrets.json
 # Logs
 *.log
+logs/
 # Temporary files
 *.tmp
+temp/
+tmp/
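
The new ignore rules keep `.env` files and key material out of version control. For local runs, one way to take advantage of that is to fall back to an environment variable when the key textbox is left empty. The following is only a sketch: `resolve_api_key` is a hypothetical helper that is not part of this commit, and `GEMINI_API_KEY` is an assumed variable name.

```python
import os


def resolve_api_key(ui_value, env_var="GEMINI_API_KEY"):
    """Pick the API key to use for a request.

    Hypothetical helper (not in the commit): prefers the value typed into
    the UI, then falls back to an environment variable. Returns "" when
    neither source provides a key.
    """
    key = (ui_value or "").strip()
    if key:
        return key
    # Assumed variable name; adjust to whatever your local .env defines.
    return os.environ.get(env_var, "").strip()
```

With a helper like this, `generate_image` could call `resolve_api_key(api_key)` before its empty-key check, so the hosted Space still requires a typed key while a local checkout can rely on the environment.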

README.md ADDED

	@@ -0,0 +1,128 @@

+---
+title: Gemini Image Generation
+emoji: 🎨
+colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: 5.49.1
+app_file: app.py
+pinned: false
+license: mit
+---
+# 🎨 Gemini Image Generation App
+A simple and intuitive Gradio application for generating images using Google's Gemini image models (Nano Banana 🍌).
+## Features
+- 🔑 Secure API key input
+- 🤖 **Model Selection**: Choose between Gemini 3 Pro Image (Nano Banana Pro) or 2.5 Flash Image (Nano Banana)
+- 📝 Text-to-image generation with state-of-the-art quality
+- 📐 Multiple aspect ratio options (1:1, 16:9, 9:16, 4:3, 3:4)
+- 💡 Example prompts for inspiration
+- 🎨 Clean and modern UI
+## Models Available
+### 🍌⭐ Gemini 3 Pro Image (Nano Banana Pro)
+- **Release**: November 20, 2025
+- **Best for**: Professional-grade image generation
+- **Features**: High-resolution (1K/2K/4K), advanced text rendering, complex compositions
+- **Model ID**: `gemini-3-pro-image-preview`
+### 🍌⚡ Gemini 2.5 Flash Image (Nano Banana)
+- **Release**: August 26, 2025
+- **Best for**: Fast, efficient image generation
+- **Features**: 1024px resolution, 2-3x faster than competitors, high-volume tasks
+- **Model ID**: `gemini-2.5-flash-image`
+## How to Use
+1. **Get API Key**: Visit [Google AI Studio](https://aistudio.google.com/app/apikey) to obtain your API key
+2. **Enter API Key**: Paste your API key in the secure input field
+3. **Select Model**: Choose between Gemini 3 Pro (quality) or 2.5 Flash (speed)
+4. **Write Prompt**: Describe the image you want to generate in detail
+5. **Select Aspect Ratio**: Choose your preferred aspect ratio
+6. **Generate**: Click the "Generate Image" button and wait for your image
+## Tips for Better Results
+- Be specific and detailed in your prompts
+- Include information about:
+  - Style (realistic, artistic, abstract, etc.)
+  - Mood and atmosphere
+  - Lighting conditions
+  - Color palette
+  - Composition details
+- Reference specific artists or art movements if desired
+- Experiment with different aspect ratios for your use case
+## Requirements
+- Google AI API key (free tier available)
+- Internet connection
+- Modern web browser
+## Technical Details
+- **Models**:
+  - Gemini 3 Pro Image (`gemini-3-pro-image-preview`)
+  - Gemini 2.5 Flash Image (`gemini-2.5-flash-image`)
+- **Framework**: Gradio 5.49.1 (stable)
+- **API**: Google GenAI Python SDK (google-genai 1.52.0+)
+- **Deployment**: Hugging Face Spaces
+- **Python**: 3.10+
+## Privacy & Security
+- Your API key is not stored or logged
+- API key is only used for the current generation request
+- All processing happens through Google's secure API
+- No image data is stored by this application
+## Local Development
+To run this app locally:
+```bash
+# Clone the repository
+git clone <repository-url>
+cd <repository-name>
+# Install dependencies
+pip install -r requirements.txt
+# Run the app
+python app.py
+```
+The app will be available at `http://localhost:7860`
+## API Costs
+Image generation using Gemini API may incur costs. Please refer to [Google's pricing page](https://ai.google.dev/pricing) for current rates.
+## Limitations
+- Generation time: 10-30 seconds per image
+- Rate limits apply based on your API tier
+- Content policy restrictions apply
+- Some prompts may be filtered for safety
+## Support
+For issues or questions:
+- Check [Google AI documentation](https://ai.google.dev/)
+- Review [Gradio documentation](https://www.gradio.app/docs)
+- Open an issue in this repository
+## License
+MIT License - feel free to use and modify for your projects
+## Acknowledgments
+- Google GenAI SDK for unified API access to Gemini image models
+- Gradio for the modern web interface framework
+- Hugging Face for the hosting platform
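
The README's "How to Use" steps amount to four inputs: an API key, a model choice, a prompt, and an aspect ratio. The sketch below shows one way those inputs could be checked before any API call is made. `validate_request` and `ASPECT_RATIOS` are hypothetical names, and the display names are written without the emoji used in `app.py`; only the model IDs and the ratio list come from the commit.

```python
# Model IDs and aspect ratios as listed in the README.
MODELS = {
    "Gemini 3 Pro Image (Nano Banana Pro)": "gemini-3-pro-image-preview",
    "Gemini 2.5 Flash Image (Nano Banana)": "gemini-2.5-flash-image",
}
ASPECT_RATIOS = ("1:1", "16:9", "9:16", "4:3", "3:4")


def validate_request(api_key, prompt, model_name, aspect_ratio):
    """Return (model_id, None) if the inputs look usable, else (None, error).

    Hypothetical helper: it only checks the inputs locally and never
    contacts the API.
    """
    if not api_key or not api_key.strip():
        return None, "Please enter your API key"
    if not prompt or not prompt.strip():
        return None, "Please enter a prompt"
    if model_name not in MODELS:
        return None, f"Unknown model: {model_name}"
    if aspect_ratio not in ASPECT_RATIOS:
        return None, f"Unsupported aspect ratio: {aspect_ratio}"
    return MODELS[model_name], None
```

Checking the model name and ratio up front avoids a `KeyError` or a round trip to the API for values the UI would never offer but a programmatic caller might send.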

app.py ADDED

	@@ -0,0 +1,187 @@

+import gradio as gr
+from google import genai
+from google.genai import types
+from PIL import Image
+import io
+# Model configuration
+MODELS = {
+    "Gemini 3 Pro Image (Nano Banana Pro) 🍌⭐": "gemini-3-pro-image-preview",
+    "Gemini 2.5 Flash Image (Nano Banana) 🍌⚡": "gemini-2.5-flash-image",
+}
+def generate_image(api_key, prompt, model_name, aspect_ratio, safety_filter):
+    """
+    Generate image using Google GenAI API (Gemini image models)
+    Args:
+        api_key: Google AI API key
+        prompt: Text description for image generation
+        model_name: Selected model name
+        aspect_ratio: Aspect ratio for the image
+        safety_filter: Safety filter level (accepted from the UI but not yet applied to the request)
+    Returns:
+        Tuple of (generated image or None, status message)
+    """
+    if not api_key:
+        return None, "❌ Please enter your API key"
+    if not prompt:
+        return None, "❌ Please enter a prompt"
+    try:
+        # Create GenAI client
+        client = genai.Client(api_key=api_key)
+        # Get model ID from selection
+        model_id = MODELS[model_name]
+        # Configure image generation settings
+        config = types.GenerateContentConfig(
+            response_modalities=["IMAGE"],
+            image_config=types.ImageConfig(
+                aspect_ratio=aspect_ratio,
+            ),
+        )
+        # Generate image using Gemini image model
+        response = client.models.generate_content(
+            model=model_id,
+            contents=[prompt],
+            config=config
+        )
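+        # Extract and return the generated image
+        # (response.parts may be None, e.g. when the request is blocked)
+        for part in response.parts or []:
+            if part.inline_data is not None:
+                # Decode the raw bytes into a PIL image, which gr.Image(type="pil") expects
+                image = Image.open(io.BytesIO(part.inline_data.data))
+                return image, f"✅ Image generated successfully!\nModel: {model_name}\nPrompt: {prompt}\nAspect Ratio: {aspect_ratio}"
+        return None, "❌ No image was generated. Please try again with a different prompt."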
+    except Exception as e:
+        error_msg = str(e)
+        if "API_KEY_INVALID" in error_msg or "invalid api key" in error_msg.lower():
+            return None, "❌ Invalid API key. Please check your API key and try again."
+        elif "QUOTA_EXCEEDED" in error_msg or "quota" in error_msg.lower():
+            return None, "❌ Quota exceeded. Please check your API usage limits."
+        elif "permission" in error_msg.lower():
+            return None, "❌ Permission denied. Please ensure your API key has access to Gemini image models."
+        else:
+            return None, f"❌ Error: {error_msg}"
+# Create Gradio interface
+with gr.Blocks(title="Gemini Image Generation", theme=gr.themes.Soft()) as demo:
+    gr.Markdown(
+        """
+        # 🎨 Gemini Image Generation App
+        Generate images using Google's Gemini image models (Nano Banana 🍌).
+        **How to use:**
+        1. Get your API key from [Google AI Studio](https://aistudio.google.com/app/apikey)
+        2. Enter your API key below
+        3. Select your preferred model (3 Pro or 2.5 Flash)
+        4. Write a detailed prompt describing the image you want
+        5. Select aspect ratio
+        6. Click "Generate Image"
+        """
+    )
+    with gr.Row():
+        with gr.Column(scale=1):
+            api_key_input = gr.Textbox(
+                label="🔑 API Key",
+                type="password",
+                placeholder="Enter your Google AI API key",
+                info="Your API key is not stored and only used for this generation"
+            )
+            model_input = gr.Dropdown(
+                label="🤖 Model",
+                choices=list(MODELS.keys()),
+                value="Gemini 3 Pro Image (Nano Banana Pro) 🍌⭐",
+                info="Gemini 3 Pro: Professional quality | 2.5 Flash: Faster generation"
+            )
+            prompt_input = gr.Textbox(
+                label="📝 Prompt",
+                placeholder="A serene landscape with mountains and a lake at sunset...",
+                lines=5,
+                info="Describe the image you want to generate"
+            )
+            aspect_ratio_input = gr.Dropdown(
+                label="📐 Aspect Ratio",
+                choices=["1:1", "16:9", "9:16", "4:3", "3:4"],
+                value="1:1",
+                info="Select the aspect ratio for your image"
+            )
+            safety_filter_input = gr.Dropdown(
+                label="🛡️ Safety Filter",
+                choices=["Default", "Low", "High"],
+                value="Default",
+                info="Content safety filter level"
+            )
+            generate_btn = gr.Button("🎨 Generate Image", variant="primary", size="lg")
+        with gr.Column(scale=1):
+            output_image = gr.Image(
+                label="Generated Image",
+                type="pil",
+                height=500
+            )
+            status_output = gr.Textbox(
+                label="Status",
+                lines=3,
+                interactive=False
+            )
+    gr.Markdown(
+        """
+        ---
+        ### 💡 Tips for better results:
+        - **Choose the right model**: Gemini 3 Pro for professional quality, 2.5 Flash for speed
+        - Be specific and detailed in your prompts
+        - Include style, mood, lighting, and composition details
+        - Mention specific artists or art styles if desired
+        - Experiment with different aspect ratios
+        ### 🍌 Model Comparison:
+        - **Gemini 3 Pro Image (Nano Banana Pro)**: Professional-grade, high-resolution (1K/2K/4K), advanced text rendering
+        - **Gemini 2.5 Flash Image (Nano Banana)**: Fast generation, 1024px resolution, efficient for high-volume tasks
+        ### ⚠️ Notes:
+        - API key is required for each generation
+        - Generation may take 10-30 seconds
+        - API usage may incur costs based on Google's pricing
+        """
+    )
+    # Connect generate button to function
+    generate_btn.click(
+        fn=generate_image,
+        inputs=[api_key_input, prompt_input, model_input, aspect_ratio_input, safety_filter_input],
+        outputs=[output_image, status_output]
+    )
+    # Example prompts
+    gr.Examples(
+        examples=[
+            ["A futuristic cityscape at night with neon lights and flying cars", "16:9"],
+            ["A cute robot playing with a kitten in a garden", "1:1"],
+            ["An abstract painting with vibrant colors and geometric shapes", "4:3"],
+            ["A serene Japanese zen garden with cherry blossoms", "16:9"],
+        ],
+        inputs=[prompt_input, aspect_ratio_input],
+        label="Example Prompts"
+    )
+if __name__ == "__main__":
+    demo.launch()
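
The `except` branch in `generate_image` maps exception text to user-facing messages by substring matching. Pulling that logic into a small pure function would make it testable without calling the API. This is a sketch only: `classify_error` and its category labels are hypothetical, and the matched substrings are the ones used in the commit, not an exhaustive list of what the API can return.

```python
def classify_error(error_msg):
    """Map an exception message to a coarse category.

    Hypothetical helper mirroring the substring checks in app.py.
    Matching is case-insensitive, so "QUOTA_EXCEEDED" and "quota" are
    handled by the same test.
    """
    lowered = error_msg.lower()
    if "api_key_invalid" in lowered or "invalid api key" in lowered:
        return "invalid_key"
    if "quota" in lowered:
        return "quota"
    if "permission" in lowered:
        return "permission"
    return "other"
```

The order of the checks matters in the same way it does in `app.py`: a message that mentions both a quota and a permission problem is reported as a quota error.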

requirements.txt CHANGED

@@ -1,20 +1,3 @@
-# Core dependencies
-gradio==4.19.2
-fastapi
-uvicorn[standard]>=0.20.0
-# Google Gemini API (Nano Banana)
-google-generativeai>=0.8.0
-# Image processing
-pillow>=10.0.0
-numpy>=1.24.0
-# Utilities
-python-dotenv>=1.0.0
-aiofiles>=23.2.1
-requests>=2.28.0
-# Hugging Face integration
-huggingface_hub==0.19.4
-datasets>=2.14.0

+gradio==5.49.1
+google-genai>=1.52.0
+Pillow>=10.0.0
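
Since the dependency list shrinks from many packages to three, a quick local check of what is actually installed can help when a stale environment still carries the old `google-generativeai` package. A minimal sketch using only the standard library; `installed_version` and `report` are hypothetical helper names.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string for a package, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def report(packages=("gradio", "google-genai", "Pillow")):
    """Return a {package: version or None} mapping for the given packages."""
    return {name: installed_version(name) for name in packages}


if __name__ == "__main__":
    for name, found in report().items():
        print(f"{name}: {found or 'not installed'}")
```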