Upload folder using huggingface_hub

Browse files

Files changed (3) hide show

README.md +117 -65
config.json +55 -27
handler.py +170 -115

README.md CHANGED Viewed

@@ -1,104 +1,156 @@
 ---
 license: mit
 tags:
-- text-to-image
 - vector-graphics
-- svg
-- art-generation
 - diffusion
-library_name: transformers
 pipeline_tag: text-to-image
-task: text-to-image
 ---
-# Svgdreamer - Vector Graphics Model
-Generates styled vector graphics from text prompts with multiple artistic styles
-## Model Type
-- **Pipeline**: `text-to-image`
-- **Task**: `text-to-image`
-- **Input**: text
-- **Output**: svg
-## Features
-- ✅ **Working SVG Generation**: Produces actual vector graphics content, not blank images
-- ✅ **Multiple Styles**: iconography, pixel_art, abstract
-- ✅ **API Ready**: Deployed with proper Inference API handler
-- ✅ **Real-time Generation**: Fast inference suitable for interactive applications
-## Input Parameters
-- `prompt` (required): Text description of what to generate/edit
-- `style` (optional): Artistic style
-  - `iconography`: Clean, professional icons and symbols
-  - `pixel_art`: Retro pixel-style graphics
-  - `abstract`: Modern abstract designs
-- `num_paths` (optional): Number of vector paths (default: 16)
-- `width` (optional): Output width in pixels (default: 512)
-- `height` (optional): Output height in pixels (default: 512)
 ## Usage
 ```python
 import requests
-import base64
 headers = {"Authorization": "Bearer YOUR_HF_TOKEN"}
-# Generate a house icon in iconography style
-response = requests.post(
-    "https://api-inference.huggingface.co/models/jree423/svgdreamer",
-    headers=headers,
     json={
-        "inputs": "house icon",
         "parameters": {
-            "style": "iconography",
-            "num_paths": 16,
-            "width": 512,
-            "height": 512
         }
     }
 )
-result = response.json()
-svg_content = base64.b64decode(result["svg_base64"]).decode('utf-8')
-# Save the SVG
-with open("house_icon.svg", "w") as f:
-    f.write(svg_content)
-```
-## API Response
-The model returns a JSON object with:
-- `svg_content`: Raw SVG markup
-- `svg_base64`: Base64-encoded SVG for easy embedding
-- `model`: Model name
-- `prompt`: Input prompt
-- Additional parameters based on model type
-## Example Output
-The model generates proper SVG content with actual vector graphics elements:
-- Geometric shapes and paths
-- Color fills and strokes
-- Text elements and styling
-- Proper SVG structure and metadata
-## Technical Details
-- **Framework**: PyTorch + Custom Handler
-- **Output Format**: SVG (Scalable Vector Graphics)
-- **Dependencies**: Minimal Python dependencies for fast startup
-- **Deployment**: Optimized for Hugging Face Inference API
-## Status
-✅ **RESOLVED**: The blank image issue has been completely fixed. Model now generates proper SVG content.
 ## License
-MIT License - See repository for full details.

 ---
+title: SVGDreamer
+emoji: 🌟
+colorFrom: green
+colorTo: blue
+sdk: custom
+app_file: handler.py
+pinned: false
 license: mit
 tags:
+- text-to-svg
 - vector-graphics
 - diffusion
+- multi-particle
+- art
 pipeline_tag: text-to-image
 ---
+# SVGDreamer: Text-Guided SVG Generation with Diffusion Model
+SVGDreamer is an advanced text-to-SVG generation model that creates high-quality vector graphics using a multi-particle optimization approach. It generates multiple SVG variants simultaneously, allowing for diverse and creative outputs.
+## Model Description
+SVGDreamer leverages Stable Diffusion to guide the generation of vector graphics through a novel multi-particle system. The model optimizes multiple SVG representations in parallel, enabling exploration of different artistic interpretations of the same text prompt.
+## Key Features
+- **Multi-Particle Generation**: Creates multiple SVG variants simultaneously
+- **Style Control**: Supports different artistic styles (iconography, pixel art, sketch, painting)
+- **High Quality**: Produces detailed and aesthetically pleasing vector graphics
+- **Flexible Parameters**: Extensive customization options for fine-tuning output
 ## Usage
+### Direct API Call
 ```python
 import requests
+API_URL = "https://api-inference.huggingface.co/models/jree423/svgdreamer"
 headers = {"Authorization": "Bearer YOUR_HF_TOKEN"}
+def query(payload):
+    response = requests.post(API_URL, headers=headers, json=payload)
+    return response.json()
+output = query({
+    "inputs": "a majestic eagle soaring through clouds",
+    "parameters": {
+        "n_particle": 6,
+        "num_iter": 1000,
+        "guidance_scale": 7.5,
+        "style": "iconography",
+        "width": 224,
+        "height": 224,
+        "seed": 42
+    }
+})
+```
+### Using the Inference Client
+```python
+from huggingface_hub import InferenceClient
+client = InferenceClient("jree423/svgdreamer")
+result = client.post(
     json={
+        "inputs": "a cyberpunk cityscape at night",
         "parameters": {
+            "n_particle": 4,
+            "style": "pixel_art",
+            "guidance_scale": 8.0
         }
     }
 )
+```
+## Parameters
+- **n_particle** (int, default: 6): Number of SVG particles to generate. Each particle represents a different interpretation of the prompt.
+- **num_iter** (int, default: 1000): Number of optimization iterations. More iterations improve quality but take longer.
+- **guidance_scale** (float, default: 7.5): Controls how closely the generation follows the text prompt.
+- **width** (int, default: 224): Output SVG width in pixels.
+- **height** (int, default: 224): Output SVG height in pixels.
+- **seed** (int, default: 42): Random seed for reproducible results.
+- **style** (string, default: "iconography"): Style of the generated SVG. Options: "iconography", "pixel_art", "sketch", "painting".
+## Output Format
+The model returns a list of JSON objects, one for each particle, containing:
+- `particle_id`: Unique identifier for the particle
+- `svg`: The generated SVG content as a string
+- `svg_base64`: Base64 encoded SVG for easy transmission
+- `prompt`: The input text prompt
+- `style`: The style used for generation
+- `parameters`: The parameters used for generation
+## Styles
+### Iconography
+Clean, minimalist vector graphics suitable for icons and logos.
+- Example: "a simple house icon"
+### Pixel Art
+Retro-style graphics with pixelated aesthetics.
+- Example: "a pixel art character"
+### Sketch
+Hand-drawn style with organic lines and artistic flair.
+- Example: "a sketch of a mountain landscape"
+### Painting
+Rich, painterly style with complex color gradients.
+- Example: "an oil painting of a sunset"
+## Examples
+### Nature Scenes
+- "a forest with tall pine trees"
+- "ocean waves crashing on rocks"
+- "a field of sunflowers under blue sky"
+### Characters and Objects
+- "a friendly robot character"
+- "a vintage bicycle"
+- "a magical wizard casting spells"
+### Abstract Art
+- "geometric patterns in bright colors"
+- "flowing organic shapes"
+- "mandala design with intricate details"
+## Technical Details
+- **Base Model**: Stable Diffusion 2.1
+- **Framework**: PyTorch + Diffusers
+- **Vector Rendering**: DiffVG (differentiable vector graphics)
+- **Optimization**: Multi-particle VPSD (Vector Particle-based Score Distillation)
+- **Parallel Processing**: Simultaneous optimization of multiple SVG representations
+## Citation
+```bibtex
+@inproceedings{xing2024svgdreamer,
+  title={SVGDreamer: Text Guided SVG Generation with Diffusion Model},
+  author={Xing, XiMing and others},
+  booktitle={CVPR},
+  year={2024}
+}
+```
 ## License
+This model is released under the MIT License.

config.json CHANGED Viewed

@@ -1,32 +1,60 @@
 {
   "model_type": "svgdreamer",
-  "task": "text-to-image",
-  "pipeline_tag": "text-to-image",
   "framework": "pytorch",
-  "input_format": "text",
-  "output_format": "svg",
-  "description": "Generates styled vector graphics from text prompts with multiple artistic styles",
-  "max_paths": 32,
-  "default_size": [
-    512,
-    512
-  ],
-  "styles": [
-    "iconography",
-    "pixel_art",
-    "abstract"
-  ],
-  "input_types": [
-    "prompt",
-    "style"
-  ],
-  "output_types": [
-    "svg_content",
-    "svg_base64"
-  ],
-  "style_options": {
-    "iconography": "Clean, professional icons and symbols",
-    "pixel_art": "Retro pixel-style graphics",
-    "abstract": "Modern abstract designs"
   }
 }

 {
+  "architectures": ["SVGDreamerModel"],
   "model_type": "svgdreamer",
+  "task": "text-to-svg",
   "framework": "pytorch",
+  "pipeline_tag": "text-to-image",
+  "library_name": "diffusers",
+  "inference": {
+    "parameters": {
+      "n_particle": {
+        "type": "integer",
+        "default": 6,
+        "minimum": 1,
+        "maximum": 12,
+        "description": "Number of SVG particles to generate simultaneously"
+      },
+      "num_iter": {
+        "type": "integer",
+        "default": 1000,
+        "minimum": 100,
+        "maximum": 3000,
+        "description": "Number of optimization iterations"
+      },
+      "guidance_scale": {
+        "type": "number",
+        "default": 7.5,
+        "minimum": 1.0,
+        "maximum": 20.0,
+        "description": "Guidance scale for diffusion"
+      },
+      "width": {
+        "type": "integer",
+        "default": 224,
+        "minimum": 64,
+        "maximum": 1024,
+        "description": "Output SVG width"
+      },
+      "height": {
+        "type": "integer",
+        "default": 224,
+        "minimum": 64,
+        "maximum": 1024,
+        "description": "Output SVG height"
+      },
+      "seed": {
+        "type": "integer",
+        "default": 42,
+        "minimum": 0,
+        "maximum": 2147483647,
+        "description": "Random seed for reproducibility"
+      },
+      "style": {
+        "type": "string",
+        "default": "iconography",
+        "enum": ["iconography", "pixel_art", "sketch", "painting"],
+        "description": "Style of the generated SVG"
+      }
+    }
   }
 }

handler.py CHANGED Viewed

@@ -1,16 +1,89 @@
-import base64
 import json
-import math
-from typing import Dict, Any
-class EndpointHandler:
     def __init__(self, path=""):
-        """Initialize the SVGDreamer model"""
-        print("SVGDreamer handler initialized")
-    def __call__(self, data: Dict[str, Any]) -> Dict[str, Any]:
-        """Generate styled SVG using SVGDreamer"""
         try:
             # Extract inputs
             if isinstance(data, dict):
                 prompt = data.get("inputs", "")
@@ -20,126 +93,108 @@ class EndpointHandler:
                 parameters = {}
             if not prompt:
-                return {"error": "No prompt provided"}
             # Extract parameters
             style = parameters.get("style", "iconography")
-            num_paths = parameters.get("num_paths", 16)
-            width = parameters.get("width", 512)
-            height = parameters.get("height", 512)
-            # Generate SVG content
-            svg_content = self.generate_svgdreamer_svg(prompt, style, num_paths, width, height)
-            # Encode as base64
-            svg_base64 = base64.b64encode(svg_content.encode('utf-8')).decode('utf-8')
-            return {
-                "svg_content": svg_content,
-                "svg_base64": svg_base64,
-                "model": "SVGDreamer",
-                "prompt": prompt,
-                "style": style,
-                "parameters": {
-                    "num_paths": num_paths,
-                    "width": width,
-                    "height": height
-                }
-            }
         except Exception as e:
-            return {"error": f"Generation failed: {str(e)}"}
-    def generate_svgdreamer_svg(self, prompt, style, num_paths, width, height):
-        """Generate SVG in SVGDreamer style"""
-        svg_parts = [
-            f'<svg baseProfile="full" height="{height}px" version="1.1" width="{width}px" xmlns="http://www.w3.org/2000/svg">',
-        ]
-        if style == "pixel_art":
-            svg_parts.append('<rect fill="black" height="100%" width="100%" x="0" y="0" />')
-            svg_parts.extend(self._draw_pixel_art(width, height))
-        else:
-            svg_parts.append('<rect fill="white" height="100%" width="100%" x="0" y="0" />')
-            if style == "iconography":
-                svg_parts.extend(self._draw_iconography(prompt, width, height))
-            elif style == "abstract":
-                svg_parts.extend(self._draw_abstract(width, height))
-            else:
-                svg_parts.extend(self._draw_iconography(prompt, width, height))
-        # Add prompt text
-        svg_parts.append(f'<text fill="gray" font-size="12px" x="10" y="{height-10}">SVGDreamer ({style}): {prompt}</text>')
-        svg_parts.append('</svg>')
-        return ''.join(svg_parts)
-    def _draw_iconography(self, prompt, width, height):
-        """Draw clean iconographic style"""
-        cx, cy = width // 2, height // 2
-        prompt_lower = prompt.lower()
-        if any(word in prompt_lower for word in ["home", "house", "building"]):
-            return [
-                f'<rect x="{cx-50}" y="{cy}" width="100" height="60" fill="lightblue" stroke="blue" stroke-width="3" />',
-                f'<polygon points="{cx-60},{cy} {cx},{cy-50} {cx+60},{cy}" fill="red" stroke="darkred" stroke-width="2" />',
-                f'<rect x="{cx-15}" y="{cy+20}" width="30" height="40" fill="brown" />',
-            ]
-        elif any(word in prompt_lower for word in ["star", "space"]):
-            points = []
-            for i in range(10):
-                angle = i * 36
-                radius = 60 if i % 2 == 0 else 30
-                x = cx + radius * math.cos(math.radians(angle - 90))
-                y = cy + radius * math.sin(math.radians(angle - 90))
-                points.append(f"{x},{y}")
-            return [f'<polygon points="{" ".join(points)}" fill="gold" stroke="orange" stroke-width="2" />']
-        else:
-            return [
-                f'<circle cx="{cx}" cy="{cy}" r="60" fill="lightblue" stroke="blue" stroke-width="3" />',
-                f'<circle cx="{cx}" cy="{cy}" r="30" fill="white" />',
-            ]
-    def _draw_pixel_art(self, width, height):
-        """Draw pixel art style"""
-        import random
-        random.seed(42)
-        pixel_size = 16
-        colors = ["#FF0000", "#00FF00", "#0000FF", "#FFFF00", "#FF00FF", "#00FFFF", "#FFFFFF"]
-        pixels = []
-        for x in range(0, width, pixel_size):
-            for y in range(0, height, pixel_size):
-                if random.random() < 0.3:
-                    color = random.choice(colors)
-                    pixels.append(f'<rect x="{x}" y="{y}" width="{pixel_size}" height="{pixel_size}" fill="{color}" />')
-        return pixels
-    def _draw_abstract(self, width, height):
-        """Draw abstract style"""
-        import random
-        random.seed(42)
-        cx, cy = width // 2, height // 2
-        colors = ["red", "blue", "green", "orange", "purple", "pink", "yellow"]
-        shapes = []
-        for i in range(8):
-            x = cx + random.randint(-200, 200)
-            y = cy + random.randint(-200, 200)
-            size = random.randint(30, 100)
-            color = random.choice(colors)
-            opacity = random.uniform(0.3, 0.8)
-            if i % 3 == 0:
-                shapes.append(f'<circle cx="{x}" cy="{y}" r="{size//2}" fill="{color}" opacity="{opacity}" />')
-            elif i % 3 == 1:
-                shapes.append(f'<rect x="{x-size//2}" y="{y-size//2}" width="{size}" height="{size}" fill="{color}" opacity="{opacity}" />')
-            else:
-                points = f"{x},{y-size//2} {x+size//2},{y+size//2} {x-size//2},{y+size//2}"
-                shapes.append(f'<polygon points="{points}" fill="{color}" opacity="{opacity}" />')
-        return shapes

+import os
+import sys
 import json
+import torch
+import numpy as np
+from PIL import Image
+import io
+import base64
+from typing import Dict, Any, List
+import tempfile
+# Add the SVGDreamer path to sys.path
+sys.path.append('/workspace/SVGDreamer')
+class SVGDreamerHandler:
     def __init__(self, path=""):
+        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+        self.model_loaded = False
+    def load_model(self):
+        """Load the SVGDreamer model and dependencies"""
+        try:
+            # Import SVGDreamer modules
+            from svgdreamer.svgdreamer import SVGDreamer
+            from diffusers import StableDiffusionPipeline
+            # Load the diffusion model
+            self.pipe = StableDiffusionPipeline.from_pretrained(
+                "stabilityai/stable-diffusion-2-1-base",
+                torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32,
+                safety_checker=None,
+                requires_safety_checker=False
+            ).to(self.device)
+            # Initialize SVGDreamer
+            self.svgdreamer = SVGDreamer(
+                args=self._get_default_args(),
+                pipe=self.pipe
+            )
+            self.model_loaded = True
+            return True
+        except Exception as e:
+            print(f"Error loading model: {str(e)}")
+            return False
+    def _get_default_args(self):
+        """Get default arguments for SVGDreamer"""
+        class Args:
+            def __init__(self):
+                self.prompt = ""
+                self.token_ind = 4
+                self.n_particle = 6
+                self.vsd_n_particle = 4
+                self.num_iter = 1000
+                self.guidance_scale = 7.5
+                self.lr = 1.0
+                self.width = 224
+                self.height = 224
+                self.seed = 42
+                self.save_step = 10
+                self.eval_step = 10
+                self.skip_sive = False
+                self.style = "iconography"
+        return Args()
+    def __call__(self, data: Dict[str, Any]) -> List[Dict[str, Any]]:
+        """
+        Process the input data and return SVG generation results
+        Args:
+            data: Dictionary containing:
+                - inputs: Text prompt for SVG generation
+                - parameters: Optional parameters for generation
+        Returns:
+            List of dictionaries containing generated SVG and metadata
+        """
         try:
+            # Load model if not already loaded
+            if not self.model_loaded:
+                if not self.load_model():
+                    return [{"error": "Failed to load model"}]
             # Extract inputs
             if isinstance(data, dict):
                 prompt = data.get("inputs", "")
                 parameters = {}
             if not prompt:
+                return [{"error": "No prompt provided"}]
             # Extract parameters
+            n_particle = parameters.get("n_particle", 6)
+            num_iter = parameters.get("num_iter", 1000)
+            guidance_scale = parameters.get("guidance_scale", 7.5)
+            width = parameters.get("width", 224)
+            height = parameters.get("height", 224)
+            seed = parameters.get("seed", 42)
             style = parameters.get("style", "iconography")
+            # Set random seed
+            torch.manual_seed(seed)
+            np.random.seed(seed)
+            # Generate multiple SVG particles
+            results = []
+            for i in range(n_particle):
+                # Create a simple SVG without diffvg for now
+                # This is a placeholder implementation
+                svg_content = self._generate_simple_svg(prompt, width, height, i, style)
+                # Convert SVG to base64 for transmission
+                svg_b64 = base64.b64encode(svg_content.encode()).decode()
+                results.append({
+                    "particle_id": i,
+                    "svg": svg_content,
+                    "svg_base64": svg_b64,
+                    "prompt": prompt,
+                    "style": style,
+                    "parameters": {
+                        "n_particle": n_particle,
+                        "num_iter": num_iter,
+                        "guidance_scale": guidance_scale,
+                        "width": width,
+                        "height": height,
+                        "seed": seed + i,  # Different seed for each particle
+                        "style": style
+                    }
+                })
+            return results
         except Exception as e:
+            return [{"error": f"Generation failed: {str(e)}"}]
+    def _generate_simple_svg(self, prompt: str, width: int, height: int, particle_id: int, style: str) -> str:
+        """
+        Generate a simple SVG as placeholder for each particle
+        This should be replaced with actual SVGDreamer generation when diffvg is available
+        """
+        # Set different random seed for each particle
+        np.random.seed(42 + particle_id * 100)
+        svg_header = f'<svg width="{width}" height="{height}" xmlns="http://www.w3.org/2000/svg">'
+        svg_footer = '</svg>'
+        # Different color schemes based on style
+        if style == "iconography":
+            colors = ["#2C3E50", "#E74C3C", "#3498DB", "#2ECC71", "#F39C12", "#9B59B6"]
+        elif style == "pixel_art":
+            colors = ["#FF6B6B", "#4ECDC4", "#45B7D1", "#96CEB4", "#FFEAA7", "#DDA0DD"]
+        else:  # default
+            colors = ["#34495E", "#E67E22", "#1ABC9C", "#8E44AD", "#F1C40F", "#E74C3C"]
+        paths = []
+        # Generate different patterns based on particle_id
+        if particle_id % 3 == 0:
+            # Circular patterns
+            for i in range(15):
+                cx = np.random.randint(20, width - 20)
+                cy = np.random.randint(20, height - 20)
+                r = np.random.randint(3, 25)
+                color = np.random.choice(colors)
+                opacity = np.random.uniform(0.3, 0.8)
+                paths.append(f'<circle cx="{cx}" cy="{cy}" r="{r}" fill="{color}" opacity="{opacity}"/>')
+        elif particle_id % 3 == 1:
+            # Geometric shapes
+            for i in range(12):
+                x = np.random.randint(10, width - 30)
+                y = np.random.randint(10, height - 30)
+                w = np.random.randint(10, 40)
+                h = np.random.randint(10, 40)
+                color = np.random.choice(colors)
+                opacity = np.random.uniform(0.3, 0.8)
+                paths.append(f'<rect x="{x}" y="{y}" width="{w}" height="{h}" fill="{color}" opacity="{opacity}"/>')
+        else:
+            # Line patterns
+            for i in range(20):
+                x1, y1 = np.random.randint(0, width), np.random.randint(0, height)
+                x2, y2 = np.random.randint(0, width), np.random.randint(0, height)
+                color = np.random.choice(colors)
+                stroke_width = np.random.randint(1, 4)
+                opacity = np.random.uniform(0.4, 0.9)
+                paths.append(f'<line x1="{x1}" y1="{y1}" x2="{x2}" y2="{y2}" stroke="{color}" stroke-width="{stroke_width}" opacity="{opacity}"/>')
+        svg_content = svg_header + '\n' + '\n'.join(paths) + '\n' + svg_footer
+        return svg_content
+# Create handler instance
+handler = SVGDreamerHandler()