jree423 commited on
Commit
6d2a86e
·
verified ·
1 Parent(s): c158dc0

Update: Add model card for versatile implementation

Browse files
Files changed (1) hide show
  1. README.md +34 -9
README.md CHANGED
@@ -9,11 +9,11 @@ license: mit
9
 
10
  # Diffsketcher - Vector Graphics Generation
11
 
12
- This model generates vector graphics (SVG) from text prompts. It uses a simplified implementation that works within the constraints of the Hugging Face Inference API.
13
 
14
  ## Model Description
15
 
16
- DiffSketcher generates vector graphics (SVG) from text prompts. It uses a diffusion model to guide the SVG generation.
17
 
18
  ## Usage
19
 
@@ -34,17 +34,42 @@ with open("output.png", "wb") as f:
34
 
35
  ## Examples
36
 
 
37
  - "a red sports car"
38
  - "a blue sedan"
39
  - "a black SUV"
40
- - "a yellow convertible"
41
 
42
- ## Limitations
43
-
44
- This is a simplified implementation that:
45
- - Primarily generates car-like SVG images
46
- - Uses CLIP for text encoding when available
47
- - Doesn't require downloading large model weights
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
48
 
49
  ## Citation
50
 
 
9
 
10
  # Diffsketcher - Vector Graphics Generation
11
 
12
+ This model generates vector graphics (SVG) from text prompts. It uses a versatile implementation that analyzes the prompt to determine what type of object to generate.
13
 
14
  ## Model Description
15
 
16
+ DiffSketcher generates vector graphics (SVG) from text prompts. It analyzes the prompt to determine what type of object to generate and creates appropriate SVG images.
17
 
18
  ## Usage
19
 
 
34
 
35
  ## Examples
36
 
37
+ ### Cars
38
  - "a red sports car"
39
  - "a blue sedan"
40
  - "a black SUV"
 
41
 
42
+ ### Landscapes
43
+ - "a mountain landscape with a lake"
44
+ - "a forest with a river"
45
+ - "a beach at sunset"
46
+
47
+ ### Animals
48
+ - "a brown dog"
49
+ - "a black cat"
50
+ - "a colorful bird"
51
+
52
+ ### Buildings
53
+ - "a small house with a garden"
54
+ - "a tall skyscraper"
55
+ - "a medieval castle"
56
+
57
+ ### Faces
58
+ - "a smiling woman"
59
+ - "a man with a beard"
60
+ - "a girl with long hair"
61
+
62
+ ### Abstract
63
+ - "colorful abstract art"
64
+ - "geometric shapes"
65
+ - "vibrant colors and patterns"
66
+
67
+ ## How It Works
68
+
69
+ 1. **Prompt Analysis**: The model analyzes the prompt to determine what type of object to generate.
70
+ 2. **CLIP Integration**: The model uses CLIP to encode the prompt when available.
71
+ 3. **SVG Generation**: Based on the detected object type, the model creates an appropriate SVG.
72
+ 4. **PNG Conversion**: The SVG is converted to PNG for display.
73
 
74
  ## Citation
75