Alae65 committed on
Commit 66b053e · verified · 1 Parent(s): 2d60280

Add comprehensive documentation for HunyuanImage-3.0

Files changed (1)
  1. README.md +69 -5
README.md CHANGED
@@ -1,13 +1,77 @@
1
  ---
2
- title: HunyuanImage 3
3
- emoji: 🖼
4
  colorFrom: purple
5
- colorTo: red
6
  sdk: gradio
7
  sdk_version: 5.44.0
8
  app_file: app.py
9
  pinned: false
10
- short_description: Text-to-Image generation using Tencent HunyuanImage-3.0 mode
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
1
  ---
2
+ title: HunyuanImage-3.0
3
+ emoji: 🎨
4
  colorFrom: purple
5
+ colorTo: blue
6
  sdk: gradio
7
  sdk_version: 5.44.0
8
  app_file: app.py
9
  pinned: false
10
+ short_description: Text-to-Image generation using Tencent HunyuanImage-3.0
11
  ---
12
 
13
+ # 🎨 HunyuanImage-3.0 Text-to-Image Generation
14
+
15
+ This Space provides an interface for the **Tencent HunyuanImage-3.0** model, a powerful native multimodal model for image generation.
16
+
17
+ ## About HunyuanImage-3.0
18
+
19
+ HunyuanImage-3.0 is a groundbreaking model that:
20
+ - Features 80B total parameters with 13B activated per token (MoE architecture)
21
+ - Unifies multimodal understanding and generation in an autoregressive framework
22
+ - Achieves performance comparable to leading closed-source models
23
+ - Supports intelligent prompt understanding and automatic elaboration
24
+
25
+ ## ⚠️ Important Notes
26
+
27
+ **Hardware Requirements:**
28
+ - Direct inference requires at least **3×80GB of GPU memory** (240GB total)
29
+ - ZeroGPU does not provide enough memory for full model inference
30
+ - For production use, consider:
31
+   - Using Inference API endpoints
32
+   - Deploying on appropriate hardware (4×80GB GPUs recommended)
33
+   - Using inference providers such as FAL AI
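
A back-of-the-envelope check on these memory figures, as a minimal sketch: it assumes the weights are stored in bf16 (2 bytes per parameter), which the README does not state, and it ignores activations, caches, and framework overhead. The helper names are hypothetical.

```python
import math


def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def gpus_needed(total_gb: float, gpu_gb: float = 80.0) -> int:
    """Smallest number of GPUs whose combined memory covers total_gb."""
    return math.ceil(total_gb / gpu_gb)


# In a MoE model all experts are normally kept in memory, so the full 80B
# parameters count here even though only 13B are active per token.
weights_gb = weight_memory_gb(80e9)  # assumption: bf16, 2 bytes per parameter
print(weights_gb, gpus_needed(weights_gb))  # prints "160.0 2"
```

Under that assumption the weights alone fill two 80GB GPUs, which is consistent with the README asking for a third (and recommending a fourth) to leave room for everything else.
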
34
+
35
+ **Current Implementation:**
36
+ This Space demonstrates only the UI structure and configuration. Actual inference requires one of the following:
37
+ 1. Load the model on hardware that meets the requirements above
38
+ 2. Integrate with the Inference API or an inference provider
39
+ 3. Apply model quantization to reduce the memory footprint
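
Option 2 in the list above could look roughly like the sketch below. It assumes a client object exposing a `text_to_image(prompt, ...)` method, modeled on `huggingface_hub.InferenceClient`. Whether any provider actually serves `tencent/HunyuanImage-3.0` is not confirmed by this README, and the `generate` helper and its defaults are hypothetical.

```python
from typing import Any

MODEL_ID = "tencent/HunyuanImage-3.0"


def generate(client: Any, prompt: str, width: int = 1024, height: int = 1024,
             steps: int = 50) -> Any:
    """Delegate generation to a remote client instead of loading the model locally.

    `client` is any object with a text_to_image method. The default step count
    is an illustrative assumption, not a value taken from the README.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return client.text_to_image(
        prompt,
        model=MODEL_ID,
        width=width,
        height=height,
        num_inference_steps=steps,
    )
```

With `huggingface_hub` installed, the intended call would be something like `generate(InferenceClient(provider="fal-ai"), "a red fox in the snow")`, assuming that provider hosts the model. Passing the client in as an argument keeps the Space's UI code independent of which backend ends up serving requests.
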
40
+
41
+ ## Model Information
42
+
43
+ - **Model:** [tencent/HunyuanImage-3.0](https://huggingface.co/tencent/HunyuanImage-3.0)
44
+ - **Architecture:** Autoregressive MoE (64 experts)
45
+ - **Parameters:** 80B total, 13B active per token
46
+ - **License:** tencent-hunyuan-community
47
+ - **Paper:** [arXiv:2509.23951](https://arxiv.org/abs/2509.23951)
48
+
49
+ ## Features
50
+
51
+ - 🎯 Advanced prompt understanding
52
+ - 🖼️ Multiple resolution support (auto, 1024x1024, 1280x768, 768x1280)
53
+ - 🎲 Seed control for reproducibility
54
+ - ⚙️ Configurable diffusion steps
55
+ - 📝 Example prompts included
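
The resolution and seed options in this list might be handled along the following lines. This is a sketch only: the function names are hypothetical, the README does not say whether strings like `1280x768` mean width by height or height by width (so the numbers are returned in the order written), and returning `None` for `auto` is an assumed convention for letting the model pick a size.

```python
import random
from typing import Optional, Tuple

RESOLUTIONS = ("auto", "1024x1024", "1280x768", "768x1280")


def parse_resolution(choice: str) -> Optional[Tuple[int, int]]:
    """Return the two dimensions in the order written, or None for "auto"."""
    if choice not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {choice!r}")
    if choice == "auto":
        return None
    first, second = choice.split("x")
    return int(first), int(second)


def resolve_seed(seed: Optional[int]) -> int:
    """Keep a user-supplied seed; otherwise draw one so the run can be repeated."""
    if seed is None:
        return random.randint(0, 2**32 - 1)
    return seed
```

Returning the drawn seed, rather than seeding silently, lets the UI display it so a user can reproduce a result they like.
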
56
+
57
+ ## API Endpoint (Coming Soon)
58
+
59
+ This Space will support API endpoints for integration with n8n and other workflow tools.
60
+
61
+ ## Links
62
+
63
+ - [Official Website](https://hunyuan.tencent.com/image)
64
+ - [GitHub Repository](https://github.com/Tencent-Hunyuan/HunyuanImage-3.0)
65
+ - [Technical Paper](https://arxiv.org/pdf/2509.23951)
66
+ - [Model Card](https://huggingface.co/tencent/HunyuanImage-3.0)
67
+
68
+ ## Citation
69
+
70
+ ```bibtex
71
+ @article{cao2025hunyuanimage,
72
+ title={HunyuanImage 3.0 Technical Report},
73
+ author={Cao, Siyu and Chen, Hangting and others},
74
+ journal={arXiv preprint arXiv:2509.23951},
75
+ year={2025}
76
+ }
77
+ ```