Update README.md
Browse files
README.md
CHANGED
|
@@ -1,8 +1,18 @@
|
|
| 1 |
---
|
| 2 |
-
license: other
|
| 3 |
pipeline_tag: image-to-image
|
|
|
|
|
|
|
| 4 |
---
|
| 5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
|
| 7 |
[中文文档](./README_zh_CN.md)
|
| 8 |
|
|
@@ -533,4 +543,4 @@ We extend our heartfelt gratitude to the following open-source projects and comm
|
|
| 533 |
[](https://github.com/Tencent-Hunyuan/HunyuanImage-3.0)
|
| 534 |
|
| 535 |
|
| 536 |
-
[](https://www.star-history.com/#Tencent-Hunyuan/HunyuanImage-3.0&Date)
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
pipeline_tag: image-to-image
|
| 3 |
+
base_model:
|
| 4 |
+
- tencent/HunyuanImage-3.0-Instruct-Distil
|
| 5 |
---
|
| 6 |
|
| 7 |
+
# What's New
|
| 8 |
+
|
| 9 |
+
Quantized to FP8 using LLM-compressor with similar recipe from [HunyuanImage-3.0-Instruct-Distil-INT8-v2](https://huggingface.co/EricRollei/HunyuanImage-3.0-Instruct-Distil-INT8-v2)
|
| 10 |
+
|
| 11 |
+
Enabling quantized inference with FlashInfer that fits in a single DGX Spark. Tested on a DGX Spark Founder Edition with FlashInfer==0.6.8 and a modified [Comfy_HunyuanImage3](https://github.com/EricRollei/Comfy_HunyuanImage3) node in ComfyUI.
|
| 12 |
+
|
| 13 |
+
Inference time with cot_recaption reduced from ~1400s to ~200s on DGX Spark.
|
| 14 |
+
|
| 15 |
+
# Original README
|
| 16 |
|
| 17 |
[中文文档](./README_zh_CN.md)
|
| 18 |
|
|
|
|
| 543 |
[](https://github.com/Tencent-Hunyuan/HunyuanImage-3.0)
|
| 544 |
|
| 545 |
|
| 546 |
+
[](https://www.star-history.com/#Tencent-Hunyuan/HunyuanImage-3.0&Date)
|