Committed by RedstoneWhite
Commit 4f7de87 · verified · 1 parent: 339455b

Update README.md

Files changed (1)
  1. README.md +12 -2
README.md CHANGED
@@ -1,8 +1,18 @@
 ---
-license: other
 pipeline_tag: image-to-image
+base_model:
+- tencent/HunyuanImage-3.0-Instruct-Distil
 ---
 
+# What's New
+
+Quantized to FP8 using LLM-compressor, with a recipe similar to that of [HunyuanImage-3.0-Instruct-Distil-INT8-v2](https://huggingface.co/EricRollei/HunyuanImage-3.0-Instruct-Distil-INT8-v2).
+
+This enables quantized inference with FlashInfer that fits on a single DGX Spark. Tested on a DGX Spark Founders Edition with FlashInfer==0.6.8 and a modified [Comfy_HunyuanImage3](https://github.com/EricRollei/Comfy_HunyuanImage3) node in ComfyUI.
+
+Inference time with cot_recaption is reduced from ~1400s to ~200s on DGX Spark.
+
+# Original README
 
 [中文文档](./README_zh_CN.md)
 
@@ -533,4 +543,4 @@ We extend our heartfelt gratitude to the following open-source projects and comm
 [![GitHub forks](https://img.shields.io/github/forks/Tencent-Hunyuan/HunyuanImage-3.0?style=social)](https://github.com/Tencent-Hunyuan/HunyuanImage-3.0)
 
 
-[![Star History Chart](https://api.star-history.com/svg?repos=Tencent-Hunyuan/HunyuanImage-3.0&type=Date)](https://www.star-history.com/#Tencent-Hunyuan/HunyuanImage-3.0&Date)
+[![Star History Chart](https://api.star-history.com/svg?repos=Tencent-Hunyuan/HunyuanImage-3.0&type=Date)](https://www.star-history.com/#Tencent-Hunyuan/HunyuanImage-3.0&Date)
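
Reviewer note: the README text added in this commit quotes a drop in inference time from ~1400s to ~200s and says the FP8 model fits on a single DGX Spark. The sketch below only does the arithmetic behind those two statements. The timings come from the commit. The parameter count (about 80 billion) and the device memory (128 GB) are assumptions that are not stated anywhere in this commit, and the helper names are hypothetical. The estimate covers weights only and ignores activations, caches, and runtime overhead.

```python
# Back-of-the-envelope check of the numbers quoted in the README change.

# Approximate timings quoted in the commit, in seconds.
BASELINE_S = 1400.0
QUANTIZED_S = 200.0

# ASSUMPTION (not stated in this commit): roughly 80 billion parameters.
ASSUMED_PARAMS = 80e9
# ASSUMPTION (not stated in this commit): 128 GB of memory on the device.
ASSUMED_MEMORY_GB = 128.0


def speedup(before_s: float, after_s: float) -> float:
    """Return how many times faster the 'after' run is than the 'before' run."""
    if before_s <= 0 or after_s <= 0:
        raise ValueError("timings must be positive")
    return before_s / after_s


def weight_footprint_gb(num_params: float, bytes_per_param: float) -> float:
    """Return the size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(f"speedup: ~{speedup(BASELINE_S, QUANTIZED_S):.1f}x")
    for name, nbytes in (("BF16", 2), ("FP8", 1)):
        size = weight_footprint_gb(ASSUMED_PARAMS, nbytes)
        verdict = "fit" if size < ASSUMED_MEMORY_GB else "do not fit"
        print(f"{name}: ~{size:.0f} GB of weights, {verdict} in {ASSUMED_MEMORY_GB:.0f} GB")
```

Under these assumptions, halving the bytes per weight is what brings the weights under the memory budget. The quoted timings were measured with a different runtime setup (FlashInfer plus a modified ComfyUI node), so the speedup should not be attributed to FP8 alone.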