Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,9 @@ tags:
|
|
| 14 |
library_name: transformers
|
| 15 |
pipeline_tag: text-generation
|
| 16 |
---
|
| 17 |
-
# Note: If you have a multi-GPU SM120 Blackwell system (RTX 50/Pro), try my vLLM fork to resolve P2P / TP=2 issues (Pending PR into upstream).
|
|
|
|
|
|
|
| 18 |
https://github.com/Gadflyii/vllm/tree/main
|
| 19 |
|
| 20 |
# GLM-4.7-Flash MXFP4
|
|
|
|
| 14 |
library_name: transformers
|
| 15 |
pipeline_tag: text-generation
|
| 16 |
---
|
| 17 |
+
# Note: If you have a multi-GPU SM120 Blackwell system (RTX 50/Pro), try my vLLM fork to resolve P2P / TP=2 issues (Pending PR into upstream).
|
| 18 |
+
# Note: If you are running this MXFP4 model on SM120 GPU's, you also will need to use my fork until PR into upstream is merged, however it is significantly slower than NVFP4.
|
| 19 |
+
|
| 20 |
https://github.com/Gadflyii/vllm/tree/main
|
| 21 |
|
| 22 |
# GLM-4.7-Flash MXFP4
|