GadflyII commited on
Commit
4f9ffce
·
verified ·
1 Parent(s): 3b2e2f2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -14,7 +14,9 @@ tags:
14
  library_name: transformers
15
  pipeline_tag: text-generation
16
  ---
17
- # Note: If you have a multi-GPU SM120 Blackwell system (RTX 50/Pro), try my vLLM fork to resolve P2P / TP=2 issues (Pending PR into upstream). You will also need to use this fork for MXFP4 support on SM120.
 
 
18
  https://github.com/Gadflyii/vllm/tree/main
19
 
20
  # GLM-4.7-Flash MXFP4
 
14
  library_name: transformers
15
  pipeline_tag: text-generation
16
  ---
17
+ # Note: If you have a multi-GPU SM120 Blackwell system (RTX 50/Pro), try my vLLM fork to resolve P2P / TP=2 issues (Pending PR into upstream).
18
+ # Note: If you are running this MXFP4 model on SM120 GPU's, you also will need to use my fork until PR into upstream is merged, however it is significantly slower than NVFP4.
19
+
20
  https://github.com/Gadflyii/vllm/tree/main
21
 
22
  # GLM-4.7-Flash MXFP4