Zhiyu Cheng
AI & ML interests
None yet
Recent Activity
new activity about 22 hours ago
nvidia/GLM-5-NVFP4:Update README.md updated a model 4 days ago
nvidia/GLM-5-NVFP4 updated a model 4 days ago
nvidia/MiniMax-M2.5-NVFP4Organizations
Update README.md
1
#5 opened 4 days ago
by
kaihangj
Update model card with evaluation results
#6 opened 16 days ago
by
jingyux-nv
Fix: add .model after language_model in quantization ignore/exclude_modules
#5 opened about 1 month ago
by
zhiyucheng
Fix: add .model after language_model in quantization ignore/exclude_modules
#4 opened about 1 month ago
by
zhiyucheng
Transformers v5 support
#3 opened 2 months ago
by
nv-fszarwacki
update config for exclude modules
#3 opened 4 months ago
by
shengliangx
update config for exclude modules
#1 opened 4 months ago
by
shengliangx
update config for exclude modules
#3 opened 4 months ago
by
shengliangx
Use actual module path in ignore
2
#2 opened 4 months ago
by
shengliangx
Update README.md
#3 opened 5 months ago
by
alejandrar
Update README.md
#2 opened 5 months ago
by
alejandrar
Update README.md
1
#2 opened 5 months ago
by
alejandrar
Update README.md
#1 opened 6 months ago
by
huizimao
Update README.md
1
#2 opened 11 months ago
by
RestingCodeFace
Update README.md
#1 opened 11 months ago
by
omrialmog
Request for Detailed Benchmarking Setup with TensorRT-LLM on B200
➕ 4
1
#6 opened about 1 year ago
by
StardusterLiu
Benchmark results compared to orig fp8 / int4 quants etc?
➕ 15
6
#1 opened about 1 year ago
by
CHNtentes
censored or uncensored
5
#5 opened about 1 year ago
by
harisnaeem
Add library_name
1
#4 opened about 1 year ago
by
nielsr
Update README.md
#1 opened about 1 year ago
by
omrialmog