Zhiyu Cheng

zhiyucheng

nvidia

Recent Activity

updated a model 16 days ago

nvidia/GLM-5.2-NVFP4

liked a model 17 days ago

nvidia/MiniMax-M3-NVFP4

liked a model 17 days ago

nvidia/GLM-5.2-NVFP4

New activity in nvidia/DeepSeek-V4-Flash-NVFP4 28 days ago

docs: add verified vLLM Docker image to Deploy section

#2 opened 28 days ago by

New activity in nvidia/Gemma-4-31B-IT-NVFP4 3 months ago

Update chat_template.jinja

#11 opened 3 months ago by

Update tokenizer_config.json

#12 opened 3 months ago by

Update README.md

#10 opened 3 months ago by

tstarkey-nvidia

New activity in nvidia/GLM-5-NVFP4 3 months ago

Update README.md

#5 opened 3 months ago by

New activity in nvidia/Kimi-K2.5-NVFP4 4 months ago

Update model card with evaluation results

#6 opened 4 months ago by

Fix: add .model after language_model in quantization ignore/exclude_modules

#5 opened 4 months ago by

Fix: add .model after language_model in quantization ignore/exclude_modules

#4 opened 4 months ago by

New activity in nvidia/Kimi-K2-Thinking-NVFP4 5 months ago

Transformers v5 support

#3 opened 5 months ago by

New activity in nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4 7 months ago

update config for exclude modules

#3 opened 7 months ago by

New activity in nvidia/Llama-4-Scout-17B-16E-Instruct-FP8 7 months ago

update config for exclude modules

#1 opened 7 months ago by

New activity in nvidia/Qwen2.5-VL-7B-Instruct-FP8 7 months ago

update config for exclude modules

#3 opened 7 months ago by

New activity in nvidia/Qwen2.5-VL-7B-Instruct-NVFP4 7 months ago

Use actual module path in ignore

#2 opened 7 months ago by

New activity in nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8 8 months ago

Update README.md

#3 opened 8 months ago by

New activity in nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD 8 months ago

Update README.md

#2 opened 8 months ago by

New activity in nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16 8 months ago

Update README.md

#2 opened 8 months ago by

New activity in nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD 9 months ago

Update README.md

#1 opened 9 months ago by

New activity in nvidia/Llama-3.3-70B-Instruct-FP8 about 1 year ago

Update README.md

#2 opened about 1 year ago by

RestingCodeFace

Update README.md

#1 opened about 1 year ago by

New activity in nvidia/DeepSeek-R1-NVFP4 over 1 year ago

Request for Detailed Benchmarking Setup with TensorRT-LLM on B200

#6 opened over 1 year ago by