Commit History

Un-quantize MTP block (revert to native precision)
9e7e88e
verified

jingyux-nv commited on

Drop FlashInfer MoE env vars from vLLM deploy command
b9e53ec
verified

jingyux-nv commited on

Rename TensorRT Model Optimizer link to Model Optimizer (NVIDIA/Model-Optimizer)
994894a
verified

jingyux-nv commited on

Update nvidia-modelopt version to v0.44
3f53361
verified

jingyux-nv commited on

Fix malformed Release Date to 05/27/2026
d7d01c5
verified

jingyux-nv commited on

Remove stale Max OSL note from Evaluation section
ce43847
verified

jingyux-nv commited on

Update README: replace eval table (GPQA/AA-LCR/τ²-Bench/SciCode/IFBench); switch runtime to SGLang and vLLM
2e68f25
verified

jingyux-nv commited on

Add files using upload-large-folder tool
cac567f
verified

jingyux-nv commited on

Add files using upload-large-folder tool
bc21101
verified

jingyux-nv commited on

initial commit
cb75a81
verified

jingyux-nv commited on