Laguna-XS.2-INT4 / README.md

Commit History

Drop VLLM_USE_DEEP_GEMM=0 from vllm serve recipe (DeepGEMM is supported on Hopper and datacenter Blackwell)
8c57f62
verified

joerowell commited on

Enable thinking by default in non-Hopper FP8-KV serve command
c7a758e
verified

joerowell commited on

Update non-Hopper FP8-KV serve command and link to vLLM recipes page
2c0d22b
verified

joerowell commited on

Laguna XS.2 upload
f82b43d

joerowell commited on