Qwen3.5-35B-A3B: show 601 tok/s (23.4x vs reference), varied ~455 + peak 10,516 in role; method withheld a95c4e6 verified openfree commited on about 2 hours ago
Update Qwen3.5-35B-A3B row: 25.7->455 tok/s (17.7x, FP8+optimized), peak 10,516 tok/s; method withheld 6a96b27 verified openfree commited on about 2 hours ago
Redact acceleration-method disclosure (graph/vLLM/SDPA) from public rows; keep claim-scoped generic labels 199aedc verified openfree commited on about 19 hours ago
remove 12B/26B/31B rows; keep E4B showcase + 398B flagship b80b31b verified openfree commited on 2 days ago
remove Gemma E2B row (package-validation only, 1.06x) 6d997a3 verified openfree commited on 2 days ago
Add JGOS-398B (Qwen3.5-MoE) large-MoE serving-acceleration lane: 4.33x (88->382 TPS, B200x6 TP2PP3); method internals withheld 6232437 verified openfree commited on 3 days ago
Update VKAE Gemma model packages and preview benchmark metadata 7812d76 verified openfree commited on 5 days ago
Separate optimized VKAE claims from preview measurements 1b3a758 verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_complete c284199 verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running 02dce3c verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running 332d199 verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running b803a90 verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running e63b533 verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running bbe5fec verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running 1050c9f verified openfree commited on 5 days ago
Update VKAE H200 BF16 fallback benchmark results: h200_bf16_running 8c2f6f8 verified openfree commited on 5 days ago