Commit History

Add arXiv paper link
06fb73d
verified

Zenalyze committed on

Clarify: KV cache optimized, not smaller file
228619a
verified

Zenalyze committed on

Update card with correct k=32 numbers
d7dedab
verified

Zenalyze committed on

Update to k=32 results
0277873
verified

Zenalyze committed on

Update model card with runtime comparison
847903c
verified

Zenalyze committed on

fraQtl compressed: k=32 INT3, delta=+0.4671
96b6596
verified

Zenalyze committed on

fraQtl compressed: k=16 INT3, delta=+0.7151
667ffd6
verified

Zenalyze committed on

initial commit
da4545b
verified

Zenalyze committed on