harshithsaiv/kv-cache-compression

memory-efficient

inference-optimization

4-bit precision

mixed-precision


kv-cache-compression / results

66.3 kB


2 contributors

History: 10 commits


feat: complete 4-method benchmark with honest memory reporting

0774ec2 about 2 months ago

llama-3-8b: feat: complete 4-method benchmark with honest memory reporting (about 2 months ago)
mistral-7b: feat: complete 4-method benchmark with honest memory reporting (about 2 months ago)