Pre-computed Q-Filters for efficient KV cache compression.
Nathan Godey
nthngdy
Recent Activity
Submitted a paper about 16 hours ago: "Lost in Backpropagation: The LM Head is a Gradient Bottleneck"
Updated a model 4 days ago: nthngdy/matryoshka-baselines
Published a model 4 days ago: nthngdy/matryoshka-baselines