ngocbh's Collections
TrimKV

updated 9 days ago

A set of models that can run with bounded memory

  • Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

    Paper • 2512.03324 • Published 23 days ago

  • ngocbh/TrimKV-Qwen3-4B-Math

    Updated 9 days ago • 54

  • ngocbh/TrimKV-Qwen3-1.7B-Math

    Updated 9 days ago • 43

  • ngocbh/TrimKV-Qwen3-4B-Instruct-2507

    Updated 9 days ago • 30

  • ngocbh/TrimKV-Phi-3-mini-128k-instruct

    Updated 9 days ago • 28

  • ngocbh/TrimKV-Qwen3-8B-Math

    Updated 9 days ago • 30

  • ngocbh/TrimKV-Qwen3-14B-Math

    Updated 9 days ago • 26

  • ngocbh/TrimKV-DeepSeek-R1-Distill-Llama-8B

    Updated 9 days ago • 19