Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
ngocbh 's Collections
TrimKV

TrimKV

updated 8 days ago

A set of models that can run with bounded memory

Upvote
1

  • Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

    Paper • 2512.03324 • Published Dec 3, 2025 • 2

  • ngocbh/TrimKV-Qwen3-4B-Math

    Text Generation • Updated 7 days ago • 49

  • ngocbh/TrimKV-Qwen3-1.7B-Math

    Text Generation • Updated 7 days ago • 36

  • ngocbh/TrimKV-Qwen3-4B-Instruct-2507

    Text Generation • Updated 7 days ago • 32

  • ngocbh/TrimKV-Phi-3-mini-128k-instruct

    Text Generation • Updated 7 days ago • 35

  • ngocbh/TrimKV-Qwen3-8B-Math

    Text Generation • Updated 7 days ago • 44

  • ngocbh/TrimKV-Qwen3-14B-Math

    Text Generation • Updated 7 days ago • 38

  • ngocbh/TrimKV-DeepSeek-R1-Distill-Llama-8B

    Updated 7 days ago • 21

  • Make Each Token Count: Towards Improving Long-Context Performance with KV Cache Eviction

    Paper • 2605.09649 • Published 10 days ago • 11

  • ngocbh/DBTrimKV-Qwen3-4B-Math

    Text Generation • Updated 7 days ago • 53

  • ngocbh/DBTrimKV-Qwen3-4B-Instruct-2507

    Text Generation • Updated 7 days ago • 39

  • ngocbh/DBTrimKV-Qwen3-VL-8B-Thinking

    Image-Text-to-Text • Updated 7 days ago • 129

  • ngocbh/DBTrimKV-Qwen3-VL-4B-Instruct

    Image-Text-to-Text • Updated 7 days ago • 68
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs