Kante Yin

kerthcet
https://ky.dev

AI & ML interests

Building AI Infrastructures.

Organizations

InftyAI, TechTrek, quintessa.ai, 90min.ai, techtrek.ai, infiniai.io, freesolo.ai, boxe.ai, roamcloud.ai, matrixy.ai, modelspec.ai, antrix.ai, hiverge.ai, Humanity's Last Hackathon

Collections 1

Inference
  • ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

    Paper • 2402.15220 • Published Feb 23, 2024 • 20

models 0

None public yet

datasets 0

None public yet