Weilin Zhao

Achazwl
https://weilin-zhao.com
  • acha_William_
  • Achazwl

AI & ML interests

Efficient LLM

Recent Activity

liked a model 1 day ago
openbmb/VoxCPM1.5
liked a dataset 16 days ago
openbmb/InfLLM-V2-data-5B
authored a paper 2 months ago
InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Organizations

OpenBMB

upvoted a paper 2 months ago

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Paper • 2509.24663 • Published Sep 29 • 14
upvoted a collection 5 months ago

FR-Spec

Collection
Released ckpt for arxiv.org/abs/2502.14856 • 6 items • Updated Jul 2 • 1
upvoted a collection 6 months ago

MiniCPM4

Collection
MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated Sep 8 • 80
upvoted a paper 6 months ago

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 93
upvoted 2 papers 9 months ago

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Paper • 2502.12085 • Published Feb 17 • 4

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published Feb 20 • 8