Qwen3-Quantization
This is the official collection of quantized Qwen3 models:
- Efficient-ML/Qwen3-0.6B-base-gptq-w4-128 (updated May 5)
- Efficient-ML/Qwen3-0.6B-base-gptq-w8-128 (updated May 5)
- Efficient-ML/Qwen3-0.6B-base-gptq-w8-perchannel (updated May 5)
- Efficient-ML/Qwen3-0.6B-base-gptq-w4-perchannel (updated May 5)
LLaMA3-Quantization
This is the official collection of quantized models from "How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study":
- Efficient-ML/LLaMA-3-8B-GPTQ-4bit-b128 (updated Apr 21, 2024)
- Efficient-ML/LLaMA-3-8B-SmoothQuant-4bit-4bit (updated Apr 22, 2024)
- Efficient-ML/LLaMA-3-8B-AWQ-4bit-b128 (updated Apr 28, 2024)
- Efficient-ML/LLaMA-3-8B-SmoothQuant-8bit-8bit (updated Apr 22, 2024)
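To illustrate what the bit widths in these model names (w4, w8, 4bit, 8bit) mean in practice, here is a minimal sketch of the approximate weight-storage footprint of an 8B-parameter model at different precisions. The figures count raw weight bits only; real GPTQ/AWQ/SmoothQuant checkpoints also store per-group or per-channel scales and zero-points, so actual sizes are somewhat larger.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Approximate weight storage in GiB, ignoring quantization metadata
    such as scales and zero-points."""
    return num_params * bits_per_weight / 8 / 2**30

params = 8e9  # roughly LLaMA-3-8B / the 8B checkpoints in this collection
for bits in (16, 8, 4):
    print(f"w{bits}: {weight_memory_gib(params, bits):.1f} GiB")
# → w16: 14.9 GiB, w8: 7.5 GiB, w4: 3.7 GiB
```

Going from FP16 to 4-bit weights cuts weight storage by roughly 4x, which is the main motivation behind the w4/4bit variants listed above.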