zijie tian
zijie-tian
AI & ML interests
Storage for AI
Recent Activity
liked a model 8 days ago
Qwen/Qwen3.5-397B-A17B
upvoted a paper 2 months ago
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
upvoted a paper 2 months ago
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference