Di Liu's picture

3

Di Liu

diliu0349

AI & ML interests

None yet

Organizations

None yet

upvoted a paper about 1 year ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 29

upvoted an article over 1 year ago

Article

MInference 1.0: 10x Faster Million Context Inference with a Single GPU

liyucheng

•

Jul 11, 2024

• 14

upvoted a paper over 1 year ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 43