5

Juchan

praisechan

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 23 hours ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

Paper • 2605.16839 • Published 4 days ago • 10

upvoted 2 papers 3 months ago

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

Paper • 2602.06454 • Published Feb 6 • 12

Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection

Paper • 2602.03216 • Published Feb 3 • 13

upvoted a paper 12 months ago

Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning

Paper • 2505.13866 • Published May 20, 2025 • 17

upvoted a paper over 1 year ago

FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration

Paper • 2502.01068 • Published Feb 3, 2025 • 18