Beomseok Kang's picture

8

Beomseok Kang

beomseokg

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

upvoted a paper 5 months ago

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

upvoted a paper 5 months ago

LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents

View all activity

Organizations

upvoted a paper about 1 month ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

Paper • 2605.16839 • Published May 16 • 14

upvoted 3 papers 5 months ago

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

Paper • 2602.06454 • Published Feb 6 • 12

LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents

Paper • 2602.01053 • Published Feb 1 • 8

Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection

Paper • 2602.03216 • Published Feb 3 • 14

upvoted a paper 8 months ago

LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning

Paper • 2510.14211 • Published Oct 16, 2025 • 9

upvoted a paper 9 months ago

QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models

Paper • 2509.17428 • Published Sep 22, 2025 • 9

upvoted a paper about 1 year ago

Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning

Paper • 2505.13866 • Published May 20, 2025 • 17

upvoted a paper over 1 year ago

FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration

Paper • 2502.01068 • Published Feb 3, 2025 • 18