arxiv:2406.16758
Taehyeon Kim
Kthyeon
AI & ML interests
LLM Inference: Parallel, Speculative, Instructive Decoding
Recent Activity
upvoted a paper about 13 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
upvoted a paper 13 days ago
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors
liked a model 26 days ago
LGAI-EXAONE/K-EXAONE-236B-A23B