Large Multimodal Models as General In-Context Classifiers Paper • 2602.23229 • Published 16 days ago • 22
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published about 1 month ago • 15
view article Article Re-understanding KL Approximation from an RL-for-LLM Lens: Notes on “Approximating KL Divergence” Aug 11, 2025 • 10