Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning Paper • 2605.06241 • Published May 7 • 5
Vision-Language Instruction Tuning: A Review and Analysis Paper • 2311.08172 • Published Nov 14, 2023 • 1
meta-llama/Meta-Llama-3-8B-Instruct Text Generation • 8B • Updated Jun 18, 2025 • 1.33M • • 4.63k