Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 9 days ago • 39 • 3
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 9 days ago • 39
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 6 days ago • 18 • 8
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 6 days ago • 18
LeanAgent: Lifelong Learning for Formal Theorem Proving Paper • 2410.06209 • Published Oct 8, 2024 • 2
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs Paper • 2509.03730 • Published Sep 3, 2025 • 2
Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean Paper • 2404.12534 • Published Apr 18, 2024 • 1
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models Paper • 2409.15454 • Published Sep 23, 2024 • 2
Creative and Context-Aware Translation of East Asian Idioms with GPT-4 Paper • 2410.00988 • Published Oct 1, 2024 • 2
LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction Paper • 2502.17925 • Published Feb 25, 2025 • 1
LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction Paper • 2502.17925 • Published Feb 25, 2025 • 1
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs Paper • 2509.03730 • Published Sep 3, 2025 • 2
Creative and Context-Aware Translation of East Asian Idioms with GPT-4 Paper • 2410.00988 • Published Oct 1, 2024 • 2
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models Paper • 2409.15454 • Published Sep 23, 2024 • 2
LeanAgent: Lifelong Learning for Formal Theorem Proving Paper • 2410.06209 • Published Oct 8, 2024 • 2