LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 2 days ago • 29
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation Paper • 2603.09723 • Published 3 days ago • 6
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation Paper • 2603.09723 • Published 3 days ago • 6
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 8 days ago • 26
LongCLI-Bench: A Preliminary Benchmark and Study for Long-horizon Agentic Programming in Command-Line Interfaces Paper • 2602.14337 • Published 26 days ago • 13
Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs Paper • 2602.21198 • Published 17 days ago • 4
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 17 days ago • 94