MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 2 days ago • 26
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 127
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models Paper • 2305.04091 • Published May 6, 2023 • 3