Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping Paper • 2402.07610 • Published Feb 12, 2024 • 9
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt Paper • 2404.05331 • Published Apr 8, 2024
Language-based Trial and Error Falls Behind in the Era of Experience Paper • 2601.21754 • Published 1 day ago • 13
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning Paper • 2601.22069 • Published 1 day ago • 7
Language-based Trial and Error Falls Behind in the Era of Experience Paper • 2601.21754 • Published 1 day ago • 13
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 16 days ago • 126
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory Paper • 2510.02373 • Published Sep 29, 2025 • 10
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios Paper • 2509.21766 • Published Sep 26, 2025 • 24
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26, 2025 • 70