Sleeping Agents 1 RobustBench-TC Leaderboard 🛠 1 Sim-to-real robustness leaderboard for tool-use LLM agents
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 229