AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing Paper • 2602.17607 • Published Feb 19
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents Paper • 2504.16918 • Published Jan 21
PerspectiveGap: A Benchmark for Multi-Agent Orchestration Prompting Paper • 2606.08878 • Published 5 days ago • 1
PerspectiveGap Benchmark Collection Paper, dataset, and leaderboard for multi-agent orchestration prompting. • 3 items • Updated 1 day ago
Running Agents PerspectiveGap Leaderboard 🧠 Explore AI model rankings on the PerspectiveGap benchmark