Reinforcement World Model Learning for LLM-based Agents Paper • 2602.05842 • Published Feb 5 • 27
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published Feb 2 • 65
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention Paper • 2602.03338 • Published Feb 3 • 26
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 217
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 306
Medical Leaderboards Collection Healthcare evals and benchmarks targeting AI solutions • 1 item • Updated 30 days ago • 6
Running 23 FACTS Grounding Leaderboard 🚀 23 This is FACTS Grounding Leaderboard, but for Open LLMs!
Medical & Clinical NER Collection State-of-the-art medical, biomedical, and clinical Named Entity Recognition models • 384 items • Updated 12 days ago • 42