CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents Paper • 2407.01511 • Published Jul 1, 2024
CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation Paper • 2502.04670 • Published Feb 7, 2025
SEAR: Schema-Based Evaluation and Routing for LLM Gateways Paper • 2603.26728 • Published 12 days ago • 8
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning Paper • 2506.09033 • Published Jun 10, 2025 • 7
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 100