CASCADE: Case-Based Continual Adaptation for Large Language Models During Deployment Paper • 2605.06702 • Published 7 days ago • 2
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published 12 days ago • 70
mradermacher/Qwen3.5-4B-SFT-Claude-Opus-Reasoning-Unsloth-i1-GGUF Text Generation • 4B • Updated 4 days ago • 3.32k • 1
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 12 days ago • 57
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 29 days ago • 101
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 627
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 341
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 350