Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation • Paper • 2601.20614 • Published Jan 28 • 119
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver • Paper • 2604.08377 • Published 27 days ago • 289
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection • Paper • 2503.02101 • Published Mar 3, 2025
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization • Paper • 2601.05432 • Published Jan 8 • 170
Tree Search for LLM Agent Reinforcement Learning • Paper • 2509.21240 • Published Sep 25, 2025 • 92
Game4Loc: A UAV Geo-Localization Benchmark from Game Data • Paper • 2409.16925 • Published Sep 25, 2024 • 8