T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning Paper • 2605.02178 • Published May 4 • 10
HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness Paper • 2606.12882 • Published 15 days ago • 13
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published Feb 25 • 26
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models Paper • 2307.10635 • Published Jul 20, 2023 • 9
Efficient Evolutionary Search Over Chemical Space with Large Language Models Paper • 2406.16976 • Published Jun 23, 2024 • 1
Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks Paper • 2310.00115 • Published Sep 29, 2023