Article: Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL (3 days ago)