GTA1 Collection A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 5
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published Jan 31, 2025 • 39