Learning Native Continuation for Action Chunking Flow Policies Paper • 2602.12978 • Published Feb 13 • 3
Future-KL Regularized GRPO: Process-Level Credit Assignment from $f$-Divergence Regularization Paper • 2601.10201 • Published May 23 • 10
For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal Paper • 2304.04591 • Published Apr 10, 2023 • 2