RLinf/RLinf-lingbotvla-place-shoe-grpo
Robotics • 4B • Updated • 11
None defined yet.
LaWAM: Latent World Action Models for Efficient Dynamics-Aware Robot Policies
WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL