RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora Reinforcement Learning • 8B • Updated Dec 21, 2025 • 2.07k