Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration Paper • 2405.14314 • Published May 23, 2024 • 1
Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance Paper • 2509.02055 • Published Sep 2 • 1
Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Paper • 2512.02834 • Published 10 days ago • 39
Behavior Contrastive Learning for Unsupervised Skill Discovery Paper • 2305.04477 • Published May 8, 2023
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning Paper • 2308.09405 • Published Aug 18, 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness Paper • 2309.16973 • Published Sep 29, 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning Paper • 2305.18459 • Published May 29, 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering Paper • 2305.17625 • Published May 28, 2023
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Paper • 2207.14800 • Published Jul 29, 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing Paper • 2206.02829 • Published Jun 6, 2022