Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 24 days ago • 50
Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations Paper • 2606.10614 • Published 17 days ago • 25
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models Paper • 2603.21341 • Published Mar 22 • 24
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published Mar 23 • 46
RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning Paper • 2602.18742 • Published Feb 21 • 11
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr • Feb 11, 2025 • 126