Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation 16 days ago • 23
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 113
Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 12 days ago • 17
SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation 10 days ago • 15
Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation 16 days ago • 23
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 113
Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 12 days ago • 17
SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation 10 days ago • 15