SmolVLA-SafaiBot-v0 (Phase 0 Stub)

Base: Stub 4-layer MLP (state -&gt; action), NOT SmolVLA
Input: 18-dim proprioceptive state (joint_pos + joint_vel + ee_pos + ee_quat + base_pos)
Output: 7-DOF action
Parameters: ~105K total
Training: 20 steps, MSE loss, Adam optimizer

Toy checkpoint from Phase 0 of the safai-vla project.

This is NOT a production model. It is a stub MLP trained on 500 MuJoCo episodes to validate the training pipeline end-to-end. Real model training happens in Phase 1 with Isaac Sim on DGX Cloud.