gr00t model - 🧪 phosphobot training pipeline
- Dataset: Prachikawtikwar1/first
- Wandb run id: None
Error Traceback
We faced an issue while training your model.
Traceback (most recent call last):
File "/root/src/helper.py", line 139, in train_gr00t_on_modal
trainer.train(
File "/root/phosphobot/am/gr00t.py", line 1250, in train
asyncio.run(
File "/usr/local/lib/python3.11/asyncio/runners.py", line 190, in run
return runner.run(main)
^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/asyncio/runners.py", line 118, in run
return self._loop.run_until_complete(task)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/asyncio/base_events.py", line 654, in run_until_complete
return future.result()
^^^^^^^^^^^^^^^
File "/root/phosphobot/am/gr00t.py", line 1462, in _call_training_script
raise RuntimeError(error_msg)
RuntimeError: Training process failed with exit code 1:
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/gr00t/gr00t/data/dataset.py", line 545, in get_step_data
self.curr_traj_data = self.get_trajectory_data(trajectory_id)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/gr00t/gr00t/data/dataset.py", line 561, in get_trajectory_data
assert parquet_path.exists(), f"Parquet file not found at {parquet_path}"
^^^^^^^^^^^^^^^^^^^^^
AssertionError: Parquet file not found at /tmp/outputs/data/data/chunk-000/episode_000058.parquet
0%| | 101/46700 [00:18<2:24:53, 5.36it/s]
Training parameters
{
"validation_dataset_name": null,
"batch-size": 1,
"num-epochs": 4,
"save-steps": 3000,
"learning_rate": 0.0001,
"data_dir": "/tmp/outputs/data",
"validation_data_dir": "/tmp/outputs/validation_data",
"output_dir": "/tmp/outputs/train"
}
📖 Get Started: docs.phospho.ai
🤖 Get your robot: robots.phospho.ai