---
datasets:
- commaai/commabody
pipeline_tag: robotics
---

This model was trained on a larger version (194 minutes total) of the commabody dataset.
It includes a [VQGAN](https://github.com/CompVis/taming-transformers) encoder/decoder fine-tuned from an ImageNet checkpoint. The encoder compresses each 160x256 image into a 10x16 grid of tokens.

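The compression figures above can be checked with a few lines of arithmetic. This is only a sketch of the numbers stated in this card (160x256 images, 10x16 token grids); the function and variable names are hypothetical and do not come from the model's code.

```python
# Sizes taken from the model card: 160x256 images, 10x16 token grids.
IMAGE_H, IMAGE_W = 160, 256
GRID_H, GRID_W = 10, 16


def downsample_factor(image_size: int, grid_size: int) -> int:
    """Return the spatial downsampling factor along one axis.

    Hypothetical helper for illustration; not part of the released model.
    """
    if image_size % grid_size != 0:
        raise ValueError("image size must be a multiple of the grid size")
    return image_size // grid_size


factor_h = downsample_factor(IMAGE_H, GRID_H)  # pixels per token, vertically
factor_w = downsample_factor(IMAGE_W, GRID_W)  # pixels per token, horizontally
tokens_per_frame = GRID_H * GRID_W             # length of one frame's token sequence
pixels_per_token = factor_h * factor_w         # image pixels summarized by one token

print(f"downsampling: {factor_h}x vertically, {factor_w}x horizontally")
print(f"tokens per frame: {tokens_per_frame}")
print(f"pixels per token: {pixels_per_token}")
```

Each frame therefore becomes a short sequence of discrete tokens, which is what makes next-frame prediction with a language-model-style architecture practical.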
It also includes a GPT-2 model trained to predict the next frame, wheel speeds, and actions. It can be used either as a simulator or as a policy. See [our blog post](https://blog.comma.ai/a-drive-in-the-office/) for more details.

<video title="imagined rollouts from 3s of context" controls>
<source src="https://blog.comma.ai/img/body_patrol/rollouts.webm" type="video/webm">
</video>

You can run it on a comma body using [our example script in body-jim](https://github.com/commaai/body-jim/blob/master/examples/roam.py).