Add robotics pipeline tag and license metadata
#1
opened by nielsr (HF Staff)

README.md CHANGED
@@ -1,9 +1,12 @@
 ---
-datasets:
-- nvidia/ALOHA-Cosmos-Policy
 base_model:
 - nvidia/Cosmos-Predict2-2B-Video2World
+datasets:
+- nvidia/ALOHA-Cosmos-Policy
+license: other
+pipeline_tag: robotics
 ---
+
 # **Cosmos-Policy-ALOHA-Predict2-2B**
 
 [**Cosmos Policy**](https://huggingface.co/collections/nvidia/cosmos-policy) | [**Code**](http://github.com/NVlabs/cosmos-policy) | [**White Paper**](https://arxiv.org/abs/2601.16163) | [**Website**](https://research.nvidia.com/labs/dir/cosmos-policy/)
@@ -258,11 +261,18 @@ Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.
 - **Base Model**: [Cosmos-Predict2-2B-Video2World](https://huggingface.co/nvidia/Cosmos-Predict2-2B-Video2World)
 - **Training Dataset**: [ALOHA-Cosmos-Policy](https://huggingface.co/datasets/nvidia/ALOHA-Cosmos-Policy)
 - **Planning Model Checkpoint**: [Cosmos-Policy-ALOHA-Planning-Model-Predict2-2B](https://huggingface.co/nvidia/Cosmos-Policy-ALOHA-Planning-Model-Predict2-2B)
-- **Paper**: https://arxiv.org/abs/2601.16163
+- **Paper**: [Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning](https://arxiv.org/abs/2601.16163)
 - **Original ALOHA**: [Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware](https://arxiv.org/abs/2304.13705)
 
 ## Citation
 
 If you use this model, please cite the Cosmos Policy paper:
 
-
+```bibtex
+@article{kim2026cosmos,
+  title={Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning},
+  author={Kim, Moo Jin and Gao, Yihuai and Lin, Tsung-Yi and Lin, Yen-Chen and Ge, Yunhao and Lam, Grace and Liang, Percy and Song, Shuran and Liu, Ming-Yu and Finn, Chelsea and Gu, Jinwei},
+  journal={arXiv preprint arXiv:2601.16163},
+  year={2026}
+}
+```
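Not part of the diff itself, but a quick way to sanity-check that the new frontmatter carries the added `license` and `pipeline_tag` keys is to parse the `---`-delimited header and inspect its top-level scalars. The sketch below is illustrative only: it hand-parses simple `key: value` lines to avoid a PyYAML dependency, and `README_HEAD` is transcribed from the new revision shown above; it is not a validator the Hub itself uses.

```python
# Minimal sanity check on the updated model-card frontmatter.
# Hand-parses top-level "key: value" scalar lines from the
# "---"-delimited YAML header (list items are skipped), so it
# needs only the standard library. Illustrative sketch only.

README_HEAD = """\
---
base_model:
- nvidia/Cosmos-Predict2-2B-Video2World
datasets:
- nvidia/ALOHA-Cosmos-Policy
license: other
pipeline_tag: robotics
---
"""

def frontmatter_scalars(text: str) -> dict:
    """Return top-level `key: value` scalar pairs from a ----delimited header."""
    body = text.split("---")[1]  # text between the two --- delimiters
    pairs = {}
    for line in body.strip().splitlines():
        if line.startswith("- ") or ":" not in line:
            continue  # skip YAML list items and non-mapping lines
        key, _, value = line.partition(":")
        if value.strip():  # keys with no inline value (list headers) are skipped
            pairs[key.strip()] = value.strip()
    return pairs

meta = frontmatter_scalars(README_HEAD)
assert meta["pipeline_tag"] == "robotics"
assert meta["license"] == "other"
```

With this header, `frontmatter_scalars` yields only the two new scalar keys, since `base_model` and `datasets` carry list values rather than inline scalars.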