moritzknaust
/

openvla-7b

Image-Text-to-Text

feature-extraction

Model card Files Files and versions

skaramcheti commited on Jun 14, 2024

Commit

c67d07b

·

1 Parent(s): 77de19a

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ The model takes language instructions and camera images as input and generates r
 All OpenVLA checkpoints, as well as our [training codebase](https://github.com/openvla/openvla) are released under an MIT License.
-For full details, please read [our paper](https://openvla.github.io/) and see [our project page](https://openvla.github.io/).
 ## Model Summary
@@ -32,7 +32,7 @@ For full details, please read [our paper](https://openvla.github.io/) and see [o
   + **Language Model**: Llama-2
 - **Pretraining Dataset:** [Open X-Embodiment](https://robotics-transformer-x.github.io/) -- specific component datasets can be found [here](https://github.com/openvla/openvla).
 - **Repository:** [https://github.com/openvla/openvla](https://github.com/openvla/openvla)
-- **Paper:** [OpenVLA: An Open-Source Vision-Language-Action Model](https://openvla.github.io/)
 - **Project Page & Videos:** [https://openvla.github.io/](https://openvla.github.io/)
 ## Uses
@@ -93,7 +93,7 @@ For more examples, including scripts for fine-tuning OpenVLA models on your own
 @article{kim24openvla,
     title={OpenVLA: An Open-Source Vision-Language-Action Model},
     author={{Moo Jin} Kim and Karl Pertsch and Siddharth Karamcheti and Ted Xiao and Ashwin Balakrishna and Suraj Nair and Rafael Rafailov and Ethan Foster and Grace Lam and Pannag Sanketi and Quan Vuong and Thomas Kollar and Benjamin Burchfiel and Russ Tedrake and Dorsa Sadigh and Sergey Levine and Percy Liang and Chelsea Finn},
-    journal = {arXiv preprint},
     year={2024}
 }
 ```

 All OpenVLA checkpoints, as well as our [training codebase](https://github.com/openvla/openvla) are released under an MIT License.
+For full details, please read [our paper](https://arxiv.org/abs/2406.09246) and see [our project page](https://openvla.github.io/).
 ## Model Summary
   + **Language Model**: Llama-2
 - **Pretraining Dataset:** [Open X-Embodiment](https://robotics-transformer-x.github.io/) -- specific component datasets can be found [here](https://github.com/openvla/openvla).
 - **Repository:** [https://github.com/openvla/openvla](https://github.com/openvla/openvla)
+- **Paper:** [OpenVLA: An Open-Source Vision-Language-Action Model](https://arxiv.org/abs/2406.09246)
 - **Project Page & Videos:** [https://openvla.github.io/](https://openvla.github.io/)
 ## Uses
 @article{kim24openvla,
     title={OpenVLA: An Open-Source Vision-Language-Action Model},
     author={{Moo Jin} Kim and Karl Pertsch and Siddharth Karamcheti and Ted Xiao and Ashwin Balakrishna and Suraj Nair and Rafael Rafailov and Ethan Foster and Grace Lam and Pannag Sanketi and Quan Vuong and Thomas Kollar and Benjamin Burchfiel and Russ Tedrake and Dorsa Sadigh and Sergey Levine and Percy Liang and Chelsea Finn},
+    journal = {arXiv preprint arXiv:2406.09246},
     year={2024}
 }
 ```