File size: 950 Bytes
b712cc3 ead6722 cc6f3cb d80e951 cc6f3cb | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 | ---
license: apache-2.0
---
<h3 align="center" style="font-size:24px; font-weight:bold; color:#9C276A; margin: 0;">
<a href="https://arxiv.org/abs/2602.10098" style="color:#9C276A; text-decoration: none;">
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
</a>
</h3>
<div align="center">
<p>
<a href="https://arxiv.org/abs/2602.10098">
<img src="https://img.shields.io/badge/Paper-PDF-orange.svg" alt="Paper PDF">
</a>
<a href="https://ginwind.github.io/VLA-JEPA/">
<img src="https://img.shields.io/badge/Project-Page-Green.svg" alt="Project Page">
</a>
<a href="https://huggingface.co/ginwind/VLA-JEPA">
<img src="https://img.shields.io/badge/🤗-Hugging_Face-yellow.svg" alt="Hugging Face">
</a>
<a href="https://github.com/tatsu-lab/stanford_alpaca/blob/main/LICENSE">
<img src="https://img.shields.io/badge/Code%20License-Apache_2.0-green.svg" alt="Code License">
</a>
</p>
</div> |