zjunlp/LabVLA

Add robotics pipeline tag and improve model card

by nielsr HF Staff - opened about 9 hours ago

base: refs/heads/main ← from: refs/pr/1

README.md +25 -2

@@ -1,6 +1,8 @@
 ---
 license: mit
 ---
 <div align="center">
 <p align="center">
@@ -21,7 +23,13 @@ license: mit
 ## Model Description
-**LabVLA** is the first vision–language–action (VLA) model designed for scientific laboratory environments. It combines a **Qwen3-VL-4B-Instruct** vision–language backbone with a **DiT flow-matching action expert**, trained with the π0.5 recipe to enable real-time robot control in lab settings.
 ## How to Use
@@ -41,4 +49,19 @@ cd LabVLA
 bash deployment/deploy.sh
 ```
-For training, data preparation, and more details, please refer to our [GitHub repository](https://github.com/zjunlp/LabVLA).

 ---
 license: mit
+pipeline_tag: robotics
 ---
 <div align="center">
 <p align="center">
 ## Model Description
+**LabVLA** is the first vision–language–action (VLA) model designed specifically for scientific laboratory environments, as introduced in [LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories](https://huggingface.co/papers/2606.13578).
+It combines a **Qwen3-VL-4B-Instruct** vision–language backbone with a **DiT flow-matching action expert**. The model is trained using a two-stage recipe:
+1. **FAST action token pretraining**: Makes the backbone action-aware.
+2. **Flow matching posttraining**: Attaches the DiT action expert under knowledge insulation to enable continuous control.
+LabVLA addresses the gap in existing policies that are mostly trained on household data, enabling autonomous execution of scientific protocols involving laboratory instruments and transparent liquids.
 ## How to Use
 bash deployment/deploy.sh
 ```
+For training, data preparation, and more details, please refer to the [GitHub repository](https://github.com/zjunlp/LabVLA).
+## Citation
+```bibtex
+@article{ren2026labvla,
+  title   = {LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories},
+  author  = {Ren, Baochang and Liu, Xinjie and Chen, Xi and Liu, Yanshuo and
+             Li, Chenxi and Gao, Daqi and Su, Zeqin and Xing, Jintao and
+             Xue, Zirui and Li, Rui and Zhao, Xiangyu and Qiao, Shuofei and
+             Pan, Minting and Zuo, Wangmeng and Bai, Lei and Zhou, Dongzhan and
+             Zhang, Ningyu and Chen, Huajun},
+  journal = {arXiv preprint arXiv:2606.13578},
+  year    = {2026}
+}
+```
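
The metadata part of this change is the single `pipeline_tag: robotics` line added to the YAML front matter. As a rough local check, the sketch below reads the front matter before and after the change. The helper `read_front_matter` is hypothetical and only handles flat `key: value` pairs; it is not the Hub's own parser, and the two sample strings are abbreviated from the diff above.

```python
# Hypothetical helper: reads flat `key: value` pairs between the two `---`
# delimiters at the top of a README. Not a full YAML parser.
def read_front_matter(text: str) -> dict:
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    meta = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return meta  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()
    return {}  # no closing delimiter: treat as no front matter


# Abbreviated README headers taken from the diff (old vs. new).
before = "---\nlicense: mit\n---\n<div align=\"center\">\n"
after = "---\nlicense: mit\npipeline_tag: robotics\n---\n<div align=\"center\">\n"

print(read_front_matter(before))  # {'license': 'mit'}
print(read_front_matter(after))   # {'license': 'mit', 'pipeline_tag': 'robotics'}
```

For anything beyond flat keys (lists of tags, nested fields), a real YAML parser would be the safer choice.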