LiAuto-DSR/avavla-calvin-abc2d

feature-extraction

nickshawn committed 5 days ago

Commit 652a7d3 · verified · 1 Parent(s): 6c73553

Update README.md

Files changed (1)

README.md +21 -0

@@ -1,3 +1,24 @@
 ---
 license: apache-2.0
+pipeline_tag: robotics
+library_name: transformers
 ---
+# AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
+This repository contains the AVA-VLA checkpoint trained on the CALVIN ABC→D setting, as described in [AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention](https://arxiv.org/abs/2511.18960). AVA-VLA reformulates vision-language-action policy learning from a partially observable perspective and uses a recurrent state to summarize task history for action generation.
+Project Page: https://liauto-dsr.github.io/AVA-VLA-Page/
+Code: https://github.com/LiAuto-DSR/AVA-VLA
+## Citation
+```bibtex
+@article{xiao2025ava,
+  title={AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention},
+  author={Xiao, Lei and Li, Jifeng and Gao, Juntao and Ye, Feiyang and Jin, Yan and Qian, Jingjing and Zhang, Jing and Wu, Yong and Yu, Xiaoyuan},
+  journal={arXiv preprint arXiv:2511.18960},
+  year={2025}
+}
+```