nickshawn commited on
Commit
652a7d3
·
verified ·
1 Parent(s): 6c73553

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md CHANGED
@@ -1,3 +1,24 @@
1
  ---
2
  license: apache-2.0
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ pipeline_tag: robotics
4
+ library_name: transformers
5
  ---
6
+
7
+ # AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
8
+
9
+ This repository contains the AVA-VLA checkpoint trained on CALVIN ABC→D setting, as described in [AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention](https://arxiv.org/abs/2511.18960). AVA-VLA reformulates vision-language-action policy learning from a partially observable perspective and uses a recurrent state to summarize task history for action generation.
10
+
11
+ Project Page: https://liauto-dsr.github.io/AVA-VLA-Page/
12
+
13
+ Code: https://github.com/LiAuto-DSR/AVA-VLA
14
+
15
+ ## Citation
16
+
17
+ ```bibtex
18
+ @article{xiao2025ava,
19
+ title={AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention},
20
+ author={Xiao, Lei and Li, Jifeng and Gao, Juntao and Ye, Feiyang and Jin, Yan and Qian, Jingjing and Zhang, Jing and Wu, Yong and Yu, Xiaoyuan},
21
+ journal={arXiv preprint arXiv:2511.18960},
22
+ year={2025}
23
+ }
24
+ ```