nielsr HF Staff commited on
Commit
e67a145
·
verified ·
1 Parent(s): 3400193

Add pipeline tag and links to paper/code

Browse files

Hi! I'm Niels, part of the community science team at Hugging Face.

I noticed that this model card could benefit from some additional metadata and links to the research it belongs to. This PR updates the README to include:
- The `video-classification` pipeline tag in the metadata for better discoverability.
- Links to the [official paper](https://huggingface.co/papers/2603.12254), project page, and GitHub repository.
- A BibTeX citation section.

These changes help researchers and users find and cite your work more easily!

Files changed (1) hide show
  1. README.md +20 -1
README.md CHANGED
@@ -2,15 +2,20 @@
2
  license: other
3
  license_name: nvidia
4
  license_link: LICENSE
 
5
  ---
6
 
 
7
 
 
 
 
8
 
9
  ## Model Overview
10
 
11
  ### Description:
12
 
13
- VideoMAE model used for training AutoGaze. This model is for research and development only. <br>
14
 
15
  ### License/Terms of Use:
16
 
@@ -124,3 +129,17 @@ The raw videos are collected from public dataset including Ego4D, 100DoH, Intern
124
  NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/). <br>
125
 
126
  Please make sure you have proper rights and permissions for all input image and video content; if image or video includes people, personal health information, or intellectual property, the image or video generated will not blur or maintain proportions of image subjects included. <br>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: other
3
  license_name: nvidia
4
  license_link: LICENSE
5
+ pipeline_tag: video-classification
6
  ---
7
 
8
+ # VideoMAE_AutoGaze
9
 
10
+ [**Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing**](https://huggingface.co/papers/2603.12254)
11
+
12
+ [**Project Page**](https://autogaze.github.io/) | [**GitHub**](https://github.com/NVlabs/AutoGaze) | [**Demo**](https://huggingface.co/spaces/bfshi/AutoGaze)
13
 
14
  ## Model Overview
15
 
16
  ### Description:
17
 
18
+ VideoMAE model used for training AutoGaze. This model is for research and development only. <br>
19
 
20
  ### License/Terms of Use:
21
 
 
129
  NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/). <br>
130
 
131
  Please make sure you have proper rights and permissions for all input image and video content; if image or video includes people, personal health information, or intellectual property, the image or video generated will not blur or maintain proportions of image subjects included. <br>
132
+
133
+ ## Citation
134
+
135
+ ```bibtex
136
+ @misc{shi2026attendattentionefficientscalable,
137
+ title={Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing},
138
+ author={Baifeng Shi and Stephanie Fu and Long Lian and Hanrong Ye and David Eigen and Aaron Reite and Boyi Li and Jan Kautz and Song Han and David M. Chan and Pavlo Molchanov and Trevor Darrell and Hongxu Yin},
139
+ year={2026},
140
+ eprint={2603.12254},
141
+ archivePrefix={arXiv},
142
+ primaryClass={cs.CV},
143
+ url={https://arxiv.org/abs/2603.12254},
144
+ }
145
+ ```