Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -146,11 +146,14 @@ similarities = torch.einsum("btd,bd->bt", audio_embeds, text_embeds)
|
|
| 146 |
## Citation
|
| 147 |
|
| 148 |
```bibtex
|
| 149 |
-
@
|
| 150 |
-
|
| 151 |
-
|
| 152 |
-
|
| 153 |
-
|
|
|
|
|
|
|
|
|
|
| 154 |
}
|
| 155 |
```
|
| 156 |
|
|
|
|
| 146 |
## Citation
|
| 147 |
|
| 148 |
```bibtex
|
| 149 |
+
@misc{vyas2025pushingfrontieraudiovisualperception,
|
| 150 |
+
title={Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning},
|
| 151 |
+
author={Apoorv Vyas and Heng-Jui Chang and Cheng-Fu Yang and Po-Yao Huang and Luya Gao and Julius Richter and Sanyuan Chen and Matt Le and Piotr Dollár and Christoph Feichtenhofer and Ann Lee and Wei-Ning Hsu},
|
| 152 |
+
year={2025},
|
| 153 |
+
eprint={2512.19687},
|
| 154 |
+
archivePrefix={arXiv},
|
| 155 |
+
primaryClass={cs.SD},
|
| 156 |
+
url={https://arxiv.org/abs/2512.19687},
|
| 157 |
}
|
| 158 |
```
|
| 159 |
|