Add model card for GLAD
Browse filesHi! I'm Niels, part of the community science team at Hugging Face. I noticed this repository was missing a model card. This PR adds a README (model card) including metadata, a summary of the paper's approach, and links to the official code and paper to improve discoverability.
README.md
ADDED
|
@@ -0,0 +1,35 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
pipeline_tag: object-detection
|
| 3 |
+
tags:
|
| 4 |
+
- vision-language-tracking
|
| 5 |
+
- diffusion-models
|
| 6 |
+
- visual-tracking
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
# GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates
|
| 10 |
+
|
| 11 |
+
This repository contains the weights for **GLAD**, a vision-language tracking model introduced in the paper [GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates](https://huggingface.co/papers/2602.00570).
|
| 12 |
+
|
| 13 |
+
## Overview
|
| 14 |
+
|
| 15 |
+
GLAD (Generative Language-AssisteD tracking) is a pioneering model that utilizes diffusion models for generative multi-modal fusion of text descriptions and template images.
|
| 16 |
+
|
| 17 |
+
Current vision-language trackers often struggle with "low-semantic" images (such as those with significant blur or low resolution) because traditional discriminative fusion paradigms have limited effectiveness in bridging the gap between text and degraded visual features. GLAD addresses this by leveraging the reconstruction capabilities of generative models to bolster compatibility between language and images, effectively enhancing the semantic information of the template for more robust tracking.
|
| 18 |
+
|
| 19 |
+
## Resources
|
| 20 |
+
|
| 21 |
+
- **Paper:** [GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates](https://huggingface.co/papers/2602.00570)
|
| 22 |
+
- **GitHub Repository:** [https://github.com/Confetti-lxy/GLAD](https://github.com/Confetti-lxy/GLAD)
|
| 23 |
+
|
| 24 |
+
## Citation
|
| 25 |
+
|
| 26 |
+
If you find this work useful in your research, please cite:
|
| 27 |
+
|
| 28 |
+
```bibtex
|
| 29 |
+
@article{luo2026glad,
|
| 30 |
+
title={GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates},
|
| 31 |
+
author={Luo, Xingyu and Cai, Yidong and Liu, Jie and Tang, Jie and Wu, Gangshan and Wang, Limin},
|
| 32 |
+
journal={arXiv preprint arXiv:2602.00570},
|
| 33 |
+
year={2026}
|
| 34 |
+
}
|
| 35 |
+
```
|