Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,26 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
## 🏰 **Pretrained and Fine-tuned Model**
|
| 6 |
+
|
| 7 |
+
- The following checkpoints are utilized to run Robust-R1:
|
| 8 |
+
|
| 9 |
+
| Checkpoint | Link | Note |
|
| 10 |
+
|:---------:|:----:|:----:|
|
| 11 |
+
| Qwen2.5-VL-Base | [link](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct) | Used as initial weights for training. |
|
| 12 |
+
| **Robust-R1-SFT** | [link](https://huggingface.co/Jiaqi-hkust/Robust-R1-SFT) | Fine-tuned on [Robust-R1 dataset](https://huggingface.co/datasets/Jiaqi-hkust/Robust-R1) |
|
| 13 |
+
| **Robust-R1-RL** | [link](https://huggingface.co/Jiaqi-hkust/Robust-R1-RL) | Fine-tuned with reinforcement learning on [Robust-R1 dataset](https://huggingface.co/datasets/Jiaqi-hkust/Robust-R1) |
|
| 14 |
+
|
| 15 |
+
## ⭐️ Citation
|
| 16 |
+
|
| 17 |
+
If you find Robust-R1 useful for your research and applications, please cite using this BibTeX:
|
| 18 |
+
``` latex
|
| 19 |
+
@inproceedings{tang2025robustr1,
|
| 20 |
+
title={Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding},
|
| 21 |
+
author={Tang, Jiaqi and Chen, Jianmin and Wei, Wei and Xu, Xiaogang and Liu, Runtao and Wu, Xiangyu and Xie, Qipeng and Wu, Jiafei and Zhang, Lei and Chen, Qifeng},
|
| 22 |
+
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
|
| 23 |
+
year={2026}
|
| 24 |
+
}
|
| 25 |
+
```
|
| 26 |
+
|