---
license: apache-2.0
---
## Model Description
This is the Hugging Face model card for Q-Insight 👋
- Paper: https://arxiv.org/pdf/2503.22679
- Code: https://github.com/bytedance/Q-Insight
## License
This project is licensed under the Apache-2.0 License. The model is fine-tuned from [Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct), which is also released under Apache-2.0.
## Citation
If you find the code or model helpful in your research or work, please cite the following papers:
```
@inproceedings{li2025qinsight,
title={Q-insight: Understanding image quality via visual reinforcement learning},
author={Li, Weiqi and Zhang, Xuanyu and Zhao, Shijie and Zhang, Yabin and Li, Junlin and Zhang, Li and Zhang, Jian},
booktitle={Advances in Neural Information Processing Systems},
year={2025}
}
```
```
@inproceedings{zhang2025vqinsight,
title={VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning},
author={Zhang, Xuanyu and Li, Weiqi and Zhao, Shijie and Li, Junlin and Zhang, Li and Zhang, Jian},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
year={2026}
}
```