|
|
--- |
|
|
license: apache-2.0 |
|
|
--- |
|
|
## Model Description |
|
|
This is a huggingface model card for Q-Insight 👋 |
|
|
- Paper: https://arxiv.org/pdf/2503.22679 |
|
|
- Code: https://github.com/bytedance/Q-Insight |
|
|
|
|
|
## License |
|
|
This project is licensed under the Apache-2.0 License. It is finetuned from [Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) under Apache-2.0. |
|
|
|
|
|
## Citation |
|
|
If you find the code helpful in your research or work, please cite the following papers: |
|
|
``` |
|
|
@inproceedings{li2025qinsight, |
|
|
title={Q-insight: Understanding image quality via visual reinforcement learning}, |
|
|
author={Li, Weiqi and Zhang, Xuanyu and Zhao, Shijie and Zhang, Yabin and Li, Junlin and Zhang, Li and Zhang, Jian}, |
|
|
booktitle={Advances in Neural Information Processing Systems}, |
|
|
year={2025} |
|
|
} |
|
|
``` |
|
|
``` |
|
|
@inproceedings{zhang2025vqinsight, |
|
|
title={VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning}, |
|
|
author={Zhang, Xuanyu and Li, Weiqi and Zhao, Shijie and Li, Junlin and Zhang, Li and Zhang, Jian}, |
|
|
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence}, |
|
|
year={2026} |
|
|
} |
|
|
``` |