Image-to-Text
Transformers
PyTorch
TensorBoard
English
mplug_owl2
feature-extraction
image-quality-assessment
document-quality
mplug-owl2
vision-language
document-analysis
sharpness
blur-detection
IQA
custom_code
Instructions to use mapo80/DeQA-Doc-Sharpness with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mapo80/DeQA-Doc-Sharpness with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="mapo80/DeQA-Doc-Sharpness", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("mapo80/DeQA-Doc-Sharpness", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -263,18 +263,29 @@ DeQA-Doc-Sharpness understands document context and can differentiate between in
|
|
| 263 |
- Requires GPU with sufficient VRAM for efficient inference
|
| 264 |
- Sharpness assessment is relative to training data distribution
|
| 265 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 266 |
## Citation
|
| 267 |
|
|
|
|
|
|
|
| 268 |
```bibtex
|
| 269 |
-
@
|
| 270 |
-
title={DeQA-Doc
|
| 271 |
-
author={
|
| 272 |
-
|
| 273 |
-
|
| 274 |
-
url={https://huggingface.co/mapo80/DeQA-Doc-Sharpness}
|
| 275 |
}
|
| 276 |
```
|
| 277 |
|
|
|
|
|
|
|
| 278 |
## License
|
| 279 |
|
| 280 |
Apache 2.0
|
|
|
|
| 263 |
- Requires GPU with sufficient VRAM for efficient inference
|
| 264 |
- Sharpness assessment is relative to training data distribution
|
| 265 |
|
| 266 |
+
## Credits & Attribution
|
| 267 |
+
|
| 268 |
+
This model is based on the **DeQA-Doc** project by Junjie Gao et al., which won the **Championship** in the VQualA 2025 DIQA (Document Image Quality Assessment) Challenge.
|
| 269 |
+
|
| 270 |
+
**Original Repository**: [https://github.com/Junjie-Gao19/DeQA-Doc](https://github.com/Junjie-Gao19/DeQA-Doc)
|
| 271 |
+
|
| 272 |
+
All credit for the research, training methodology, and model architecture goes to the original authors.
|
| 273 |
+
|
| 274 |
## Citation
|
| 275 |
|
| 276 |
+
If you use this model in your research, please cite the original paper:
|
| 277 |
+
|
| 278 |
```bibtex
|
| 279 |
+
@inproceedings{deqadoc,
|
| 280 |
+
title={{DeQA-Doc}: Adapting {DeQA-Score} to Document Image Quality Assessment},
|
| 281 |
+
author={Gao, Junjie and Liu, Runze and Peng, Yingzhe and Yang, Shujian and Zhang, Jin and Yang, Kai and You, Zhiyuan},
|
| 282 |
+
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision Workshop},
|
| 283 |
+
year={2025},
|
|
|
|
| 284 |
}
|
| 285 |
```
|
| 286 |
|
| 287 |
+
**ArXiv**: [https://arxiv.org/abs/2507.12796](https://arxiv.org/abs/2507.12796)
|
| 288 |
+
|
| 289 |
## License
|
| 290 |
|
| 291 |
Apache 2.0
|