zzhowe
/

GRMP-IQA

Image quality assessment

Model card Files Files and versions

GRMP-IQA / README.md

lxdmac's picture

push model card

31bbff1 7 months ago

|

history blame contribute delete

2.88 kB

	---
	tags:
	- Image quality assessment
	- GRMP-IQA
	license: mit
	metrics:
	- PLCC
	- SRCC
	language:
	- en
	---

	# GRMP-IQA Model Card

	### Installation

	```bash
	pip install torch==1.12.0 torchvision==0.13.0
	pip install -r requirements.txt
	```

	### Quick Start

	#### 1. Meta-Learning Pre-training
	```bash
	python pretrain.py
	```

	#### 2. Few-shot Fine-tuning
	```bash
	# 50-shot fine-tuning on CLIVE
	python finetune.py --dataset clive --num_image 50 --lda 5.0

	# Fine-tuning on KonIQ-10K
	python finetune.py --dataset koniq --num_image 50 --lda 5.0

	# Using pre-trained model
	python finetune.py --dataset clive --num_image 50 --pretrained --lda 5.0
	```

	#### 3. Python API Usage

	```python
	import torch
	from CLIP import clip
	from finetune import CustomCLIP, load_clip_to_cpu

	# Load pre-trained model
	classnames = [['good', 'bad'], ['clear', 'unclear'], ['high quality', 'low quality']]
	clip_model = load_clip_to_cpu('ViT-B/16').float()
	model = CustomCLIP(classnames, clip_model)

	# Load checkpoint
	checkpoint = torch.load('path/to/checkpoint.pt')
	model.load_state_dict(checkpoint, strict=False)

	# Inference
	model.eval()
	with torch.no_grad():
	# image: torch.Tensor [B, 3, 224, 224]
	logits = model(image)
	quality_score = torch.softmax(logits[:, :2], dim=-1)[:, 0]
	```

	## Hugging Face Model Hub

	### Available Resources

	Our model and associated resources are available on the Hugging Face Model Hub:

	Repository: [GRMP-IQA](https://huggingface.co/zzhowe/GRMP-IQA)

	### Usage Example with Hugging Face

	```python
	from huggingface_hub import hf_hub_download
	import torch
	import scipy.io as sio

	# Download pre-trained model weights
	model_path = hf_hub_download(
	repo_id="zzhowe/GRMP-IQA",
	filename="clive_50_prompt_lda_5.0.pt"
	)

	# Download dataset file
	dataset_path = hf_hub_download(
	repo_id="zzhowe/GRMP-IQA",
	filename="LIVE_224.mat"
	)

	# Load model
	model = torch.load(model_path, map_location='cpu')

	# Load dataset
	dataset = sio.loadmat(dataset_path)
	```

	## Citation

	If you use this model in your research, please cite:

	```bibtex
	@article{li2024boosting,
	title={Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models},
	author={Li, Xudong and Huang, Zihao and Hu, Runze and Zhang, Yan and Cao, Liujuan and Ji, Rongrong},
	journal={arXiv preprint arXiv:2409.05381},
	year={2024}
	}
	```

	## License

	This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

	## Contact

	For questions or issues, please contact:
	- 📧 Email: [lxd761050753@gmail.com](mailto:lxd761050753@gmail.com)
	- 📧 Email: [huangzihhhh@gmail.com](mailto:huangzihhhh@gmail.com)

	## Acknowledgments

	- CLIP model from OpenAI
	- PyTorch team for the deep learning framework
	- All contributors to the IQA datasets used in training