--- tags: - Image quality assessment - GRMP-IQA license: mit metrics: - PLCC - SRCC language: - en --- # GRMP-IQA Model Card ### Installation ```bash pip install torch==1.12.0 torchvision==0.13.0 pip install -r requirements.txt ``` ### Quick Start #### 1. Meta-Learning Pre-training ```bash python pretrain.py ``` #### 2. Few-shot Fine-tuning ```bash # 50-shot fine-tuning on CLIVE python finetune.py --dataset clive --num_image 50 --lda 5.0 # Fine-tuning on KonIQ-10K python finetune.py --dataset koniq --num_image 50 --lda 5.0 # Using pre-trained model python finetune.py --dataset clive --num_image 50 --pretrained --lda 5.0 ``` #### 3. Python API Usage ```python import torch from CLIP import clip from finetune import CustomCLIP, load_clip_to_cpu # Load pre-trained model classnames = [['good', 'bad'], ['clear', 'unclear'], ['high quality', 'low quality']] clip_model = load_clip_to_cpu('ViT-B/16').float() model = CustomCLIP(classnames, clip_model) # Load checkpoint checkpoint = torch.load('path/to/checkpoint.pt') model.load_state_dict(checkpoint, strict=False) # Inference model.eval() with torch.no_grad(): # image: torch.Tensor [B, 3, 224, 224] logits = model(image) quality_score = torch.softmax(logits[:, :2], dim=-1)[:, 0] ``` ## Hugging Face Model Hub ### Available Resources Our model and associated resources are available on the Hugging Face Model Hub: **Repository**: [GRMP-IQA](https://huggingface.co/zzhowe/GRMP-IQA) ### Usage Example with Hugging Face ```python from huggingface_hub import hf_hub_download import torch import scipy.io as sio # Download pre-trained model weights model_path = hf_hub_download( repo_id="zzhowe/GRMP-IQA", filename="clive_50_prompt_lda_5.0.pt" ) # Download dataset file dataset_path = hf_hub_download( repo_id="zzhowe/GRMP-IQA", filename="LIVE_224.mat" ) # Load model model = torch.load(model_path, map_location='cpu') # Load dataset dataset = sio.loadmat(dataset_path) ``` ## Citation If you use this model in your research, please cite: ```bibtex @article{li2024boosting, title={Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models}, author={Li, Xudong and Huang, Zihao and Hu, Runze and Zhang, Yan and Cao, Liujuan and Ji, Rongrong}, journal={arXiv preprint arXiv:2409.05381}, year={2024} } ``` ## License This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details. ## Contact For questions or issues, please contact: - 📧 Email: [lxd761050753@gmail.com](mailto:lxd761050753@gmail.com) - 📧 Email: [huangzihhhh@gmail.com](mailto:huangzihhhh@gmail.com) ## Acknowledgments - CLIP model from OpenAI - PyTorch team for the deep learning framework - All contributors to the IQA datasets used in training