Add library_name and pipeline_tag to metadata
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,6 +1,11 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
base_model: Qwen/Qwen2.5-VL-7B-Instruct
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
tags:
|
| 5 |
- multimodal
|
| 6 |
- personalized-mllm
|
|
@@ -9,9 +14,6 @@ tags:
|
|
| 9 |
- cvpr2026
|
| 10 |
- personality-evolving
|
| 11 |
- benchmark
|
| 12 |
-
datasets:
|
| 13 |
-
- ClareNie/Persona-MME
|
| 14 |
-
- ClareNie/PersonaVLM-Dataset
|
| 15 |
---
|
| 16 |
|
| 17 |
# PersonaVLM: Long-Term Personalized Multimodal LLMs (CVPR 2026)
|
|
@@ -45,6 +47,17 @@ This project consists of several components. You can access the model weights, t
|
|
| 45 |
| π **Benchmark** | [HF: Persona-MME (2,000+ cases)](https://huggingface.co/datasets/ClareNie/Persona-MME) |
|
| 46 |
| π **Training Data** | [HF: PersonaVLM-Dataset (80k+ samples)](https://huggingface.co/datasets/ClareNie/PersonaVLM-Dataset) |
|
| 47 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
## βοΈ Citation
|
| 49 |
If you find our work helpful, please cite our paper:
|
| 50 |
|
|
@@ -55,4 +68,5 @@ If you find our work helpful, please cite our paper:
|
|
| 55 |
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
|
| 56 |
year={2026},
|
| 57 |
url={http://arxiv.org/abs/2604.13074}
|
| 58 |
-
}
|
|
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
base_model: Qwen/Qwen2.5-VL-7B-Instruct
|
| 3 |
+
datasets:
|
| 4 |
+
- ClareNie/Persona-MME
|
| 5 |
+
- ClareNie/PersonaVLM-Dataset
|
| 6 |
+
license: apache-2.0
|
| 7 |
+
library_name: transformers
|
| 8 |
+
pipeline_tag: image-text-to-text
|
| 9 |
tags:
|
| 10 |
- multimodal
|
| 11 |
- personalized-mllm
|
|
|
|
| 14 |
- cvpr2026
|
| 15 |
- personality-evolving
|
| 16 |
- benchmark
|
|
|
|
|
|
|
|
|
|
| 17 |
---
|
| 18 |
|
| 19 |
# PersonaVLM: Long-Term Personalized Multimodal LLMs (CVPR 2026)
|
|
|
|
| 47 |
| π **Benchmark** | [HF: Persona-MME (2,000+ cases)](https://huggingface.co/datasets/ClareNie/Persona-MME) |
|
| 48 |
| π **Training Data** | [HF: PersonaVLM-Dataset (80k+ samples)](https://huggingface.co/datasets/ClareNie/PersonaVLM-Dataset) |
|
| 49 |
|
| 50 |
+
## π Quick Start
|
| 51 |
+
Since PersonaVLM is built upon Qwen2.5-VL, it is strictly required to install `transformers==4.51.3` to prevent compatibility issues.
|
| 52 |
+
|
| 53 |
+
To run inference via CLI:
|
| 54 |
+
```bash
|
| 55 |
+
python inference.py
|
| 56 |
+
# Optional arguments:
|
| 57 |
+
# --force-retrieve : Force the agent to execute memory retrieval for every message.
|
| 58 |
+
# --reasoning-mode : Flag to toggle reasoning (default is True; passing this sets it to False).
|
| 59 |
+
```
|
| 60 |
+
|
| 61 |
## βοΈ Citation
|
| 62 |
If you find our work helpful, please cite our paper:
|
| 63 |
|
|
|
|
| 68 |
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
|
| 69 |
year={2026},
|
| 70 |
url={http://arxiv.org/abs/2604.13074}
|
| 71 |
+
}
|
| 72 |
+
```
|