Visual Question Answering
Transformers
Safetensors
English
Chinese
minicpmv
feature-extraction
custom_code
Eval Results
Instructions to use openbmb/MiniCPM-V-2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openbmb/MiniCPM-V-2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="openbmb/MiniCPM-V-2", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("openbmb/MiniCPM-V-2", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -3,7 +3,7 @@ pipeline_tag: visual-question-answering
|
|
| 3 |
---
|
| 4 |
|
| 5 |
## MiniCPM-V 2.0
|
| 6 |
-
**MiniCPM-V 2.8B** is
|
| 7 |
|
| 8 |
- 🔥 **State-of-the-art Performance.**
|
| 9 |
|
|
|
|
| 3 |
---
|
| 4 |
|
| 5 |
## MiniCPM-V 2.0
|
| 6 |
+
**MiniCPM-V 2.8B** is a strong multimodal large language model for efficient end-side deployment. The model is built based on SigLip-400M and [MiniCPM-2.4B](https://github.com/OpenBMB/MiniCPM/), connected by a perceiver resampler. Our latest version, **MiniCPM-V 2.0** has several notable features.
|
| 7 |
|
| 8 |
- 🔥 **State-of-the-art Performance.**
|
| 9 |
|