Add library_name, pipeline_tag, Github code
#3 opened by nielsr (HF Staff)
README.md CHANGED
@@ -2,9 +2,11 @@
 language:
 - en
 license: apache-2.0
+library_name: diffusers
+pipeline_tag: image-text-to-text
 ---
-This is BLIP3o-4B checkpoint trained on the **open source** data.
 
+This is BLIP3o-4B checkpoint trained on the **open source** data described in the paper [BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset](https://huggingface.co/papers/2505.09568).
 
 | Model               | Pretrain Data                                             | GenEval | DBP    | WISE |
 |---------------------|-----------------------------------------------------------|---------|--------|------|
@@ -12,6 +14,7 @@ This is BLIP3o-4B checkpoint trained on the **open source** data.
 | 8B (open source)    | 30 million open-source data                               | 0.83    | 80.73  | 0.52 |
 | 8B (paper reported) | 30 million open-source + 30 million proprietary data      | 0.84    | 81.60  | 0.62 |
 
+See https://github.com/JiuhaiChen/BLIP3o for the code.
 
 ### Download
 
@@ -39,4 +42,4 @@ Launch with your model path:
 
 ```
 python app.py /path/to/your/model
-```
+```
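A quick way to sanity-check the metadata this change adds is to parse the README front matter and confirm the two new keys are present. The following is a minimal sketch, not part of the PR: `parse_front_matter` is a hypothetical helper written for illustration, it handles only flat `key: value` pairs and simple `- item` lists rather than general YAML, and the sample text assumes the README opens with a `---` line (line 1 is not shown in the diff).

```python
def parse_front_matter(readme_text):
    """Return top-level keys from a README's front matter block.

    Hypothetical helper for illustration: supports flat scalars and
    simple `- item` lists only, not general YAML.
    """
    lines = readme_text.splitlines()
    # The metadata block must start on the very first line.
    if not lines or lines[0].strip() != "---":
        return {}

    meta = {}
    current_key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the metadata block
            break
        if stripped.startswith("- "):
            # List item: attach it to the most recent key that has no scalar value.
            if current_key is not None and isinstance(meta.get(current_key), list):
                meta[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            current_key = key.strip()
            value = value.strip()
            # An empty value means a list is expected on the following lines.
            meta[current_key] = value if value else []
    return meta


# Front matter as it reads after this change; the opening `---` is assumed.
README = """---
language:
- en
license: apache-2.0
library_name: diffusers
pipeline_tag: image-text-to-text
---

Model card body goes here.
"""

meta = parse_front_matter(README)
missing = [key for key in ("library_name", "pipeline_tag") if key not in meta]
print("missing keys:", missing)
```

For anything beyond a spot check, a real YAML parser would be the safer choice, since this sketch ignores nesting, quoting, and multi-line values.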