OpenGVLab
/

InternViT-6B-224px

Image Feature Extraction

feature-extraction

Model card Files Files and versions

czczup commited on Feb 11, 2024

Commit

191faa6

·

verified ·

1 Parent(s): a11177b

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -28,6 +28,14 @@ It is _**the largest open-source vision/vision-language foundation model (14B)**
   - Image size: 224 x 224
 - **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
 ## Model Usage (Image Embeddings)
 ```python

   - Image size: 224 x 224
 - **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
+## Linear Probing Performance
+See this [document](https://github.com/OpenGVLab/InternVL/tree/main/classification) for more details about the linear probing evaluation.
+| IN-1K | IN-ReaL | IN-V2 | IN-A | IN-R | IN-Sketch |
+| :---: | :-----: | :---: | :--: | :--: | :-------: |
+| 88.2  |  90.4   | 79.9  | 77.5 | 89.8 |   69.1    |
 ## Model Usage (Image Embeddings)
 ```python