Instructions to use OpenGVLab/InternViT-6B-224px with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/InternViT-6B-224px with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-feature-extraction", model="OpenGVLab/InternViT-6B-224px", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/InternViT-6B-224px", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -31,7 +31,7 @@ pipeline_tag: image-feature-extraction
|
|
| 31 |
- Params (M): 5903
|
| 32 |
- Image size: 224 x 224
|
| 33 |
- **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
|
| 34 |
-
- **Note:** This model has 48 blocks, and we found that using the output after the fourth-to-last block worked best for VLLM. Therefore,
|
| 35 |
|
| 36 |
|
| 37 |
## Linear Probing Performance
|
|
|
|
| 31 |
- Params (M): 5903
|
| 32 |
- Image size: 224 x 224
|
| 33 |
- **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
|
| 34 |
+
- **Note:** This model has 48 blocks, and we found that using the output after the fourth-to-last block worked best for VLLM. Therefore, when building a VLLM with this model, **please use the features from the fourth-to-last layer.**
|
| 35 |
|
| 36 |
|
| 37 |
## Linear Probing Performance
|