Instructions to use OpenGVLab/InternViT-6B-224px with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/InternViT-6B-224px with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-feature-extraction", model="OpenGVLab/InternViT-6B-224px", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/InternViT-6B-224px", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -22,7 +22,7 @@ It is _**the largest open-source vision/vision-language foundation model (14B)**
|
|
| 22 |

|
| 23 |
|
| 24 |
## Model Details
|
| 25 |
-
- **Model Type:** feature backbone
|
| 26 |
- **Model Stats:**
|
| 27 |
- Params (M): 5903
|
| 28 |
- Image size: 224 x 224
|
|
@@ -53,7 +53,7 @@ outputs = model(pixel_values)
|
|
| 53 |
|
| 54 |
## Citation
|
| 55 |
|
| 56 |
-
If you find this project useful in your research, please consider
|
| 57 |
|
| 58 |
```BibTeX
|
| 59 |
@article{chen2023internvl,
|
|
|
|
| 22 |

|
| 23 |
|
| 24 |
## Model Details
|
| 25 |
+
- **Model Type:** vision foundation model, feature backbone
|
| 26 |
- **Model Stats:**
|
| 27 |
- Params (M): 5903
|
| 28 |
- Image size: 224 x 224
|
|
|
|
| 53 |
|
| 54 |
## Citation
|
| 55 |
|
| 56 |
+
If you find this project useful in your research, please consider citing:
|
| 57 |
|
| 58 |
```BibTeX
|
| 59 |
@article{chen2023internvl,
|