OpenGVLab
/

InternViT-6B-224px

Image Feature Extraction

feature-extraction

Model card Files Files and versions

czczup commited on Feb 11, 2024

Commit

a11177b

·

verified ·

1 Parent(s): eede272

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ It is _**the largest open-source vision/vision-language foundation model (14B)**
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/k5UATwX5W2b5KJBN5C58x.png)
 ## Model Details
-- **Model Type:** feature backbone
 - **Model Stats:**
   - Params (M): 5903
   - Image size: 224 x 224
@@ -53,7 +53,7 @@ outputs = model(pixel_values)
 ## Citation
-If you find this project useful in your research, please consider cite:
 ```BibTeX
 @article{chen2023internvl,

 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/k5UATwX5W2b5KJBN5C58x.png)
 ## Model Details
+- **Model Type:** vision foundation model, feature backbone
 - **Model Stats:**
   - Params (M): 5903
   - Image size: 224 x 224
 ## Citation
+If you find this project useful in your research, please consider citing:
 ```BibTeX
 @article{chen2023internvl,