OpenGVLab/InternVL-14B-224px

czczup committed on Dec 26, 2023

Commit 0b827a6 · 1 Parent(s): 44e4329

Update README.md

Files changed (1)

README.md +3 -0

@@ -20,6 +20,9 @@ InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM.
 It is _**the largest open-source vision/vision-language foundation model (14B)**_ to date, achieving _**32 state-of-the-art**_ performances on a wide range of tasks such as visual perception, cross-modal retrieval, multimodal dialogue, etc.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/f1jTYvyxyYbHRalvgtKY2.png)
 ## Model Details
 - **Model Type:** vision-language foundation model
 - **Model Stats:**