OpenGVLab/InternViT-6B-224px


czczup committed on Dec 26, 2023

Commit 48cb3a3 · 1 Parent(s): a393696

Update README.md

Files changed (1)

README.md +1 -0


@@ -19,6 +19,7 @@ InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM.
 It is _**the largest open-source vision/vision-language foundation model (14B)**_ to date, achieving _**32 state-of-the-art**_ performances on a wide range of tasks such as visual perception, cross-modal retrieval, multimodal dialogue, etc.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/QmVXOyr4uFQLx5Q-WLn9-.png)
 ## Model Details
 - **Model Type:** feature backbone