OpenGVLab
/

InternViT-6B-224px

Image Feature Extraction

feature-extraction

Model card Files Files and versions

czczup commited on Dec 23, 2023

Commit

58b8706

·

1 Parent(s): 94d2d17

Update README.md

Files changed (1) hide show

README.md +23 -0

README.md CHANGED Viewed

@@ -1,7 +1,30 @@
 ---
 license: mit
 ---
 ```python
 import torch

 ---
 license: mit
+datasets:
+- laion/laion2B-en
+- laion/laion-coco
+- laion/laion2B-multi
+- kakaobrain/coyo-700m
+- conceptual_captions
+- wanng/wukong100m
 ---
+# Model card for InternViT-6B-224px
+## Model Details
+- **Model Type:** feature backbone
+- **Model Stats:**
+  - Params (M): 5903
+  - Image size: 224 x 224
+- **Papers:**
+  - InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
+- **GitHub:**
+  - https://github.com/OpenGVLab/InternVL
+- **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
+## Model Usage
+### Image Embeddings
 ```python
 import torch