OpenGVLab

community

https://github.com/opengvlab

AI & ML interests

Computer Vision

Recent Activity

Eurayka authored a paper 2 days ago

TimeLens2: Generalist Video Temporal Grounding with Multimodal LLMs

wzk1015 submitted a paper 2 days ago

WorldCupArena: Fine-Grained Evaluation of Language Models and Deep-Research Agents on Football Forecasting

ganlinyang authored a paper 28 days ago

EventVLA: Event-Driven Visual Evidence Memory for Long-Horizon Vision-Language-Action Policies

Papers

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

RIVER: A Real-Time Interaction Benchmark for Video LLMs

OpenGVLab's models (286)

OpenGVLab/InternVL3-9B-Pretrained

Image-Text-to-Text • 9B • Updated Apr 25, 2025 • 19

OpenGVLab/InternVL2_5-8B-MPO-hf

Image-Text-to-Text • 8B • Updated Apr 23, 2025 • 2.17k

OpenGVLab/InternVL2_5-2B-MPO-hf

Image-Text-to-Text • 2B • Updated Apr 23, 2025 • 4.62k

OpenGVLab/InternVL3-8B-hf

Image-Text-to-Text • 8B • Updated Apr 23, 2025 • 68.6k • 10

OpenGVLab/InternVL3-78B-hf

Image-Text-to-Text • 78B • Updated Apr 23, 2025 • 385 • 2

OpenGVLab/InternVL3-38B-hf

Image-Text-to-Text • 38B • Updated Apr 23, 2025 • 2.26k • 2

OpenGVLab/InternVL3-14B-hf

Image-Text-to-Text • 15B • Updated Apr 23, 2025 • 6.66k

OpenGVLab/InternVL3-2B-hf

Image-Text-to-Text • 2B • Updated Apr 23, 2025 • 17.7k • 3

OpenGVLab/InternVL3-1B-hf

Image-Text-to-Text • 0.9B • Updated Apr 23, 2025 • 256k • 10

OpenGVLab/VideoChat-R1_7B

Video-Text-to-Text • 8B • Updated Apr 22, 2025 • 197 • 7

OpenGVLab/VideoChat-R1_7B_caption

Video-Text-to-Text • 8B • Updated Apr 22, 2025 • 57 • 5

OpenGVLab/PIIP-LLaVA-Plus_ConvNeXt-L_CLIP-L_1024-336_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 6

OpenGVLab/clip-vit-large-patch14to16-224

0.4B • Updated Apr 20, 2025 • 6

OpenGVLab/PIIP-LLaVA_CLIP-BL_512-256_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 3

OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 6

OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 6

OpenGVLab/clip-vit-large-patch14to16-336

0.4B • Updated Apr 20, 2025 • 5

OpenGVLab/PIIP-LLaVA_CLIP-BL_512-448_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 5

OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_13B

Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 5

OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_640-224_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 5

OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_13B

Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 5

OpenGVLab/PIIP-LLaVA_CLIP-BL_512-448_13B

Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 6

OpenGVLab/InternVL3-9B-AWQ

Image-Text-to-Text • Updated Apr 17, 2025 • 26 • 1

OpenGVLab/PIIP

Object Detection • Updated Apr 16, 2025 • 5

OpenGVLab/VideoChat-R1-thinking_7B

Video-Text-to-Text • 8B • Updated Apr 13, 2025 • 6

OpenGVLab/Mini-InternVL2-2B-DA-BDD

Image-Text-to-Text • 2B • Updated Mar 26, 2025 • 27 • 1

OpenGVLab/Mini-InternVL2-2B-DA-DriveLM

Image-Text-to-Text • 2B • Updated Mar 26, 2025 • 34 • 2

OpenGVLab/Mini-InternVL2-2B-DA-Medical

Image-Text-to-Text • 2B • Updated Mar 26, 2025 • 31 • 1

OpenGVLab/InternVL2_5-26B-MPO

Image-Text-to-Text • 26B • Updated Mar 25, 2025 • 75 • 14

OpenGVLab/InternVL2_5-8B-MPO

Image-Text-to-Text • 8B • Updated Mar 25, 2025 • 1.39k • 48