Welcome to OpenGVLab! We are a research group from Shanghai AI Lab focused on Vision-Centric AI research.

- [InternImage](https://github.com/OpenGVLab/InternImage): a large-scale vision foundation model with deformable convolutions.
- [InternVideo](https://github.com/OpenGVLab/InternVideo): large-scale video foundation models for multimodal understanding.
- [VideoChat](https://github.com/OpenGVLab/Ask-Anything): an end-to-end chat assistant for video comprehension.
- [All-Seeing-V1](https://github.com/OpenGVLab/all-seeing): towards panoptic visual recognition and understanding of the open world.
- [All-Seeing-V2](https://github.com/OpenGVLab/all-seeing): towards general relation comprehension of the open world.

# Datasets

- [ShareGPT4o](https://sharegpt4o.github.io/): a groundbreaking large-scale resource that we plan to open-source, comprising 200K meticulously annotated images, 10K videos with highly descriptive captions, and 10K audio files with detailed descriptions.
- [InternVid](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid): a large-scale video-text dataset for multimodal understanding and generation.

# Benchmarks

- [MVBench](https://github.com/OpenGVLab/Ask-Anything/tree/main/video_chat2): a comprehensive benchmark for multimodal video understanding.