Collections

Discover the best community collections!

Collections trending this week
Papers
Machine Learning and Neural Network papers 📜
GIT
GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering.
ARCH models, benchmark and paper
This collection contains pre-trained models on the AudioSet dataset, offering a diverse set of features for audio representation learning.
EdsEyes
A series of multimodal large language models developed for robotics
GIT
GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering.
ARCH models, benchmark and paper
This collection contains pre-trained models on the AudioSet dataset, offering a diverse set of features for audio representation learning.
EdsEyes
A series of multimodal large language models developed for robotics
Papers
Machine Learning and Neural Network papers 📜