Collections of multimodal (image+text) instruction finetuning datasets tailored for visual language models like LlaVA, Fuyu, or IDEFICS.
Victor Sanh PRO
VictorSanh
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 2 hours ago
notesbymuneeb/epstein-emails
liked
a model
9 days ago
moonshotai/Kimi-K2.5
liked
a dataset
15 days ago
mercor/apex-agents