AI & ML interests

Video Captioning, Multimodal Large Language Models (MLLMs), Video Understanding, Vision-Language Models, Reinforcement Learning, Dataset Construction

Recent Activity

qx112 updated a model about 2 months ago
OwlCap/OwlCap-7B
qx112 updated a dataset about 2 months ago
OwlCap/HMD-270K
Chunlin13 published a dataset about 2 months ago
OwlCap/HMD-270K