Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 36 items • Updated 1 day ago • 148
Zero-To-CAD Collection Datasets (1M & 100K) and model for synthesizing executable CAD programs from an LLM in a CadQuery environment. No real data used. • 3 items • Updated 12 days ago • 13
Indic Alpaca Datasets Collection This collection comprises an alpaca datasets that encompasses a wide range of Indian languages. • 18 items • Updated Mar 21, 2024 • 10
story writing favourites Collection Models I personally liked for generating stories in the past. Not a recommendation, most of these are outdated. • 17 items • Updated Mar 2 • 101
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 30 items • Updated 6 days ago • 80
GoClick: Lightweight Element Grounding Model for Autonomous GUI Interaction Paper • 2604.23941 • Published 10 days ago • 5
VideoThinker: Building Agentic VideoLLMs with LLM-Guided Tool Reasoning Paper • 2601.15724 • Published Jan 22 • 1
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding Paper • 2312.02051 • Published Dec 4, 2023 • 2
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis Paper • 2505.13227 • Published May 19, 2025 • 46
Auto-Aggressive (Uncensored Qwen 3.6 Native Multimodal) Collection Collection of Uncensored Variants of Qwen3.6 Native Multimodal Models • 4 items • Updated 9 days ago • 3
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published 16 days ago • 28
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published 16 days ago • 87
AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 23 days ago • 162
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 17 days ago • 90
CogVLM2 Collection This collection hosts the repos of the THUDM's CogVLM2 releases • 8 items • Updated Jun 30, 2025 • 22
gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provid • 4 items • Updated Feb 10 • 44