3DV 2026 Collection Collection of all the 3DV models, datasets and demos • 27 items • Updated 11 days ago • 4
PALM: A Dataset and Baseline for Learning Multi-subject Hand Prior Paper • 2511.05403 • Published Nov 7, 2025 • 1
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 29 items • Updated 5 days ago • 138
OpenX-LeRobot Collection Open X-Embodiment datasets in LeRobot format with standard transfomation (https://github.com/Tavish9/any4lerobot) • 32 items • Updated Mar 2 • 33
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20, 2025 • 22
Granite Docling Models Collection Models for parsing complex PDFs and structured documents, designed to complement Docling. • 4 items • Updated 4 days ago • 60
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 54
view article Article Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub Jun 27, 2025 • 31
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 202
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6, 2025 • 56
Holo1 Collection Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated Jun 10, 2025 • 49