JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation Paper • 2510.00974 • Published Oct 1, 2025 • 1
PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations Paper • 2505.24717 • Published May 30, 2025 • 1
MolmoAct2 Models Collection Collection of the base models for MolmoAct2 • 6 items • Updated 3 days ago • 13
MolmoAct2 Datasets Collection Collection of robotics datasets for MolmoAct2 • 8 items • Updated 3 days ago • 8
NVIDIA Ising Collection NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI. • 4 items • Updated 18 days ago • 21
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 28 days ago • 17
WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated 25 days ago • 17
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling Paper • 2504.14219 • Published Apr 19, 2025 • 2
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 18 days ago • 34
VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference Paper • 2512.01031 • Published Nov 30, 2025 • 26
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published Feb 12 • 62
WorldCompass: Reinforcement Learning for Long-Horizon World Models Paper • 2602.09022 • Published Feb 9 • 21
Self-Hinting Language Models Enhance Reinforcement Learning Paper • 2602.03143 • Published Feb 3 • 31
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published Dec 29, 2025 • 45