view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth mlabonne โข Jul 29, 2024 โข 371
GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents Paper โข 2603.24329 โข Published Mar 25 โข 28
Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents Paper โข 2510.23691 โข Published Oct 27, 2025 โข 56
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers โข 49 items โข Updated 3 days ago โข 153
microsoft/Phi-3-mini-4k-instruct-gguf Text Generation โข 4B โข Updated Dec 10, 2025 โข 52.9k โข 581
view article Article Introducing NVIDIA Cosmos Policy for Advanced Robot Control nvidia โข Jan 29 โข 48
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models Paper โข 2602.07026 โข Published Feb 2 โข 140
view article Article State of open video generation models in Diffusers +1 sayakpaul, a-r-r-o-w, dn6 โข Jan 27, 2025 โข 70
view post Post 3209 releasing: smol vision ๐ผ A repository with notebooks on shrinking, optimizing, speeding-up, customizing large vision models! https://github.com/merveenoyan/smol-vision 1 reply ยท ๐ฅ 18 18 โค๏ธ 4 4 ๐ 3 3 ๐ค 2 2 ๐ 1 1 ๐ค 1 1 ๐ง 1 1 ๐คฏ 1 1 โ 1 1 ๐ 1 1 ๐ 1 1 + Reply
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis Paper โข 2307.15042 โข Published Jul 27, 2023 โข 7 โข 1
view article Article Arc Virtual Cell Challenge: A Primer FL33TW00D-HF, abhinadduri โข Jul 18, 2025 โข 66
view article Article You could have designed state of the art positional encoding FL33TW00D-HF โข Nov 25, 2024 โข 478
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix codelion โข Nov 3, 2025 โข 65