Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated 14 days ago • 174
InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams Paper • 2601.02281 • Published 21 days ago • 33
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 20 days ago • 134
XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated Dec 4, 2025 • 11
Treble10 Collection Treble Technologies and Hugging Face have entered into a long-term collaboration. In celebration, we are releasing the Treble10 dataset. • 3 items • Updated Oct 28, 2025 • 4
Persian Models Collection This is the largest collection of Persian models available on Hugging Face • 776 items • Updated about 1 month ago • 16
Persian Datasets Collection This is the largest collection of Persian datasets available on Hugging Face • 130 items • Updated about 1 month ago • 15
NaturalVoices - Voice Conversion Datasets Collection This is a collaborative work of JHU Smile Lab and CMU MSP Lab. Please cite https://arxiv.org/abs/2511.00256 • 5 items • Updated Nov 10, 2025 • 4
Evolving Diagnostic Agents in a Virtual Clinical Environment Paper • 2510.24654 • Published Oct 28, 2025 • 12
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model Paper • 2510.24992 • Published Oct 28, 2025 • 4
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes Paper • 2510.26800 • Published Oct 30, 2025 • 22
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 121
Emu3.5: Native Multimodal Models are World Learners Paper • 2510.26583 • Published Oct 30, 2025 • 109
Emu3.5 Collection Native Multimodal Models are World Learners 🌍 • 4 items • Updated Dec 25, 2025 • 73
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 117
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30, 2025 • 85