Multimodal Implementations Collection Comprehensive Demo of Multimodal VLMs on the Hub β’ 24 items β’ Updated 7 days ago β’ 13
view article Article Weβre open-sourcing our text-to-image model and the process behind it Nov 12, 2025 β’ 94
view article Article Train AI models with Unsloth and Hugging Face Jobs for FREE +4 21 days ago β’ 83
BitDance Collection BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. β’ 10 items β’ Updated 10 days ago β’ 11
Tiny Aya Collection Bridging Scale and Multilingual Depth β’ 10 items β’ Updated 24 days ago β’ 64
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper β’ 2601.10714 β’ Published Jan 15 β’ 31
YOLO26 Models Collection YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demos and ONNX variants. β’ 42 items β’ Updated Jan 19 β’ 36
CoreML Collection Models for Apple devices. See https://github.com/FluidInference/FluidAudio for usage details β’ 12 items β’ Updated about 6 hours ago β’ 5
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 β’ 110
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 β’ 43
DictaLM 3.0 Collection Collection Dicta-LM 3.0 is a powerful open-weight collection of sovereign LLMs for Hebrew. β’ 24 items β’ Updated Dec 10, 2025 β’ 18
view article Article How to make NeuTTS-air generate over 200 seconds of audio in a single second. Nov 21, 2025 β’ 24