Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 6 days ago • 58
view article Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 14 days ago • 23
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 13 days ago • 68
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 2 days ago • 235
view article Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 18 days ago • 43
view article Article Follow the White Rabbit: Using Embeddings So You Never Get Lost in Translation 27 days ago • 8
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 about 1 month ago • 488
Real-time Vision Models Collection A collection of real-time detectors. • 20 items • Updated Feb 18 • 23