💻 Qwopus-Coder Collection Reasoning-distilled coding models optimized for specialized domains like agentic workflows. • 7 items • Updated 14 days ago • 25
FastContext: Training Efficient Repository Explorer for Coding Agents Paper • 2606.14066 • Published 17 days ago • 93
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 17 days ago • 35
Nemotron Math & Reasoning Collection Datasets for building models that excel at math reasoning, proofs, and quantitative problem-solving. Covers SFT, RL, and pretraining data. • 23 items • Updated 17 days ago • 11
Nemotron Code & SWE Collection Datasets for building models that write, debug, and reason about code. Covers competitive programming, software engineering, and code pretraining. • 14 items • Updated 17 days ago • 6
Nemotron Agentic & Tool-Use Collection Datasets for building models capable of function calling, multi-step agentic tasks, terminal use, and SWE workflows. • 11 items • Updated 17 days ago • 11
Nemotron Supervised Fine-Tuning Collection SFT datasets covering math, code, chat, safety, agentic, VLM, multilingual, and specialized domains. • 44 items • Updated 17 days ago • 11
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 910
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 66
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models Paper • 2410.02355 • Published Oct 3, 2024 • 1
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 23 items • Updated 17 days ago • 330
rikunarita's Space of Qwen3.5 (llama.cpp) Collection This collection is a space for the Qwen 3.5 series. • 5 items • Updated May 3 • 3
Qwen 3.5 - 0.8, 2, 4, 9, 27, 35B - regular / uncensored Collection Min 256k context + images : Reg, Heretic, Heretic fine tunes of Qwen 3.5 in all sizes. • 41 items • Updated 18 days ago • 47
Article Introducing Storage Buckets on the Hugging Face Hub +10 Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner • Mar 10 • 195
Dark MOEs - Mixture of Experts - Uncensored Creative Models Collection Listed in terms of most power; including both GGUF and Source. Link to quants on source pages too. You can dial up or down the number of experts. • 24 items • Updated Mar 24 • 24
Dark / Evil / NSFW Reasoning Models (gguf/source) Collection Models that are dark/evil/corrupt (and many times NSFW!) to begin with then I add reasoning/thinking to them to make them even... ahh... better. • 134 items • Updated 7 days ago • 182