Efficient Context Scaling with LongCat ZigZag Attention Paper • 2512.23966 • Published about 1 month ago • 7
view article Article Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads 23 days ago • 6
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 24 days ago • 37
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 9 items • Updated 22 days ago • 18
HyperCLOVA X SEED Collection HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 6 items • Updated Dec 24, 2025 • 41
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 7 items • Updated 9 days ago • 128
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 9 days ago • 95
view changelog Changelog Team & Enterprise Articles Now Featured on the Hugging Face Blog Dec 8, 2025 • 91
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published Dec 1, 2025 • 102
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 294
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 287
INTELLECT-3 Collection INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated Nov 28, 2025 • 11