Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 12 days ago • 68
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 15 days ago • 39
Jupyter Agent Collection Blog: https://huggingface.co/blog/jupyter-agent-2 • 4 items • Updated Sep 12, 2025 • 3
MamayLM-v1.0-Gemma-3 Collection First Open and Multimodal Ukrainian-focused LLM • 5 items • Updated Oct 8, 2025 • 17
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 179
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 326
Article Announcing UA-Code-Bench: a New Benchmark for Evaluating LLMs on Competitive Programming Tasks in Ukrainian Jul 12, 2025 • 2
Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 92
Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages Jul 8, 2025 • 35
The Jailbreak Tax (Jailbreak Utility) Collection Models and dataset used in paper "The Jailbreak Tax: How Useful Are Your Jailbreak Outputs" • 13 items • Updated Apr 5, 2025 • 2
MamayLM-Gemma-2 Collection Ukrainian-focused MamayLM model collection, based on Gemma2 • 2 items • Updated Oct 8, 2025 • 3
Article Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM Apr 23, 2025 • 62