TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 44
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 21 days ago • 483
Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 Feb 4 • 88
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published Jan 5 • 112
Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 16 items • Updated Jan 1 • 16
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 38
Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 305
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data Paper • 2402.15343 • Published Feb 23, 2024 • 16