CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models Paper • 2602.17684 • Published Feb 4 • 22
Efficient RLVR Training via Weighted Mutual Information Data Selection Paper • 2603.01907 • Published 10 days ago • 14
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning Paper • 2504.13914 • Published Apr 10, 2025 • 5
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 15 days ago • 43
view article Article Exploring New Frontiers of LLMs: Adaptive Dual-Search Distillation (ADS) and the 30B Model Open Beta 11 days ago • 2
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 20 days ago • 481
Kai Models Series Collection Kai Models Distilled via Adaptive Dual Search Distillation • 3 items • Updated 10 days ago • 2
Nacrith: Neural Lossless Compression via Ensemble Context Modeling and High-Precision CDF Coding Paper • 2602.19626 • Published 17 days ago • 3
view article Article Shattering the Memory Wall: O(1) Inference and Causal Monoid State Compression in Spartacus-1B 15 days ago • 2
Spartacus Monoid Reasoning Models Collection O(1) Reasoning Models • 1 item • Updated 15 days ago • 2
Geilim Smol Language Models Collection Geilim Smol Language Models • 2 items • Updated 9 days ago • 1
Weight-sparse transformers have interpretable circuits Paper • 2511.13653 • Published Nov 17, 2025 • 2
Reasoning at the Edge (HF Preprints) Collection This collection traces the mathematical and empirical limits of machine reasoning. • 12 items • Updated 12 days ago • 1
view article Article Project SmolMoE-8x135M: From Zero to a Custom Mixture-of-Experts Model Aug 7, 2025 • 4