OptiMind: Teaching LLMs to Think Like Optimization Experts Paper • 2509.22979 • Published Sep 26, 2025 • 3
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 5 days ago • 63
Clara-Molecular Collection NVIDIA Clara Models for Molecular Science • 10 items • Updated 6 days ago • 7
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 18 days ago • 208
Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published 15 days ago • 20
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 18 days ago • 29
Jamba Reasoning 3B Collection AI21's top-performing reasoning model that packs leading scores on intelligence benchmarks and highly-efficient processing into a compact 3B build • 2 items • Updated Oct 8, 2025 • 6
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 24 items • Updated 3 days ago • 92
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published Dec 23, 2025 • 83
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 68 items • Updated 4 days ago • 318
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 21 days ago • 37
MiroThinker-v1.5 Collection MiroMind’s Open Source Research Agent for Prediction • 4 items • Updated 10 days ago • 24
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 178
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published Dec 17, 2025 • 43