NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 23 items • Updated 14 days ago • 330
MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models Paper • 2602.12871 • Published Feb 13 • 18
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 183
KORMo pretraining datasets Collection The pretraining datasets for KORMo-10B were collected from diverse, publicly available source. • 14 items • Updated Oct 13, 2025 • 22
Tri Series Collection Introducing our new series of models: Tri-7B, Tri-21B, and Tri-70B-preview-SFT • 12 items • Updated Feb 20 • 11
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 780
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention sirluk • Oct 7, 2024 • 71
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17, 2025 • 98
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 125
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14, 2025 • 62
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code Paper • 2410.08196 • Published Oct 10, 2024 • 49
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation Paper • 2412.10151 • Published Dec 13, 2024 • 7
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 175
Understanding Reference Policies in Direct Preference Optimization Paper • 2407.13709 • Published Jul 18, 2024 • 17