MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 31 items • Updated 1 day ago • 64
Nemotron v3 Pre-Training Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 5 days ago • 8
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 8 items • Updated 5 days ago • 62
view article Article Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time Feb 18, 2025 • 35