nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated about 5 hours ago • 2.85k • 146
🤏 Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated 11 days ago • 12
Claude 4.5 Opus Collection Distilled models and datasets for Claude 4.5 Opus. • 14 items • Updated 11 days ago • 29