Collections

Discover the best community collections!

Collections trending this week
TinyLettuce
This collection contains our small models based on the Ettin encoder (https://arxiv.org/abs/2507.11412), trained on synthetic and RagTruth data.
Apertus LLM
Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data, open-weights models, multilingual in >1000 languages
Nemotron-Pre-Training-Datasets
Large-scale pre-training datasets used in the Nemotron family of models.
🇪🇪 Estonian LLM Evaluation
A collection of resources for evaluation of LLM capabilities in the Estonian language.