Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

Nous' Flagship LLM Series

NousResearch/Hermes-2-Theta-Llama-3-70B

Text Generation • 71B • Updated Sep 8, 2024 • 1.77k • • 82
NousResearch/Hermes-2-Pro-Llama-3-70B

Text Generation • 71B • Updated Sep 8, 2024 • 180 • • 35
NousResearch/Hermes-2-Theta-Llama-3-8B

Text Generation • 8B • Updated Sep 8, 2024 • 11.3k • • 207
NousResearch/Hermes-2-Pro-Llama-3-8B

Text Generation • 8B • Updated Sep 14, 2024 • 20.5k • • 453

Partial layer training LLMs

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 24
GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2505.20355 • Published May 26, 2025 • 37
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 47
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19, 2025 • 46

DistilBERT release

Original DistilBERT model, checkpoints obtained from using teacher-student learning from the original BERT checkpoints.

distilbert/distilbert-base-cased

Fill-Mask • 65.8M • Updated May 6, 2024 • 315k • • 67
distilbert/distilbert-base-uncased

Fill-Mask • 67M • Updated May 6, 2024 • 8.86M • • 903
distilbert/distilbert-base-multilingual-cased

Fill-Mask • 0.1B • Updated May 6, 2024 • 600k • • 244
distilbert/distilbert-base-uncased-finetuned-sst-2-english

Text Classification • 67M • Updated Dec 19, 2023 • 3.64M • • 912

SDXL fine-tunes

diffusers/stable-diffusion-xl-1.0-inpainting-0.1

Text-to-Image • Updated Sep 3, 2023 • 72.8k • 376

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 85
SlimPajama-DC: Understanding Data Combinations for LLM Training

Paper • 2309.10818 • Published Sep 19, 2023 • 11
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 24
Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 40

Tiny datasets that empower the foundation of Small Language Model!

nampdn-ai/tiny-textbooks

Viewer • Updated Jul 3, 2024 • 420k • 507 • 179
nampdn-ai/tiny-codes

Viewer • Updated Sep 30, 2023 • 1.63M • 1.2k • 291
nampdn-ai/tiny-strange-textbooks

Viewer • Updated Feb 2, 2024 • 1M • 107 • 92
nampdn-ai/tiny-math-textbooks

Viewer • Updated Jan 27, 2024 • 635k • 105 • 33

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard)

Running on CPU Upgrade

14k

Open LLM Leaderboard

🏆

14k

Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade

7.51k

MTEB Leaderboard

📊

7.51k

Embedding Leaderboard
Running

4.93k

Arena Leaderboard

🏆

4.93k

View the LMArena leaderboard in full‑screen
Running

Agents

Featured

588

LLM-Perf Leaderboard

🏆

588

Compare LLM hardware performance and find the best model

models_collection

about 1 hour ago

facebook/rag-token-nq

Updated Nov 13, 2023 • 3.75k • 179

Nous' Flagship LLM Series

NousResearch/Hermes-2-Theta-Llama-3-70B

Text Generation • 71B • Updated Sep 8, 2024 • 1.77k • • 82
NousResearch/Hermes-2-Pro-Llama-3-70B

Text Generation • 71B • Updated Sep 8, 2024 • 180 • • 35
NousResearch/Hermes-2-Theta-Llama-3-8B

Text Generation • 8B • Updated Sep 8, 2024 • 11.3k • • 207
NousResearch/Hermes-2-Pro-Llama-3-8B

Text Generation • 8B • Updated Sep 14, 2024 • 20.5k • • 453

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 85
SlimPajama-DC: Understanding Data Combinations for LLM Training

Paper • 2309.10818 • Published Sep 19, 2023 • 11
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 24
Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 40

Partial layer training LLMs

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 24
GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2505.20355 • Published May 26, 2025 • 37
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 47
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19, 2025 • 46

Tiny datasets that empower the foundation of Small Language Model!

nampdn-ai/tiny-textbooks

Viewer • Updated Jul 3, 2024 • 420k • 507 • 179
nampdn-ai/tiny-codes

Viewer • Updated Sep 30, 2023 • 1.63M • 1.2k • 291
nampdn-ai/tiny-strange-textbooks

Viewer • Updated Feb 2, 2024 • 1M • 107 • 92
nampdn-ai/tiny-math-textbooks

Viewer • Updated Jan 27, 2024 • 635k • 105 • 33

DistilBERT release

Original DistilBERT model, checkpoints obtained from using teacher-student learning from the original BERT checkpoints.

distilbert/distilbert-base-cased

Fill-Mask • 65.8M • Updated May 6, 2024 • 315k • • 67
distilbert/distilbert-base-uncased

Fill-Mask • 67M • Updated May 6, 2024 • 8.86M • • 903
distilbert/distilbert-base-multilingual-cased

Fill-Mask • 0.1B • Updated May 6, 2024 • 600k • • 244
distilbert/distilbert-base-uncased-finetuned-sst-2-english

Text Classification • 67M • Updated Dec 19, 2023 • 3.64M • • 912

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard)

Running on CPU Upgrade

14k

Open LLM Leaderboard

🏆

14k

Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade

7.51k

MTEB Leaderboard

📊

7.51k

Embedding Leaderboard
Running

4.93k

Arena Leaderboard

🏆

4.93k

View the LMArena leaderboard in full‑screen
Running

Agents

Featured

588

LLM-Perf Leaderboard

🏆

588

Compare LLM hardware performance and find the best model

SDXL fine-tunes

diffusers/stable-diffusion-xl-1.0-inpainting-0.1

Text-to-Image • Updated Sep 3, 2023 • 72.8k • 376

models_collection

about 1 hour ago

facebook/rag-token-nq

Updated Nov 13, 2023 • 3.75k • 179

Previous
1
...
54
55
56
57
58
...
21,397
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs