Small Language Models Collection

Below is a list of small language models suitable for various tasks:

Model Name Task/Capability Hugging Face Link
BERT Base General Text Classification https://huggingface.co/bert-base-uncased
DistilBERT Efficient Text Classification https://huggingface.co/distilbert-base-uncased
RoBERTa Base Advanced Text Classification https://huggingface.co/roberta-base
ALBERT Base Efficient Large-Scale Classification https://huggingface.co/albert-base-v2
T5 Small Text-to-Text Generation https://huggingface.co/t5-small
T5 Base General Text-to-Text Tasks https://huggingface.co/t5-base
T5 Large Advanced Text-to-Text Generation https://huggingface.co/t5-large
Longformer Base Long-Sequence Text Processing https://huggingface.co/longformer-base-4096
BigBird Base Long-Sequence Text Processing https://huggingface.co/google/bigbird-base-4096
Reformer Base Efficient Long-Sequence Processing https://huggingface.co/google/reformer-enwik8
BART Base Text Summarization and Generation https://huggingface.co/facebook/bart-base
ProphetNet Base Future Event Prediction https://huggingface.co/microsoft/prophetnet-large-nli
PPLM Base Controlled Text Generation https://huggingface.co/decapoda-research/llama-7b-hf (Note: PPLM is not directly available; this link is for a similar model)
DeBERTa Base Advanced Sentiment Analysis https://huggingface.co/microsoft/deberta-base
DeBERTa Large High-Accuracy Sentiment Analysis https://huggingface.co/microsoft/deberta-large
XLM-R Base Multilingual Text Classification https://huggingface.co/xlm-r-100-base
XLM-R Large Advanced Multilingual Tasks https://huggingface.co/xlm-r-100-large
MarianMT Machine Translation https://huggingface.co/Helsinki-NLP/opus-mt-en-fr
CamemBERT French Language Tasks https://huggingface.co/camembert-base
FlauBERT French Language Tasks https://huggingface.co/flaubert/flaubert-base-uncased
DistilCamemBERT Efficient French Tasks https://huggingface.co/camembert/camembert-base (Note: DistilCamemBERT is not directly available; this link is for CamemBERT)
BART Large Advanced Text Summarization https://huggingface.co/facebook/bart-large
ProphetNet Large Advanced Future Event Prediction https://huggingface.co/microsoft/prophetnet-large-nli
T5 3B Large-Scale Text-to-Text Generation https://huggingface.co/t5-3b
T5 11B High-Capacity Text-to-Text Generation https://huggingface.co/t5-11b
LLaMA 7B Large-Scale General Tasks https://huggingface.co/decapoda-research/llama-7b-hf
LLaMA 13B High-Capacity General Tasks https://huggingface.co/decapoda-research/llama-13b-hf
OPT 175B Very Large-Scale General Tasks https://huggingface.co/facebook/opt-175b
OPT 2.7B Large-Scale General Tasks https://huggingface.co/facebook/opt-2.7b
OPT 6.7B High-Capacity General Tasks https://huggingface.co/facebook/opt-6.7b
OPT 13B Advanced General Tasks https://huggingface.co/facebook/opt-13b
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support