Update README.md
README.md
---
license: apache-2.0
---
# Small Language Models Collection

Below is a list of small language models suitable for various tasks:

| Model Name | Task/Capability | Hugging Face Link |
|------------|-----------------|-------------------|
| BERT Base | General Text Classification | [https://huggingface.co/bert-base-uncased](https://huggingface.co/bert-base-uncased) |
| DistilBERT | Efficient Text Classification | [https://huggingface.co/distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) |
| RoBERTa Base | Advanced Text Classification | [https://huggingface.co/roberta-base](https://huggingface.co/roberta-base) |
| ALBERT Base | Efficient Large-Scale Classification | [https://huggingface.co/albert-base-v2](https://huggingface.co/albert-base-v2) |
| T5 Small | Text-to-Text Generation | [https://huggingface.co/t5-small](https://huggingface.co/t5-small) |
| T5 Base | General Text-to-Text Tasks | [https://huggingface.co/t5-base](https://huggingface.co/t5-base) |
| T5 Large | Advanced Text-to-Text Generation | [https://huggingface.co/t5-large](https://huggingface.co/t5-large) |
| Longformer Base | Long-Sequence Text Processing | [https://huggingface.co/allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) |
| BigBird Base | Long-Sequence Text Processing | [https://huggingface.co/google/bigbird-roberta-base](https://huggingface.co/google/bigbird-roberta-base) |
| Reformer Base | Efficient Long-Sequence Processing | [https://huggingface.co/google/reformer-enwik8](https://huggingface.co/google/reformer-enwik8) |
| BART Base | Text Summarization and Generation | [https://huggingface.co/facebook/bart-base](https://huggingface.co/facebook/bart-base) |
| ProphetNet Base | Sequence-to-Sequence Generation (future n-gram prediction objective) | [https://huggingface.co/microsoft/prophetnet-large-uncased](https://huggingface.co/microsoft/prophetnet-large-uncased) (Note: this link is for the large checkpoint) |
| PPLM Base | Controlled Text Generation | [https://huggingface.co/decapoda-research/llama-7b-hf](https://huggingface.co/decapoda-research/llama-7b-hf) (Note: PPLM is a controlled-generation technique rather than a standalone checkpoint; this link is for a general-purpose language model) |
| DeBERTa Base | Advanced Sentiment Analysis | [https://huggingface.co/microsoft/deberta-base](https://huggingface.co/microsoft/deberta-base) |
| DeBERTa Large | High-Accuracy Sentiment Analysis | [https://huggingface.co/microsoft/deberta-large](https://huggingface.co/microsoft/deberta-large) |
| XLM-R Base | Multilingual Text Classification | [https://huggingface.co/xlm-roberta-base](https://huggingface.co/xlm-roberta-base) |
| XLM-R Large | Advanced Multilingual Tasks | [https://huggingface.co/xlm-roberta-large](https://huggingface.co/xlm-roberta-large) |
| MarianMT | Machine Translation | [https://huggingface.co/Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) |
| CamemBERT | French Language Tasks | [https://huggingface.co/camembert-base](https://huggingface.co/camembert-base) |
| FlauBERT | French Language Tasks | [https://huggingface.co/flaubert/flaubert_base_uncased](https://huggingface.co/flaubert/flaubert_base_uncased) |
| DistilCamemBERT | Efficient French Tasks | [https://huggingface.co/camembert/camembert-base](https://huggingface.co/camembert/camembert-base) (Note: this link is for CamemBERT, not the distilled variant) |
| BART Large | Advanced Text Summarization | [https://huggingface.co/facebook/bart-large](https://huggingface.co/facebook/bart-large) |
| ProphetNet Large | Advanced Sequence-to-Sequence Generation | [https://huggingface.co/microsoft/prophetnet-large-uncased](https://huggingface.co/microsoft/prophetnet-large-uncased) |
| T5 3B | Large-Scale Text-to-Text Generation | [https://huggingface.co/t5-3b](https://huggingface.co/t5-3b) |
| T5 11B | High-Capacity Text-to-Text Generation | [https://huggingface.co/t5-11b](https://huggingface.co/t5-11b) |
| LLaMA 7B | Large-Scale General Tasks | [https://huggingface.co/decapoda-research/llama-7b-hf](https://huggingface.co/decapoda-research/llama-7b-hf) |
| LLaMA 13B | High-Capacity General Tasks | [https://huggingface.co/decapoda-research/llama-13b-hf](https://huggingface.co/decapoda-research/llama-13b-hf) |
| OPT 175B | Very Large-Scale General Tasks | [https://huggingface.co/facebook/opt-175b](https://huggingface.co/facebook/opt-175b) |
| OPT 2.7B | Large-Scale General Tasks | [https://huggingface.co/facebook/opt-2.7b](https://huggingface.co/facebook/opt-2.7b) |
| OPT 6.7B | High-Capacity General Tasks | [https://huggingface.co/facebook/opt-6.7b](https://huggingface.co/facebook/opt-6.7b) |
| OPT 13B | Advanced General Tasks | [https://huggingface.co/facebook/opt-13b](https://huggingface.co/facebook/opt-13b) |

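
The links in the list can be turned into the repository IDs that the `transformers` library expects. The following is a minimal sketch, not part of the original README: the helper names `repo_id_from_url` and `load_model` are hypothetical, and `load_model` assumes `transformers` is installed and that the linked repository is publicly accessible. `AutoModel` loads the base model without a task-specific head, so a classification or generation task would likely need a different `Auto*` class.

```python
from urllib.parse import urlparse

HUB_HOST = "huggingface.co"


def repo_id_from_url(url: str) -> str:
    """Return the Hub repository ID for a model URL.

    Hypothetical helper: 'https://huggingface.co/facebook/bart-base'
    becomes 'facebook/bart-base'.
    """
    parsed = urlparse(url)
    if parsed.netloc != HUB_HOST:
        raise ValueError(f"not a Hugging Face Hub URL: {url}")
    # A model path is either 'name' or 'organization/name'.
    parts = [p for p in parsed.path.split("/") if p]
    if not 1 <= len(parts) <= 2:
        raise ValueError(f"unexpected model path: {parsed.path}")
    return "/".join(parts)


def load_model(url: str):
    """Load the tokenizer and base model for one link (downloads weights on first use)."""
    # Imported lazily so repo_id_from_url works without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    repo_id = repo_id_from_url(url)
    return AutoTokenizer.from_pretrained(repo_id), AutoModel.from_pretrained(repo_id)


if __name__ == "__main__":
    print(repo_id_from_url("https://huggingface.co/facebook/bart-base"))  # facebook/bart-base
    print(repo_id_from_url("https://huggingface.co/t5-small"))            # t5-small
```

Calling `load_model("https://huggingface.co/distilbert-base-uncased")` would download the checkpoint, so the smaller entries are the better starting point for a quick check.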