1Kingleo commited on
Commit
755e3c4
·
verified ·
1 Parent(s): dae2483

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -3
README.md CHANGED
@@ -1,3 +1,42 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ # Small Language Models Collection
5
+
6
+ Below is a list of small language models suitable for various tasks:
7
+
8
+ | Model Name | Task/Capability | Hugging Face Link |
9
+ |------------|-----------------|-------------------|
10
+ | BERT Base | General Text Classification | [https://huggingface.co/bert-base-uncased](https://huggingface.co/bert-base-uncased) |
11
+ | DistilBERT | Efficient Text Classification | [https://huggingface.co/distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) |
12
+ | RoBERTa Base | Advanced Text Classification | [https://huggingface.co/roberta-base](https://huggingface.co/roberta-base) |
13
+ | ALBERT Base | Efficient Large-Scale Classification | [https://huggingface.co/albert-base-v2](https://huggingface.co/albert-base-v2) |
14
+ | T5 Small | Text-to-Text Generation | [https://huggingface.co/t5-small](https://huggingface.co/t5-small) |
15
+ | T5 Base | General Text-to-Text Tasks | [https://huggingface.co/t5-base](https://huggingface.co/t5-base) |
16
+ | T5 Large | Advanced Text-to-Text Generation | [https://huggingface.co/t5-large](https://huggingface.co/t5-large) |
17
+ | Longformer Base | Long-Sequence Text Processing | [https://huggingface.co/longformer-base-4096](https://huggingface.co/longformer-base-4096) |
18
+ | BigBird Base | Long-Sequence Text Processing | [https://huggingface.co/google/bigbird-base-4096](https://huggingface.co/google/bigbird-base-4096) |
19
+ | Reformer Base | Efficient Long-Sequence Processing | [https://huggingface.co/google/reformer-enwik8](https://huggingface.co/google/reformer-enwik8) |
20
+ | BART Base | Text Summarization and Generation | [https://huggingface.co/facebook/bart-base](https://huggingface.co/facebook/bart-base) |
21
+ | ProphetNet Base | Future Event Prediction | [https://huggingface.co/microsoft/prophetnet-large-nli](https://huggingface.co/microsoft/prophetnet-large-nli) |
22
+ | PPLM Base | Controlled Text Generation | [https://huggingface.co/decapoda-research/llama-7b-hf](https://huggingface.co/decapoda-research/llama-7b-hf) (Note: PPLM is not directly available; this link is for a similar model) |
23
+ | DeBERTa Base | Advanced Sentiment Analysis | [https://huggingface.co/microsoft/deberta-base](https://huggingface.co/microsoft/deberta-base) |
24
+ | DeBERTa Large | High-Accuracy Sentiment Analysis | [https://huggingface.co/microsoft/deberta-large](https://huggingface.co/microsoft/deberta-large) |
25
+ | XLM-R Base | Multilingual Text Classification | [https://huggingface.co/xlm-r-100-base](https://huggingface.co/xlm-r-100-base) |
26
+ | XLM-R Large | Advanced Multilingual Tasks | [https://huggingface.co/xlm-r-100-large](https://huggingface.co/xlm-r-100-large) |
27
+ | MarianMT | Machine Translation | [https://huggingface.co/Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) |
28
+ | CamemBERT | French Language Tasks | [https://huggingface.co/camembert-base](https://huggingface.co/camembert-base) |
29
+ | FlauBERT | French Language Tasks | [https://huggingface.co/flaubert/flaubert-base-uncased](https://huggingface.co/flaubert/flaubert-base-uncased) |
30
+ | DistilCamemBERT | Efficient French Tasks | [https://huggingface.co/camembert/camembert-base](https://huggingface.co/camembert/camembert-base) (Note: DistilCamemBERT is not directly available; this link is for CamemBERT) |
31
+ | BART Large | Advanced Text Summarization | [https://huggingface.co/facebook/bart-large](https://huggingface.co/facebook/bart-large) |
32
+ | ProphetNet Large | Advanced Future Event Prediction | [https://huggingface.co/microsoft/prophetnet-large-nli](https://huggingface.co/microsoft/prophetnet-large-nli) |
33
+ | T5 3B | Large-Scale Text-to-Text Generation | [https://huggingface.co/t5-3b](https://huggingface.co/t5-3b) |
34
+ | T5 11B | High-Capacity Text-to-Text Generation | [https://huggingface.co/t5-11b](https://huggingface.co/t5-11b) |
35
+ | LLaMA 7B | Large-Scale General Tasks | [https://huggingface.co/decapoda-research/llama-7b-hf](https://huggingface.co/decapoda-research/llama-7b-hf) |
36
+ | LLaMA 13B | High-Capacity General Tasks | [https://huggingface.co/decapoda-research/llama-13b-hf](https://huggingface.co/decapoda-research/llama-13b-hf) |
37
+ | OPT 175B | Very Large-Scale General Tasks | [https://huggingface.co/facebook/opt-175b](https://huggingface.co/facebook/opt-175b) |
38
+ | OPT 2.7B | Large-Scale General Tasks | [https://huggingface.co/facebook/opt-2.7b](https://huggingface.co/facebook/opt-2.7b) |
39
+ | OPT 6.7B | High-Capacity General Tasks | [https://huggingface.co/facebook/opt-6.7b](https://huggingface.co/facebook/opt-6.7b) |
40
+ | OPT 13B | Advanced General Tasks | [https://huggingface.co/facebook/opt-13b](https://huggingface.co/facebook/opt-13b) |
41
+
42
+