Update README.md
README.md
CHANGED
@@ -9,9 +9,12 @@ datasets:
 ---
 
 ## Model description
+CamemBERT is a state-of-the-art language model for French based on the RoBERTa model.
+It is now available on Hugging Face in 6 different versions that vary in number of parameters, amount of pretraining data, and pretraining data source domains.
 ## Intended uses & limitations
 ## How to use
 ## Limitations and bias
 ## Training data
+OSCAR, or Open Super-large Crawled Aggregated coRpus, is a multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the Ungoliant architecture.
 ## Training procedure
 ## Evaluation results
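
The "Training data" line added in this commit describes OSCAR as the result of language classification and filtering of Common Crawl. As a rough illustration of that idea only, here is a minimal Python sketch. It assumes a toy stopword-overlap scorer in place of a real language classifier; `STOPWORDS`, `guess_language`, `filter_french`, and the `min_score` / `min_tokens` thresholds are hypothetical names and values introduced for this example. It is not the Ungoliant pipeline and does not reproduce its classifier or its filters.

```python
import re

# Hypothetical, tiny stopword lists standing in for a real language classifier.
STOPWORDS = {
    "fr": {"le", "la", "les", "de", "des", "et", "est", "un", "une", "dans"},
    "en": {"the", "of", "and", "is", "a", "an", "in", "to", "that", "it"},
}

# Matches runs of letters (including accented ones), skipping digits and underscores.
WORD = re.compile(r"[^\W\d_]+")


def guess_language(text):
    """Return (language, score), where score is the share of tokens that are
    stopwords of the best-matching language. Returns (None, 0.0) for empty input."""
    tokens = WORD.findall(text.lower())
    if not tokens:
        return None, 0.0
    scores = {
        lang: sum(token in words for token in tokens) / len(tokens)
        for lang, words in STOPWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


def filter_french(documents, min_score=0.2, min_tokens=5):
    """Keep documents classified as French that are long enough and score high enough."""
    kept = []
    for doc in documents:
        lang, score = guess_language(doc)
        n_tokens = len(WORD.findall(doc))
        if lang == "fr" and score >= min_score and n_tokens >= min_tokens:
            kept.append(doc)
    return kept


# A made-up "crawl" of four documents.
crawl = [
    "Le chat est dans la maison et le chien est dans le jardin.",
    "The cat is in the house and the dog is in the garden.",
    "Le la",      # French-looking but too short to keep
    "12345 !!!",  # no word tokens at all
]
french_only = filter_french(crawl)
```

With these toy inputs, only the first document survives: the second is classified as English, the third falls below `min_tokens`, and the fourth has no tokens to classify. A real corpus pipeline would swap the stopword scorer for a trained classifier and add quality filters on top.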