Instructions to use olm/olm-roberta-base-oct-2022 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use olm/olm-roberta-base-oct-2022 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="olm/olm-roberta-base-oct-2022")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("olm/olm-roberta-base-oct-2022") model = AutoModelForMaskedLM.from_pretrained("olm/olm-roberta-base-oct-2022") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -89,4 +89,4 @@ The model achieves the following results after tuning on GLUE tasks:
|
|
| 89 |
|wnli |acc |0.4000 |**0.5042** |
|
| 90 |
|
| 91 |
For both the original BERT and our model, we used the Hugging Face run_glue.py script [here](https://github.com/huggingface/transformers/tree/main/examples/pytorch/text-classification).
|
| 92 |
-
For both models, we used the default fine-tuning hyperparameters and we averaged the results over five training seeds.
|
|
|
|
| 89 |
|wnli |acc |0.4000 |**0.5042** |
|
| 90 |
|
| 91 |
For both the original BERT and our model, we used the Hugging Face run_glue.py script [here](https://github.com/huggingface/transformers/tree/main/examples/pytorch/text-classification).
|
| 92 |
+
For both models, we used the default fine-tuning hyperparameters and we averaged the results over five training seeds. These are the results for the GLUE dev sets, which can be a bit different than the results for the test sets.
|