Instructions to use tbs17/MathBERT-custom with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use tbs17/MathBERT-custom with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="tbs17/MathBERT-custom")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("tbs17/MathBERT-custom") model = AutoModelForMaskedLM.from_pretrained("tbs17/MathBERT-custom") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -120,7 +120,7 @@ The BERT model was pretrained on pre-k to HS math curriculum (engageNY, Utah Mat
|
|
| 120 |
|
| 121 |
#### Training procedure
|
| 122 |
|
| 123 |
-
The texts are lowercased and tokenized using WordPiece and a vocabulary size of 30,522. The inputs of the model are then of the form:
|
| 124 |
|
| 125 |
```
|
| 126 |
[CLS] Sentence A [SEP] Sentence B [SEP]
|
|
|
|
| 120 |
|
| 121 |
#### Training procedure
|
| 122 |
|
| 123 |
+
The texts are lowercased and tokenized using WordPiece and a customized vocabulary size of 30,522. We use the ```bert_tokenizer``` from huggingface tokenizers library to generate a custom vocab file from our training raw math texts. The inputs of the model are then of the form:
|
| 124 |
|
| 125 |
```
|
| 126 |
[CLS] Sentence A [SEP] Sentence B [SEP]
|