Instructions to use Master-AI-Lab/EnergyBERT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Master-AI-Lab/EnergyBERT with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="Master-AI-Lab/EnergyBERT")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("Master-AI-Lab/EnergyBERT") model = AutoModelForMaskedLM.from_pretrained("Master-AI-Lab/EnergyBERT") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -50,3 +50,21 @@ from transformers import pipeline
|
|
| 50 |
unmasker = pipeline('fill-mask', model='EnergyBERT')
|
| 51 |
unmasker("Hello I'm a <mask> model.")
|
| 52 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 50 |
unmasker = pipeline('fill-mask', model='EnergyBERT')
|
| 51 |
unmasker("Hello I'm a <mask> model.")
|
| 52 |
```
|
| 53 |
+
|
| 54 |
+
Training Details
|
| 55 |
+
|
| 56 |
+
## Training Data
|
| 57 |
+
|
| 58 |
+
<!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
| 59 |
+
|
| 60 |
+
1.2M Published full-text literature corpus from 2000 to 2021.
|
| 61 |
+
|
| 62 |
+
## Training Procedure
|
| 63 |
+
|
| 64 |
+
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
| 65 |
+
|
| 66 |
+
BERT is trained on two unsupervised tasks during its pre-training period: masked language modeling and next sentence prediction. A masked language model involves masking some of the input tokens at random and training the model to predict the masked tokens based on the context surrounding the input tokens. Next sentence prediction involves training the model to predict whether two sentences follow each other logically.
|
| 67 |
+
|
| 68 |
+
### Training Hyperparameters
|
| 69 |
+
|
| 70 |
+
- **Training regime:**
|