nyu-mll/glue
Viewer • Updated • 1.49M • 466k • 496
How to use muhtasham/bert-tiny-target-cola with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="muhtasham/bert-tiny-target-cola") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("muhtasham/bert-tiny-target-cola")
model = AutoModelForSequenceClassification.from_pretrained("muhtasham/bert-tiny-target-cola")This model is a fine-tuned version of google/bert_uncased_L-2_H-128_A-2 on the glue dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
|---|---|---|---|---|
| 0.6124 | 1.87 | 500 | 0.6192 | 0.0 |
| 0.6016 | 3.73 | 1000 | 0.6167 | 0.0 |
| 0.5838 | 5.6 | 1500 | 0.6166 | 0.0149 |
| 0.5555 | 7.46 | 2000 | 0.6344 | 0.0465 |
| 0.5272 | 9.33 | 2500 | 0.6542 | 0.1399 |
| 0.5058 | 11.19 | 3000 | 0.6626 | 0.1458 |
| 0.4791 | 13.06 | 3500 | 0.6868 | 0.1192 |
| 0.4577 | 14.93 | 4000 | 0.7215 | 0.1230 |
| 0.4425 | 16.79 | 4500 | 0.7322 | 0.1243 |