nyu-mll/glue
Viewer • Updated • 1.49M • 492k • 499
How to use gokuls/mobilebert_sa_GLUE_Experiment_logit_kd_data_aug_sst2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="gokuls/mobilebert_sa_GLUE_Experiment_logit_kd_data_aug_sst2") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("gokuls/mobilebert_sa_GLUE_Experiment_logit_kd_data_aug_sst2")
model = AutoModelForSequenceClassification.from_pretrained("gokuls/mobilebert_sa_GLUE_Experiment_logit_kd_data_aug_sst2")This model is a fine-tuned version of google/mobilebert-uncased on the GLUE SST2 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 0.4923 | 1.0 | 8748 | 0.5804 | 0.8314 |
| 0.3226 | 2.0 | 17496 | 0.5184 | 0.8475 |
| 0.2725 | 3.0 | 26244 | 0.5341 | 0.8509 |
| 0.2453 | 4.0 | 34992 | 0.4892 | 0.8521 |
| 0.2278 | 5.0 | 43740 | 0.4834 | 0.8601 |
| 0.2149 | 6.0 | 52488 | 0.4980 | 0.8624 |
| 0.2047 | 7.0 | 61236 | 0.5031 | 0.8532 |
| 0.1963 | 8.0 | 69984 | 0.5011 | 0.8509 |
| 0.1893 | 9.0 | 78732 | 0.4899 | 0.8567 |
| 0.1835 | 10.0 | 87480 | 0.4965 | 0.8589 |