nyu-mll/glue
Viewer • Updated • 1.49M • 470k • 497
How to use pszemraj/distilroberta-base-CoLA with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="pszemraj/distilroberta-base-CoLA") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("pszemraj/distilroberta-base-CoLA")
model = AutoModelForSequenceClassification.from_pretrained("pszemraj/distilroberta-base-CoLA")This model is a fine-tuned version of distilroberta-base on the GLUE COLA dataset. It achieves the following results on the evaluation set:
Mostly as a litmus test to see how it fares vs. the textattack one (should be similar) & associated metrics:
{
"epoch": 4.0,
"eval_loss": 0.49744734168052673,
"eval_matthews_correlation": 0.5678267214677118,
"eval_runtime": 1.9223,
"eval_samples": 1043,
"eval_samples_per_second": 542.586,
"eval_steps_per_second": 135.777
}
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
|---|---|---|---|---|
| 0.4778 | 1.0 | 67 | 0.4630 | 0.5161 |
| 0.4356 | 2.0 | 134 | 0.4725 | 0.5287 |
| 0.2934 | 3.0 | 201 | 0.4974 | 0.5678 |
| 0.1998 | 4.0 | 268 | 0.5419 | 0.5584 |