Emo-Codec/CREMA-D_synth
Viewer • Updated • 112k • 85 • 1
How to use kushalballari/distilhubert-tone-classification with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("audio-classification", model="kushalballari/distilhubert-tone-classification") # Load model directly
from transformers import AutoProcessor, AutoModelForAudioClassification
processor = AutoProcessor.from_pretrained("kushalballari/distilhubert-tone-classification")
model = AutoModelForAudioClassification.from_pretrained("kushalballari/distilhubert-tone-classification")This model is a fine-tuned version of ntu-spml/distilhubert on the CREMA-D dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|---|---|---|
| 1.339 | 1.0 | 442 | 1.3491 | 0.4987 | 0.5533 | 0.4987 | 0.4664 |
| 1.0008 | 2.0 | 884 | 1.0219 | 0.6408 | 0.6668 | 0.6408 | 0.6373 |
| 0.7673 | 3.0 | 1326 | 0.9572 | 0.6676 | 0.6870 | 0.6676 | 0.6557 |
| 0.5888 | 4.0 | 1768 | 0.8830 | 0.6890 | 0.6930 | 0.6890 | 0.6889 |
| 0.4396 | 5.0 | 2210 | 1.0893 | 0.6810 | 0.7064 | 0.6810 | 0.6738 |
| 0.2987 | 6.0 | 2652 | 1.0561 | 0.6810 | 0.6892 | 0.6810 | 0.6738 |
| 0.2009 | 7.0 | 3094 | 1.1421 | 0.6836 | 0.6944 | 0.6836 | 0.6769 |
| 0.1345 | 8.0 | 3536 | 1.1479 | 0.7024 | 0.7037 | 0.7024 | 0.6970 |
Base model
ntu-spml/distilhubert