LanceaKing/asvspoof2019
Updated • 280 • 4
How to use MattyB95/VIT-ASVspoof2019-ConstantQ-Synthetic-Voice-Detection with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("image-classification", model="MattyB95/VIT-ASVspoof2019-ConstantQ-Synthetic-Voice-Detection")
pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png") # Load model directly
from transformers import AutoImageProcessor, AutoModelForImageClassification
processor = AutoImageProcessor.from_pretrained("MattyB95/VIT-ASVspoof2019-ConstantQ-Synthetic-Voice-Detection")
model = AutoModelForImageClassification.from_pretrained("MattyB95/VIT-ASVspoof2019-ConstantQ-Synthetic-Voice-Detection")This model is a fine-tuned version of google/vit-base-patch16-224-in21k on the imagefolder dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
|---|---|---|---|---|---|---|---|
| 0.0383 | 1.0 | 3173 | 0.1192 | 0.9753 | 0.9864 | 0.9734 | 0.9997 |
| 0.0158 | 2.0 | 6346 | 0.0505 | 0.9888 | 0.9938 | 0.9911 | 0.9965 |
| 0.0021 | 3.0 | 9519 | 0.1042 | 0.9849 | 0.9917 | 0.9836 | 0.9998 |
Base model
google/vit-base-patch16-224-in21k