google/speech_commands
Updated • 3.72k • 59
How to use Zarakun/wav2vec with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("audio-classification", model="Zarakun/wav2vec") # Load model directly
from transformers import AutoProcessor, AutoModelForAudioClassification
processor = AutoProcessor.from_pretrained("Zarakun/wav2vec")
model = AutoModelForAudioClassification.from_pretrained("Zarakun/wav2vec")# Load model directly
from transformers import AutoProcessor, AutoModelForAudioClassification
processor = AutoProcessor.from_pretrained("Zarakun/wav2vec")
model = AutoModelForAudioClassification.from_pretrained("Zarakun/wav2vec")This model is a fine-tuned version of facebook/wav2vec2-base on the speech_commands dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
|---|---|---|---|---|---|
| 0.6895 | 1.0 | 639 | 0.7875 | 0.8773 | 0.7995 |
| 0.4171 | 2.0 | 1278 | 0.5445 | 0.8932 | 0.8675 |
| 0.2706 | 3.0 | 1917 | 0.4992 | 0.8939 | 0.8872 |
Base model
facebook/wav2vec2-base
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("audio-classification", model="Zarakun/wav2vec")