legacy-datasets/common_voice
Updated • 1.53k • 144
How to use willcai/wav2vec2_common_voice_accents with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="willcai/wav2vec2_common_voice_accents") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("willcai/wav2vec2_common_voice_accents")
model = AutoModelForCTC.from_pretrained("willcai/wav2vec2_common_voice_accents")This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the common_voice dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 4.0135 | 5.33 | 400 | 1.3259 | 0.8067 |
| 0.5608 | 10.67 | 800 | 0.7832 | 0.5024 |
| 0.1441 | 16.0 | 1200 | 0.9309 | 0.4698 |
| 0.0724 | 21.33 | 1600 | 0.9750 | 0.4461 |
| 0.0444 | 26.67 | 2000 | 0.9095 | 0.4269 |