legacy-datasets/common_voice
Updated • 1.53k • 144
How to use rossevine/Model_G_Wav2Vec2_Version3 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="rossevine/Model_G_Wav2Vec2_Version3") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("rossevine/Model_G_Wav2Vec2_Version3")
model = AutoModelForCTC.from_pretrained("rossevine/Model_G_Wav2Vec2_Version3")# Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("rossevine/Model_G_Wav2Vec2_Version3")
model = AutoModelForCTC.from_pretrained("rossevine/Model_G_Wav2Vec2_Version3")This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 3.7935 | 5.97 | 400 | 0.6443 | 0.6220 | 0.1605 |
| 0.2843 | 11.94 | 800 | 0.5294 | 0.4286 | 0.1090 |
| 0.1364 | 17.91 | 1200 | 0.4766 | 0.3774 | 0.0969 |
| 0.0914 | 23.88 | 1600 | 0.4960 | 0.3408 | 0.0880 |
| 0.0662 | 29.85 | 2000 | 0.4863 | 0.3321 | 0.0851 |
Base model
facebook/wav2vec2-large-xlsr-53
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="rossevine/Model_G_Wav2Vec2_Version3")