keithito/lj_speech
Updated • 1.48k • 62
How to use keyi10/wav2vec2-model-training with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="keyi10/wav2vec2-model-training") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("keyi10/wav2vec2-model-training")
model = AutoModelForCTC.from_pretrained("keyi10/wav2vec2-model-training")# Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("keyi10/wav2vec2-model-training")
model = AutoModelForCTC.from_pretrained("keyi10/wav2vec2-model-training")This model is a fine-tuned version of facebook/wav2vec2-base on the lj_speech dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 3.5099 | 1.52 | 500 | 1.1323 | 0.6740 |
| 0.3293 | 3.05 | 1000 | 0.1430 | 0.1851 |
| 0.1047 | 4.57 | 1500 | 0.1258 | 0.1446 |
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="keyi10/wav2vec2-model-training")