KEA Omnilingual-ASR β Kabuverdianu Fine-Tune
Fine-tuned version of omniASR_CTC_300M
for Kabuverdianu (kea) β Cape Verdean Creole, a low-resource language spoken by ~1 million people.
Model Details
Training Data
| Split |
Samples |
| Train |
2715 |
| Validation |
366 |
| Test |
864 |
Training Hyperparameters
| Parameter |
Value |
| Training steps |
N/A |
| Learning rate |
N/A |
| Mixed precision |
N/A |
| Gradient accumulation |
N/A batches |
| Encoder frozen for |
N/A steps |
| Min audio length |
N/A samples (2 s) |
| Max audio length |
N/A samples (60 s) |
| Max elements per batch |
N/A |
Evaluation Results
| Metric |
Value |
| WER (validation) |
N/A |
| Train CTC Loss |
N/A |
| Validation CTC Loss |
N/A |
Checkpoint Structure
checkpoint/sdp_00.pt β model weights (fairseq2 sharded format)
model.yaml β architecture definition
config.yaml β full training configuration
metrics/train.jsonl β per-step training metrics
metrics/valid.jsonl β per-step validation metrics (WER)