models
Collection
3 items • Updated • 1
WeNet U2++ Conformer model trained on google/fleurs-mn
for Mongolian (Cyrillic) automatic speech recognition.
google/fleurs-mnruns/).| File | Description |
|---|---|
avg_10.pt |
Best model (averaged top-10 checkpoints by default) |
train.yaml |
Training config |
lang_char.txt |
Character vocabulary (38 tokens) |
global_cmvn |
Feature normalization stats |
train.log |
Full training log |
runs/ |
TensorBoard events |
python wenet/bin/recognize.py
--config train.yaml
--checkpoint avg_10.pt
--dict lang_char.txt
--test_data your_data.list
--mode attention_rescoring
--beam_size 10
--result_file result.txt
## Limitations
- Trained on ~11.5 h of FLEURS Mongolian — small-scale; WER/CER will be relatively high on out-of-domain speech.
- Only Cyrillic script supported; Latin characters and digits are stripped.
- No language model rescoring applied.