Initial

Files changed (9) hide show

.gitattributes CHANGED Viewed

@@ -25,3 +25,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zstandard filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zstandard filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.arpa filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

+---
+license: mit
+language: en
+tags:
+- speech-to-text
+---
+# Exported [Nemo](https://github.com/NVIDIA/NeMo) models for Speech to Text with [OpenSLR 11](https://www.openslr.org/11/) librispeech 3-gram language model
+This model is intended to be used with [npc-engine](https://github.com/npc-engine/npc-engine).

config.yml ADDED Viewed

+model_type: "NemoSTT"
+# frame size in ms for incremental transcription
+frame_size: 1000
+# Parameters from https://github.com/NVIDIA/NeMo/blob/stable/tutorials/asr/Online_ASR_Microphone_Demo.ipynb
+frame_overlap: 2
+offset: 4
+# timestep_duration = model._cfg.preprocessor['window_stride']
+# for block in model._cfg.encoder['jasper']:
+#     timestep_duration *= block['stride'][0] ** block['repeat']
+timestep_duration: 0.02
+# Sample rate
+sample_rate: 16000
+# Minimum detectable VAD section in ms
+min_speech_duration: 400
+# Timeout in ms to flush results if speech wasn't finished semantically
+max_silence_duration: 1000
+# VAD frame size in ms
+vad_frame_ms: 20
+transcribe_realtime: False
+predict_punctuation: False
+alpha: 0.0253813572180912
+beta: 0.08

ctc.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c0d72eb87f56c10bc51952ee327d45f76ebfbd30e21f828cd3fab18ff3212f10
+size 75577820

lowercase_3-gram.pruned.1e-7.arpa ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:61dd499d412fb7493b093d846e055587985272373808b4b0316ee76dee5805bb
+size 40314194

punctuation.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d3c64263783470c92264dcd2d9279c7ba2e45818b5d9ff0bbed987def578ae0f
+size 265505964

sentence_prediction.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:fa2fb314443b18f1f0d82f9845f0cef49fd92d98a6a314c535e6717127f21500
+size 90894457

sentence_tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff