File size: 2,384 Bytes
205b97e ea40c2e 205b97e ea40c2e | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 | ---
license: other
language:
- en
tags:
- automatic-speech-recognition
- speech-recognition
- state-space-models
- mamba
- ctc
- librispeech
- pytorch
- masters-thesis
pretty_name: SSM-AST thesis checkpoints
---
# SSM-AST: State Space Models for Automatic Speech Transcription
This repository contains model artifacts for the master's thesis **βState Space Models for Automatic Speech Transcription.β** It provides the trained acoustic encoder checkpoints, language-model checkpoints, n-gram training text, and selected training logs used to support evaluation of a pure State Space Model (SSM) automatic speech transcription pipeline on LibriSpeech.
The code for training and evaluation is maintained separately. This Hugging Face repository is intended as a checkpoint and artifact archive so that the thesis evaluation pipeline can be run without retraining the full models from scratch.
## Repository contents
```text
SSM-AST/
βββ datasets/
β βββ librispeecm_lm_dataset_pre-processed_char_level_text.txt
βββ encoder_checkpoints/
β βββ enc_mamba3_460h_checkpoint_best_epoch=49_val_wer=0.255.ckpt
β βββ enc_mamba3_960h_checkpoint_best_epoch=91_val_wer=0.186.ckpt
β βββ enc_mamba_460h_checkpoint_best_epoch=49_val_wer=0.227.ckpt
β βββ enc_mamba_960h_checkpoint_best_epoch=100_val_wer=0.155.ckpt
β βββ enc_ssssm_460h_checkpoint_best_epoch=49_val_wer=0.197.ckpt
β βββ enc_ssssm_960h_checkpoint_best_epoch=100_val_wer=0.111.ckpt
β βββ enc_ssssm_960h_checkpoint_best_epoch=98_val_wer=0.111.ckpt
βββ lm_checkpoints/
β βββ lm_mamba3_checkpoint_MaxChars-1000000000_ds-64_d320_L18.pt
β βββ lm_mamba_checkpoint_MaxChars-1000000000_d320_L18.pt
β βββ lm_ngram_checkpoint_char_10gram.pkl
βββ log files/
β βββ 0utput_exp-mamba-1_960h_W-320_D-48_S-16_B-128_E-100.txt
β βββ 0utput_exp-mamba3_460h_W-512_D-30_S-16_B-64_E-50.txt
β βββ 0utput_exp-mamba3_enc_960h_W-320_D-48_S-16_b-64_E-100.txt
β βββ 0utput_exp-mamba_dt_bias_hier_460h_W-512_D-30_S-16.txt
β βββ 0utput_exp-v75_460h_hier_gating_256_42.txt
β βββ 0utput_exp-v77_960h_hier_gating_320_48.txt
β βββ mamba3_elm_training.log
β βββ mamba_elm_training.log
βββ .gitattributes
βββ README.md
``` |