Whisper-si-experiments - a SPEAK-ASR Collection

updated Feb 8

SPEAK-ASR/whisper-si-exp-1

Updated Jan 31 • 1

Note: Dataset used: 1/3 of the OpenSLR dataset (~60 h). LoRA applied to the Query (Q) and Value (V) projection layers.
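
The Q/V choice above can be sketched as a mapping from layer letters to module names. This is a minimal illustration, not the configuration used for this model: the module names are those of the attention projections in the Hugging Face Transformers Whisper implementation, `lora_target_modules` is a hypothetical helper, and the rank, alpha, and dropout values are placeholders that do not come from the notes.

```python
# Hypothetical helper: maps the layer letters used in the notes to the
# attention projection module names in Hugging Face's Whisper implementation.
PROJECTION_MODULES = {
    "Q": "q_proj",    # query projection
    "K": "k_proj",    # key projection
    "V": "v_proj",    # value projection
    "O": "out_proj",  # attention output projection
}


def lora_target_modules(layers):
    """Return the module names that LoRA adapters would be attached to."""
    return [PROJECTION_MODULES[layer] for layer in layers]


# Q and V only, as described in the note.
qv_targets = lora_target_modules(["Q", "V"])

# Keyword arguments that could be passed to peft.LoraConfig(**lora_kwargs).
# ASSUMPTION: r, lora_alpha and lora_dropout are placeholder values,
# not the ones used in this experiment.
lora_kwargs = {
    "r": 8,
    "lora_alpha": 16,
    "lora_dropout": 0.05,
    "target_modules": qv_targets,
}

print(lora_kwargs["target_modules"])
```

Keeping the letter-to-module mapping in one place makes it easy to switch between layer sets by changing only the list passed to the helper.
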
SPEAK-ASR/whisper-si-exp-2

Updated Jan 31 • 2

Note: Uses `pranay-j/whisper-small-hindi` as the base model.
SPEAK-ASR/whisper-si-exp-3

Updated Jan 31 • 1

Note: Dataset used: full OpenSLR dataset (~180 h).
SPEAK-ASR/whisper-si-exp-4

Updated Feb 8 • 2

Note: Dataset used: OpenSLR + YouTube dataset (~185 h).
SPEAK-ASR/whisper-si-exp-5

Updated Jan 31 • 1

Note: Used WER instead of loss to choose the best model. Dataset used: OpenSLR + YouTube dataset (~185 h). Changed the LoRA projection layers to Q, V, Key (K), and Output (O).
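
Selecting by WER rather than loss can be sketched as follows. This is a minimal, self-contained illustration: `word_error_rate` is a plain word-level edit-distance implementation, and the checkpoint names and metric values are made-up numbers for demonstration, not results from these experiments. With the Hugging Face `Trainer`, this kind of selection is usually configured through `metric_for_best_model="wer"` and `greater_is_better=False`; whether the experiment did it that way is not stated.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


# ASSUMPTION: illustrative evaluation results, not real measurements.
eval_results = {
    "checkpoint-a": {"loss": 0.30, "wer": 0.42},
    "checkpoint-b": {"loss": 0.33, "wer": 0.38},
}

# Lower is better for both metrics, but they can disagree on the winner.
best_by_loss = min(eval_results, key=lambda name: eval_results[name]["loss"])
best_by_wer = min(eval_results, key=lambda name: eval_results[name]["wer"])

print(best_by_loss, best_by_wer)
```

The two criteria can pick different checkpoints because the loss is computed per token while WER is measured on the decoded transcript.
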
SPEAK-ASR/whisper-si-exp-6

Updated Feb 8 • 5

Note: Reverted the LoRA projection layers to Q and V. Changed num_epoch from 5 to 15.
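
The two changes in this note can be expressed as an override of a previous configuration. This is only a sketch: `ExperimentConfig` is a hypothetical container, and it assumes the run being reverted from used the Q, V, K, O layers with 5 epochs, as the notes suggest. In Hugging Face training arguments the epoch count is set through `num_train_epochs`.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ExperimentConfig:
    """Hypothetical container for the two settings named in the note."""
    lora_layers: tuple
    num_epoch: int


# ASSUMPTION: the preceding run used Q, V, K, O layers and 5 epochs.
previous = ExperimentConfig(lora_layers=("Q", "V", "K", "O"), num_epoch=5)

# Revert the LoRA layers to Q and V, and raise the epoch count to 15.
current = replace(previous, lora_layers=("Q", "V"), num_epoch=15)

print(current)
```
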