Commit · 37ffa6c
1 Parent(s): c310df5
add training sample spec
README.md CHANGED
@@ -26,7 +26,7 @@ metrics:
 # Model Card for **ASR** (CTC-based ASR on English)
 
 <!-- Provide a quick summary of what the model is/does. -->
-This repository contains an end‑to‑end **Automatic Speech Recognition (ASR)** pipeline built around Hugging Face Transformers. The default configuration fine‑tunes **`facebook/wav2vec2-base-960h`** with a **CTC** head on **Common Voice 17.0 (English)** and provides scripts to **train, evaluate, export to ONNX, and deploy on AWS SageMaker**. It also includes a robust audio loading stack (FFmpeg preferred, with fallbacks) and utilities for text normalization and evaluation (WER/CER).
+This repository contains an end‑to‑end **Automatic Speech Recognition (ASR)** pipeline built around Hugging Face Transformers. The default configuration fine‑tunes **`facebook/wav2vec2-base-960h`** with a **CTC** head on 50k sub sample of **Common Voice 17.0 (English)** and provides scripts to **train, evaluate, export to ONNX, and deploy on AWS SageMaker**. It also includes a robust audio loading stack (FFmpeg preferred, with fallbacks) and utilities for text normalization and evaluation (WER/CER).
 
 ## Model Details
 
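The change above states that training uses a 50k subsample of Common Voice 17.0 (English), but the diff does not show how that subsample is drawn. The following is only a minimal sketch of one reproducible way to pick such a subset; the function name `subsample_indices`, the seed, and the `n_total` value are hypothetical and not taken from the repository.

```python
import random


def subsample_indices(n_total: int, n_keep: int = 50_000, seed: int = 0) -> list[int]:
    """Return a sorted, reproducible random subset of row indices.

    Assumption: rows are sampled uniformly without replacement; the
    repository may select its 50k examples differently.
    """
    if n_keep >= n_total:
        # Fewer rows than requested: keep everything.
        return list(range(n_total))
    rng = random.Random(seed)  # local RNG so global random state is untouched
    return sorted(rng.sample(range(n_total), n_keep))


# Hypothetical split size, used here only for illustration.
indices = subsample_indices(n_total=1_000_000)
```

With the Hugging Face `datasets` library, a list like this could be passed to `Dataset.select(indices)`; an equivalent approach would be `dataset.shuffle(seed=...).select(range(50_000))`. Fixing the seed keeps the subset identical across runs, which makes reported WER/CER numbers comparable.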