Adding `safetensors` variant of this model
#2
by
SFconvertbot
- opened
- README.md +27 -27
- eval_results/baseline_cs_dialogue.json +0 -0
- eval_results/baseline_emilia.json +0 -0
- eval_results/trained_cs_dialogue.json +0 -0
- eval_results/trained_emilia.json +0 -0
- eval_results/trained_seame.json +0 -0
- model.safetensors +3 -0
README.md
CHANGED
|
@@ -27,8 +27,8 @@ A fine-tuned version of [MERaLiON/MERaLiON-2-3B](https://huggingface.co/MERaLiON
|
|
| 27 |
| Benchmark | Baseline | This Model | Improvement |
|
| 28 |
|-----------|----------|------------|-------------|
|
| 29 |
| **SEAME** | 0.3372 | **0.2530** | **-25.0%** |
|
| 30 |
-
| **EMILIA** | 0.3201 | **0.
|
| 31 |
-
| **CS-Dialogue** | 0.
|
| 32 |
|
| 33 |
### Benchmark Descriptions
|
| 34 |
- **SEAME**: English-Mandarin code-switching conversational speech from Singapore/Malaysia (9,764 samples)
|
|
@@ -39,45 +39,45 @@ A fine-tuned version of [MERaLiON/MERaLiON-2-3B](https://huggingface.co/MERaLiON
|
|
| 39 |
|
| 40 |
Below are examples showing improvements from baseline to DPO-trained model:
|
| 41 |
|
| 42 |
-
### Example 1: Hallucination Fixed
|
| 43 |
| | Transcription |
|
| 44 |
|---|---|
|
| 45 |
-
| **Ground Truth** |
|
| 46 |
-
| **Baseline** |
|
| 47 |
-
| **This Model** |
|
| 48 |
-
| **MER** |
|
| 49 |
|
| 50 |
-
### Example 2: Code-
|
| 51 |
| | Transcription |
|
| 52 |
|---|---|
|
| 53 |
-
| **Ground Truth** |
|
| 54 |
-
| **Baseline** |
|
| 55 |
-
| **This Model** |
|
| 56 |
-
| **MER** |
|
| 57 |
|
| 58 |
-
### Example 3:
|
| 59 |
| | Transcription |
|
| 60 |
|---|---|
|
| 61 |
-
| **Ground Truth** |
|
| 62 |
-
| **Baseline** |
|
| 63 |
-
| **This Model** |
|
| 64 |
-
| **MER** |
|
| 65 |
|
| 66 |
-
### Example 4:
|
| 67 |
| | Transcription |
|
| 68 |
|---|---|
|
| 69 |
-
| **Ground Truth** |
|
| 70 |
-
| **Baseline** |
|
| 71 |
-
| **This Model** |
|
| 72 |
-
| **MER** |
|
| 73 |
|
| 74 |
-
### Example 5:
|
| 75 |
| | Transcription |
|
| 76 |
|---|---|
|
| 77 |
-
| **Ground Truth** |
|
| 78 |
-
| **Baseline** |
|
| 79 |
-
| **This Model** |
|
| 80 |
-
| **MER** |
|
| 81 |
|
| 82 |
## Training Configuration
|
| 83 |
|
|
|
|
| 27 |
| Benchmark | Baseline | This Model | Improvement |
|
| 28 |
|-----------|----------|------------|-------------|
|
| 29 |
| **SEAME** | 0.3372 | **0.2530** | **-25.0%** |
|
| 30 |
+
| **EMILIA** | 0.3201 | **0.3046** | **-4.8%** |
|
| 31 |
+
| **CS-Dialogue** | 0.2258 | 0.2541 | +12.5% |
|
| 32 |
|
| 33 |
### Benchmark Descriptions
|
| 34 |
- **SEAME**: English-Mandarin code-switching conversational speech from Singapore/Malaysia (9,764 samples)
|
|
|
|
| 39 |
|
| 40 |
Below are examples showing improvements from baseline to DPO-trained model:
|
| 41 |
|
| 42 |
+
### Example 1: Hallucination Fixed (Valentine's Day)
|
| 43 |
| | Transcription |
|
| 44 |
|---|---|
|
| 45 |
+
| **Ground Truth** | (呃) 我们 二月 多 有 valentine's day |
|
| 46 |
+
| **Baseline** | ah moment ah month ah month ah month ah month... *(repeated 250+ times)* |
|
| 47 |
+
| **This Model** | (呃) 我们二月多有 valentine's day |
|
| 48 |
+
| **MER** | 56.89 → **0.00** |
|
| 49 |
|
| 50 |
+
### Example 2: Repetition Fixed (Code-Switch Preserved)
|
| 51 |
| | Transcription |
|
| 52 |
|---|---|
|
| 53 |
+
| **Ground Truth** | it's to give yourself 一个 台阶 right |
|
| 54 |
+
| **Baseline** | You have to give yourself a a a a a a a a... *(repeated 500+ times)* |
|
| 55 |
+
| **This Model** | is to give yourself 一个台阶 right |
|
| 56 |
+
| **MER** | 56.56 → **0.11** |
|
| 57 |
|
| 58 |
+
### Example 3: Code-Switching Preserved
|
| 59 |
| | Transcription |
|
| 60 |
|---|---|
|
| 61 |
+
| **Ground Truth** | inside circle yah like 进出 进出 会 生病 的 leh |
|
| 62 |
+
| **Baseline** | And you say so could yeah like you can you can you can... *(repeated 500+ times)* |
|
| 63 |
+
| **This Model** | inside the circle ya like 进出进出会生病的 (leh) |
|
| 64 |
+
| **MER** | 39.31 → **0.15** |
|
| 65 |
|
| 66 |
+
### Example 4: Perfect Recovery from Repetition
|
| 67 |
| | Transcription |
|
| 68 |
|---|---|
|
| 69 |
+
| **Ground Truth** | 有 有 有 有 有 有 control 有 有 有 他们 要 control |
|
| 70 |
+
| **Baseline** | 有有有有有有有有有有有有有有有有有有有有... *(repeated 500+ times, no "control")* |
|
| 71 |
+
| **This Model** | 有有有有有有 control 有有有他们要 control |
|
| 72 |
+
| **MER** | 35.93 → **0.00** |
|
| 73 |
|
| 74 |
+
### Example 5: Technical Terms Preserved
|
| 75 |
| | Transcription |
|
| 76 |
|---|---|
|
| 77 |
+
| **Ground Truth** | 大部分 [哪] 大部分 是 triple e. 跟 computer en~ com~ computer [lah] |
|
| 78 |
+
| **Baseline** | 大部分呐大部分是跟跟跟跟跟跟跟... *(repeated 500+ times, lost "triple e" and "computer")* |
|
| 79 |
+
| **This Model** | 大部分 (啊) 大部分是 triple e 跟 computer (啊) computer (啦) |
|
| 80 |
+
| **MER** | 31.56 → **0.25** |
|
| 81 |
|
| 82 |
## Training Configuration
|
| 83 |
|
eval_results/baseline_cs_dialogue.json
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
eval_results/baseline_emilia.json
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
eval_results/trained_cs_dialogue.json
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
eval_results/trained_emilia.json
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
eval_results/trained_seame.json
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fda3d67efd6e3fc991b1b9e9057292a33889e42295a89da38b3ed0c9045156fa
|
| 3 |
+
size 8121505608
|