Adding `safetensors` variant of this model

#2
README.md CHANGED
@@ -27,8 +27,8 @@ A fine-tuned version of [MERaLiON/MERaLiON-2-3B](https://huggingface.co/MERaLiON
27
  | Benchmark | Baseline | This Model | Improvement |
28
  |-----------|----------|------------|-------------|
29
  | **SEAME** | 0.3372 | **0.2530** | **-25.0%** |
30
- | **EMILIA** | 0.3201 | **0.3041** | **-5.0%** |
31
- | **CS-Dialogue** | 0.2541 | **0.2258** | **-11.1%** |
32
 
33
  ### Benchmark Descriptions
34
  - **SEAME**: English-Mandarin code-switching conversational speech from Singapore/Malaysia (9,764 samples)
@@ -39,45 +39,45 @@ A fine-tuned version of [MERaLiON/MERaLiON-2-3B](https://huggingface.co/MERaLiON
39
 
40
  Below are examples showing improvements from baseline to DPO-trained model:
41
 
42
- ### Example 1: Hallucination Fixed
43
  | | Transcription |
44
  |---|---|
45
- | **Ground Truth** | 你们 一首 也是 一个 session [啊] [哦] [嗯] |
46
- | **Baseline** | 你们是一首歌也是教一个 session (oh) 我们也是 session 那个 sessional practice 的... *(hallucinated extra content)* |
47
- | **This Model** | 你们是一首歌也是教一个 session () (哦) |
48
- | **MER** | 2.20 → **0.07** |
49
 
50
- ### Example 2: Code-Switching Preserved (Maid)
51
  | | Transcription |
52
  |---|---|
53
- | **Ground Truth** | [啊] 然后 因为 我们 家里 有 一个 maid [吗] 我 妈妈 有请 一个 maid [mah] 那个 是 打扫 屋子 的 东西 这样 之类 [吗] that is why 可以 [咯] 因为 |
54
- | **Baseline** | (ah) 然后因为我们家里有一个 maid (mah) 妈妈就请一个 maid (mah) (mah) (mah)... *(repeated filler words)* |
55
- | **This Model** | (啊) 然后因为我们家里有一个 maid (mah) 我妈妈就请一个 maid (mah) 那个是打扫屋子的东西这样子 (leh) (mah) that's why 可以 (loh) 因为 |
56
- | **MER** | 1.02 → **0.17** |
57
 
58
- ### Example 3: English Location Preserved (Temasek Poly)
59
  | | Transcription |
60
  |---|---|
61
- | **Ground Truth** | temasek poly 那边 |
62
- | **Baseline** | 我住达马士科波利那边 *(transliterated to Chinese)* |
63
- | **This Model** | 我住 tamasek poly 那边 |
64
- | **MER** | 1.00 → **0.17** |
65
 
66
- ### Example 4: Code-Switching Preserved (Exam)
67
  | | Transcription |
68
  |---|---|
69
- | **Ground Truth** | like shit |
70
- | **Baseline** | 课程很课程很 like shit *(wrong Chinese characters)* |
71
- | **This Model** | 考得很 考得 like shit |
72
- | **MER** | 0.71 → **0.00** |
73
 
74
- ### Example 5: Mixed Language Preserved (Youth)
75
  | | Transcription |
76
  |---|---|
77
- | **Ground Truth** | not really youth [lah] 还是 youth 三十岁 |
78
- | **Baseline** | not really you (lah) 还是 you (lah) 三十岁 (oh) *(lost "youth")* |
79
- | **This Model** | not really youth (lah) 还是 youth 了三十岁 |
80
- | **MER** | 0.36 → **0.00** |
81
 
82
  ## Training Configuration
83
 
 
27
  | Benchmark | Baseline | This Model | Improvement |
28
  |-----------|----------|------------|-------------|
29
  | **SEAME** | 0.3372 | **0.2530** | **-25.0%** |
30
+ | **EMILIA** | 0.3201 | **0.3046** | **-4.8%** |
31
+ | **CS-Dialogue** | 0.2258 | 0.2541 | +12.5% |
32
 
33
  ### Benchmark Descriptions
34
  - **SEAME**: English-Mandarin code-switching conversational speech from Singapore/Malaysia (9,764 samples)
 
39
 
40
  Below are examples showing improvements from baseline to DPO-trained model:
41
 
42
+ ### Example 1: Hallucination Fixed (Valentine's Day)
43
  | | Transcription |
44
  |---|---|
45
+ | **Ground Truth** | (呃) 我们 二月 valentine's day |
46
+ | **Baseline** | ah moment ah month ah month ah month ah month... *(repeated 250+ times)* |
47
+ | **This Model** | () 我们二月多有 valentine's day |
48
+ | **MER** | 56.89 → **0.00** |
49
 
50
+ ### Example 2: Repetition Fixed (Code-Switch Preserved)
51
  | | Transcription |
52
  |---|---|
53
+ | **Ground Truth** | it's to give yourself 一个 台阶 right |
54
+ | **Baseline** | You have to give yourself a a a a a a a a... *(repeated 500+ times)* |
55
+ | **This Model** | is to give yourself 一个台阶 right |
56
+ | **MER** | 56.56 → **0.11** |
57
 
58
+ ### Example 3: Code-Switching Preserved
59
  | | Transcription |
60
  |---|---|
61
+ | **Ground Truth** | inside circle yah like 进出 进出 会 生病 的 leh |
62
+ | **Baseline** | And you say so could yeah like you can you can you can... *(repeated 500+ times)* |
63
+ | **This Model** | inside the circle ya like 进出进出会生病的 (leh) |
64
+ | **MER** | 39.31 → **0.15** |
65
 
66
+ ### Example 4: Perfect Recovery from Repetition
67
  | | Transcription |
68
  |---|---|
69
+ | **Ground Truth** | control 有 有 有 他们 要 control |
70
+ | **Baseline** | 有有有有有有有有有有有有有有有有有有有有... *(repeated 500+ times, no "control")* |
71
+ | **This Model** | 有有有有有有 control 有有有他们要 control |
72
+ | **MER** | 35.93 → **0.00** |
73
 
74
+ ### Example 5: Technical Terms Preserved
75
  | | Transcription |
76
  |---|---|
77
+ | **Ground Truth** | 大部分 [] 大部分 triple e. 跟 computer en~ com~ computer [lah] |
78
+ | **Baseline** | 大部分呐大部分是跟跟跟跟跟跟跟... *(repeated 500+ times, lost "triple e" and "computer")* |
79
+ | **This Model** | 大部分 (啊) 大部分是 triple e 跟 computer () computer (啦) |
80
+ | **MER** | 31.56 → **0.25** |
81
 
82
  ## Training Configuration
83
 
eval_results/baseline_cs_dialogue.json CHANGED
The diff for this file is too large to render. See raw diff
 
eval_results/baseline_emilia.json CHANGED
The diff for this file is too large to render. See raw diff
 
eval_results/trained_cs_dialogue.json CHANGED
The diff for this file is too large to render. See raw diff
 
eval_results/trained_emilia.json CHANGED
The diff for this file is too large to render. See raw diff
 
eval_results/trained_seame.json CHANGED
The diff for this file is too large to render. See raw diff
 
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fda3d67efd6e3fc991b1b9e9057292a33889e42295a89da38b3ed0c9045156fa
3
+ size 8121505608