LaelaZ commited on
Commit
e4f7cbb
·
verified ·
1 Parent(s): 0772f5d

Model card: voice pass (drop ship tell, em-dashes out); metrics and tables unchanged

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -83,7 +83,7 @@ mistakes), which is where the model's real limits show.
83
  ## Error analysis
84
 
85
  A real confusion matrix and per-class breakdown on the **full held-out test set (2,000
86
- examples)**, regenerated from the shipped weights with `python -m emotion.error_report`.
87
 
88
  ![Confusion matrix](assets/confusion_matrix.png)
89
 
@@ -114,10 +114,10 @@ examples)**, regenerated from the shipped weights with `python -m emotion.error_
114
  **Where it fails.** The single largest error axis is **joy ↔ love** (28 + 28 mutual
115
  misclassifications): both are short, affect-positive messages, so the model leans toward the
116
  higher-frequency neighbour. The rarest class, `surprise` (n=66), leaks mainly into `fear` (12)
117
- and `joy` (7). The mistakes are semantically adjacent rather than random the model learned the
118
  manifold and is mostly losing the low-support classes, not misfiring broadly.
119
 
120
- **Confidently wrong (highest-confidence mistakes)** the cases the model got wrong *and* was
121
  sure about, the slice worth reading:
122
 
123
  | true | predicted | conf | text |
 
83
  ## Error analysis
84
 
85
  A real confusion matrix and per-class breakdown on the **full held-out test set (2,000
86
+ examples)**, regenerated from the trained weights with `python -m emotion.error_report`.
87
 
88
  ![Confusion matrix](assets/confusion_matrix.png)
89
 
 
114
  **Where it fails.** The single largest error axis is **joy ↔ love** (28 + 28 mutual
115
  misclassifications): both are short, affect-positive messages, so the model leans toward the
116
  higher-frequency neighbour. The rarest class, `surprise` (n=66), leaks mainly into `fear` (12)
117
+ and `joy` (7). The mistakes are semantically adjacent rather than random. The model learned the
118
  manifold and is mostly losing the low-support classes, not misfiring broadly.
119
 
120
+ **Confidently wrong (highest-confidence mistakes):** the cases the model got wrong *and* was
121
  sure about, the slice worth reading:
122
 
123
  | true | predicted | conf | text |