waveletdeboshir/whisper-base-ser-dusha

waveletdeboshir committed on Jul 8, 2025

Commit 9d3ad6a · verified · 1 Parent(s): c9a8a56

Update README.md

Files changed (1)

README.md +25 -25

@@ -9,29 +9,29 @@ tags:
 - SER
 - speech
 - emotion
-# model-index:
-# - name: Whisper-base for Speech Emotion Recognition in Russian
-#   results:
-#   - task:
-#       name: Audio Classification
-#       type: speech-emotion-recognition
-#     dataset:
-#       name: Sberdevices Dusha (crowd)
-#       type: SberDevices/Dusha
-#       args: ru
-#     metrics:
-#     - name: Test Weighted Accuracy
-#       type: acc
-#       value: 0.8364
-#     - name: Test F1 macro
-#       type: f1
-#       value: 0.8429
-#     - name: Test Recall macro
-#       type: recall
-#       value: 0.83
-#     - name: Test Precision macro
-#       type: precision
-#       value: 0.85
 metrics:
 - f1
 ---
@@ -40,7 +40,7 @@ Whisper-base encoder with classification head for speech emotion recognition.
 **Dusha dataset**: https://github.com/salute-developers/golos/tree/master/dusha
-**5 classes:**
 * angry 0
 * sad 1
 * neutral 2
@@ -49,7 +49,7 @@ Whisper-base encoder with classification head for speech emotion recognition.
 Model was fine-tuned on full Dusha-crowd with
 * augmentations Time Shift, Time Masking and Colored Noise;
-* Weighted batch sampler.
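
The weighted sampling mentioned in the list above (named `WeightedRandomSampler` in the updated README) can be illustrated with a minimal pure-Python sketch. It assumes per-example weights equal to the inverse class frequency, which the README does not state, and it uses made-up toy labels. The helper `inverse_frequency_weights` is hypothetical and not part of the model repository.

```python
import random
from collections import Counter


def inverse_frequency_weights(labels):
    """Return one sampling weight per example: 1 / (size of its class).

    Assumption: inverse class frequency is only one common weighting
    scheme; the README does not say which weights were actually used.
    """
    counts = Counter(labels)
    return [1.0 / counts[y] for y in labels]


# Toy, made-up labels: class 2 dominates, classes 0 and 1 are rare.
labels = [2] * 6 + [0] * 2 + [1] * 1
weights = inverse_frequency_weights(labels)

# Draw example indices with replacement, in proportion to the weights.
rng = random.Random(0)
indices = rng.choices(range(len(labels)), weights=weights, k=9000)

# Count how often each class was drawn; the counts come out roughly equal
# even though the raw label distribution is heavily skewed.
sampled = Counter(labels[i] for i in indices)
print(sampled)
```

In PyTorch, the same per-example weights would typically be passed to `torch.utils.data.WeightedRandomSampler(weights, num_samples, replacement=True)`, and the sampler handed to a `DataLoader` through its `sampler` argument.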
 ## Usage
 ```python

 - SER
 - speech
 - emotion
+model-index:
+- name: Whisper-base for Speech Emotion Recognition in Russian
+  results:
+  - task:
+      name: Audio Classification
+      type: speech-emotion-recognition
+    dataset:
+      name: Sberdevices Dusha (crowd)
+      type: SberDevices/Dusha
+      args: ru
+    metrics:
+    - name: Test Weighted Accuracy
+      type: acc
+      value: 0.8364
+    - name: Test F1 macro
+      type: f1
+      value: 0.8429
+    - name: Test Recall macro
+      type: recall
+      value: 0.83
+    - name: Test Precision macro
+      type: precision
+      value: 0.85
 metrics:
 - f1
 ---
 **Dusha dataset**: https://github.com/salute-developers/golos/tree/master/dusha
+**Multiclass classification into 5 classes:**
 * angry 0
 * sad 1
 * neutral 2
 Model was fine-tuned on full Dusha-crowd with
 * augmentations Time Shift, Time Masking and Colored Noise;
+* WeightedRandomSampler.
 ## Usage
 ```python