---
license: cc-by-4.0
pipeline_tag: audio-classification
---

# BUD-E Whisper: Emotional Speech Captioning Model

**BUD-E Whisper** is a suite of Whisper models fine-tuned for **direct emotional speech captioning**, as introduced in [https://arxiv.org/abs/2506.09827](https://arxiv.org/abs/2506.09827). The core models are built upon OpenAI's Whisper architecture, with the current primary variant being a fine-tune of **OpenAI Whisper Small**. These models are designed to generate text captions that not only transcribe speech but also inherently reflect its emotional content.
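Since the model follows the standard Whisper architecture, inference can likely be sketched with the usual Hugging Face `transformers` Whisper classes. The Hub repo id `laion/BUD-E-Whisper` used below is an assumption not stated on this card; substitute the actual checkpoint id.

```python
# Hedged sketch: the repo id "laion/BUD-E-Whisper" is an assumption, and the
# standard transformers Whisper API is assumed to apply to this fine-tune.
def caption_speech(wav_path: str, model_id: str = "laion/BUD-E-Whisper") -> str:
    """Generate an emotion-aware caption for a single speech recording."""
    import librosa  # audio loading / resampling
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    processor = WhisperProcessor.from_pretrained(model_id)
    model = WhisperForConditionalGeneration.from_pretrained(model_id)

    # Whisper expects 16 kHz mono input.
    audio, sr = librosa.load(wav_path, sr=16_000, mono=True)
    inputs = processor(audio, sampling_rate=sr, return_tensors="pt")

    generated_ids = model.generate(inputs.input_features)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling `caption_speech("clip.wav")` would return a single caption string, provided `transformers` and `librosa` are installed and the checkpoint id is correct.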

The embeddings generated by BUD-E Whisper can also serve as input for **Empathic Insight - Voice**, a downstream ensemble of Multi-Layer Perceptrons (MLPs) designed to predict dimensional emotion scores.