RobotsMali
/

reward-model

Tabular Regression

Model card Files Files and versions

Panga-Azazia commited on Nov 10

Commit

60ea63e

·

verified ·

1 Parent(s): 4b52aef

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ metrics:
 # Description
 This model is a Reward Model trained on the [RobotsMali transcription scorer dataset](https://huggingface.co/datasets/RobotsMali/transcription-scorer), where the scores were assigned by human annotators.
-It predicts a continuous score between 0 and 1 for a pair (audio, text), representing how well the text matches the spoken audio.
 The model can be integrated as a Reward Model within RLHF pipelines to evaluate or fine-tune ASR models based on human preference scores.
@@ -34,7 +34,7 @@ The model consists of two main encoders — one for audio and one for text — f
 ### Audio Encoder
 Input: Raw waveform (16 kHz)
-Feature extraction: Mel-spectrogram computed from waveform using WhisperFeatureExtractor
 Parameters:
 - n_fft: 1024

 # Description
 This model is a Reward Model trained on the [RobotsMali transcription scorer dataset](https://huggingface.co/datasets/RobotsMali/transcription-scorer), where the scores were assigned by human annotators.
+It predicts a continuous score between 0 and 1 for a pair (**audio**, **text**), representing how well the text matches the spoken audio.
 The model can be integrated as a Reward Model within RLHF pipelines to evaluate or fine-tune ASR models based on human preference scores.
 ### Audio Encoder
 Input: Raw waveform (16 kHz)
+Feature extraction: Mel-spectrogram computed from waveform using ***WhisperFeatureExtractor***
 Parameters:
 - n_fft: 1024