asapp/slue-phase-2
Viewer • Updated • 74.8k • 2.26k • 12
How to use Masioki/prosody_gtsc_phi-3-mini with Transformers:
# Load model directly
from transformers import CrossAttentionSentenceClassifier
model = CrossAttentionSentenceClassifier.from_pretrained("Masioki/prosody_gtsc_phi-3-mini", dtype="auto")Ground truth text with prosody encoding residual cross attention multi-label DAC
Prosody encoder: 2 layer transformer encoder with initial dense projection
Backbone: Phi 3 mini
Pooling: Self attention
Multi-label classification head: 2 dense layers with two dropouts 0.3 and Tanh activation inbetween
Trained on ground truth.
Evaluated on ground truth (GT) and normalized Whisper small transcripts (E2E).
The following hyperparameters were used during training: