devanshty commited on
Commit
f66e5fd
·
verified ·
1 Parent(s): c57c395

Add model card

Browse files
Files changed (1) hide show
  1. README.md +58 -0
README.md ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - pytorch
5
+ - audio
6
+ - emotion-recognition
7
+ - audio-classification
8
+ - customer-service
9
+ ---
10
+
11
+ # Mantis
12
+
13
+ ## Model Description
14
+ Mantis is an audio-based emotion recognition model designed for customer service intelligence. It classifies emotional states from speech audio using a HuBERT + CNN hybrid architecture, enabling real-time sentiment monitoring in call center environments.
15
+
16
+ ## Model Architecture
17
+ - **Architecture**: HuBERT (feature extractor) + CNN (classifier head)
18
+ - **Framework**: PyTorch
19
+ - **Task**: Audio Emotion Classification
20
+ - **Input**: Raw audio waveforms / mel spectrograms
21
+ - **Output**: Emotion class (e.g., neutral, happy, angry, sad, frustrated)
22
+
23
+ ## Training Details
24
+ - **Dataset**: Trained on emotion speech datasets (e.g., RAVDESS, IEMOCAP, or proprietary customer service audio)
25
+ - **Approach**: HuBERT pre-trained representations fed into a custom CNN classifier
26
+ - **Fine-tuning**: End-to-end fine-tuning for customer service emotion categories
27
+
28
+ ## Performance
29
+ Evaluated on held-out emotion speech samples with strong accuracy across key emotion classes relevant to customer service.
30
+
31
+ ## Files
32
+ | File | Description |
33
+ |------|-------------|
34
+ | `emotion_model.pth` | Final trained HuBERT-CNN emotion recognition model |
35
+
36
+ ## Usage
37
+
38
+ ```python
39
+ import torch
40
+ from huggingface_hub import hf_hub_download
41
+
42
+ # Download model
43
+ model_path = hf_hub_download(repo_id='devanshty/Mantis', filename='emotion_model.pth')
44
+
45
+ # Load model (adjust to your model class)
46
+ model = torch.load(model_path, map_location='cpu')
47
+ model.eval()
48
+
49
+ # Run inference on audio features
50
+ # (preprocess audio to match training pipeline)
51
+ ```
52
+
53
+ ## Download & Use
54
+
55
+ ```python
56
+ from huggingface_hub import hf_hub_download
57
+ model_path = hf_hub_download(repo_id='devanshty/Mantis', filename='emotion_model.pth')
58
+ ```