Miracle12345's picture
Update README.md
ed17bd5 verified
metadata
license: mit
language:
  - en
base_model:
  - ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition

Speech Emotion Recognition - 6-Class Classifier

This model is a fine-tuned version of ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition, specifically designed to classify emotions in English speech.

๐Ÿง  Emotion Classes

The model predicts one of the following six emotions:

Happy

Angry

Disgust

Fearful

Neutral

Sad

๐Ÿ“Š Dataset

The model was trained on the Speech Emotion Recognition dataset from Kaggle: ๐Ÿ”— https://www.kaggle.com/datasets/kevinignatiuswijaya/speech-emotion-recognition-dl

๐ŸŽฏ Accuracy Achieved an accuracy of 84% on the test set.

๐Ÿ”ง Base Model Fine-tuned from the pretrained model: ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition

Load model and feature extractor

model = Wav2Vec2ForSequenceClassification.from_pretrained("your-username/your-model-name") extractor = Wav2Vec2FeatureExtractor.from_pretrained("your-username/your-model-name")

Create pipeline

classifier = pipeline("audio-classification", model=model, feature_extractor=extractor)

Predict emotion

result = classifier("path/to/audio.wav") print(result)

๐Ÿงช Applications This model can be used for:

Emotion-aware virtual assistants

Mental health monitoring tools

Human-computer interaction research

Call center emotion analytics

๐Ÿ“ License

Ensure compliance with the licenses for both the Kaggle dataset and the pretrained model used.