Spaces:
Running
Running
metadata
title: Manan ML API
emoji: 🧠
colorFrom: green
colorTo: purple
sdk: docker
pinned: false
license: mit
Manan ML API - Mental Health Emotion Recognition
This is the ML backend for the Manan (मनन) mental health analysis app.
Features
- Face Emotion Recognition: Using DeepFace to detect 7 emotions (Angry, Disgust, Fear, Happy, Sad, Surprise, Neutral)
- Voice Emotion Recognition: Using SpeechBrain's Wav2Vec2-IEMOCAP model
- Text Emotion Recognition: Using Whisper for transcription + DistilBERT for emotion classification
API Endpoints
Health Check
GET /
GET /health
Face Emotion Prediction
POST /pred_face
- files: List of image files
- questions: JSON string with question metadata
Voice Emotion Prediction
POST /predict_audio_batch
- files: List of audio files (WAV format)
Text Emotion Prediction
POST /predict_text/
- files: List of audio files (WAV format)
- Returns: transcript + emotion
Models Used
- DeepFace - Facial emotion recognition
- OpenAI Whisper (base) - Speech-to-text
- SpeechBrain Wav2Vec2-IEMOCAP - Voice emotion recognition
- DistilBERT - Text emotion classification
Usage
The API is designed to be used with the Manan Flutter mobile app for multimodal emotion analysis.
License
MIT License