Spaces:

govinuts
/

manan-ml-api

Running

manan-ml-api / README.md

CodeGovindz

Initial commit: Manan ML API for emotion detection

6deff13 5 days ago

1.35 kB

	---
	title: Manan ML API
	emoji: 🧠
	colorFrom: green
	colorTo: purple
	sdk: docker
	pinned: false
	license: mit
	---

	# Manan ML API - Mental Health Emotion Recognition

	This is the ML backend for the Manan (मनन) mental health analysis app.

	## Features

	- Face Emotion Recognition: Using DeepFace to detect 7 emotions (Angry, Disgust, Fear, Happy, Sad, Surprise, Neutral)
	- Voice Emotion Recognition: Using SpeechBrain's Wav2Vec2-IEMOCAP model
	- Text Emotion Recognition: Using Whisper for transcription + DistilBERT for emotion classification

	## API Endpoints

	### Health Check
	```
	GET /
	GET /health
	```

	### Face Emotion Prediction
	```
	POST /pred_face
	- files: List of image files
	- questions: JSON string with question metadata
	```

	### Voice Emotion Prediction
	```
	POST /predict_audio_batch
	- files: List of audio files (WAV format)
	```

	### Text Emotion Prediction
	```
	POST /predict_text/
	- files: List of audio files (WAV format)
	- Returns: transcript + emotion
	```

	## Models Used

	1. DeepFace - Facial emotion recognition
	2. OpenAI Whisper (base) - Speech-to-text
	3. SpeechBrain Wav2Vec2-IEMOCAP - Voice emotion recognition
	4. DistilBERT - Text emotion classification

	## Usage

	The API is designed to be used with the Manan Flutter mobile app for multimodal emotion analysis.

	## License

	MIT License