Sympatheia

This is the model checkpoint for Sympatheia, an emotionally adaptive speech-to-speech dialogue model. It includes LoRA adapter checkpoint files.

[Paper] | [Demo] | [Dataset] | [Code]


Model description

Sympatheia fine-tunes GLM-4-Voice-9B with LoRA to generate spoken responses conditioned on a continuous valence–arousal (VA) affect signal injected into the system prompt as User emotion (valence=v, arousal=a). It is trained on Sympatheia-18k, a synthetic corpus of 18k emotion-conditioned spoken dialogue pairs spanning 12 emotion anchors (happy, sad, angry, excited, frustrated, anxious, relaxed, surprised, disgusted, tired, content, neutral).

How to use

This checkpoint is a LoRA adapter for GLM-4-Voice-9B. You also need:

  • The GLM-4-Voice-9B base model (THUDM/glm-4-voice-9b)
  • The GLM-4-Voice decoder weights (flow.pt, hift.pt from THUDM/glm-4-voice-decoder)

See the project code at https://github.com/susameddin/sympatheia for full inference and evaluation scripts.

# Download this checkpoint
huggingface-cli download anonymous2222/Sympatheia --local-dir /path/to/checkpoint

# Run inference (from the project src/ directory)
python inference_sympatheia.py --checkpoint /path/to/checkpoint

# Interactive Gradio demo
python gradio_demo.py --checkpoint /path/to/checkpoint --port 7860

Training data

Sympatheia-18k (https://huggingface.co/datasets/susameddin/Sympatheia-18k): 18k synthetic emotion-conditioned spoken dialogue pairs (Emotional split: 12k; Neutral split: 6k). Generated with Qwen3-32B (text) and Qwen3-TTS (speech).

Training procedure

LoRA fine-tuning of GLM-4-Voice-9B with DeepSpeed ZeRO Stage 3, BF16 precision. See src/config.yaml in the project code for full hyperparameter details.

Intended use

  • Research on emotionally adaptive voice assistants.
  • Evaluation of continuous affect conditioning for speech-to-speech dialogue.
  • Integration experiments with external emotion sensing modules.

Not intended for: covert emotion sensing, clinical diagnosis, or any deployment without explicit user consent and opt-in affect sensing.

License

Apache 2.0. The GLM-4-Voice-9B base model is subject to the GLM-4-Voice License (https://huggingface.co/THUDM/glm-4-voice-9b).

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for susameddin/Sympatheia

Adapter
(2)
this model

Dataset used to train susameddin/Sympatheia