Speech AI models for Apple Silicon via MLX: ASR, TTS, VAD, diarization, and speaker embedding.
- aufklarer/WeSpeaker-ResNet34-LM-MLX
  Audio Classification • 373k • 2
- aufklarer/Qwen3-ASR-0.6B-MLX-4bit
  0.3B • 58.1k • 2
- aufklarer/Qwen3-ForcedAligner-0.6B-4bit
  Audio Classification • 52.3k • 1
- aufklarer/Pyannote-Segmentation-MLX
  Voice Activity Detection • 6.51k