Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

2,879

Base only

Active filters: speech

QuarkAudio/H-Codec-2.0

Updated Feb 4 • 5

soda-research/soda-600m-prelim

0.6B • Updated Feb 13 • 8

Mayank022/Audio-Language-Model

Audio-Text-to-Text • Updated Feb 26

mlx-community/Soprano-1.1-80M-5bit

Text-to-Speech • 21M • Updated Feb 4 • 19

mlx-community/Soprano-1.1-80M-6bit

Text-to-Speech • 24.5M • Updated Feb 4 • 4

mlx-community/Soprano-1.1-80M-8bit

Text-to-Speech • 31.3M • Updated Feb 4 • 23

diwskx/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 7

Archime/parakeet-tdt-0.6b-v3-fr-tv-media

Automatic Speech Recognition • Updated Feb 5 • 95 • 2

ARTPARK-IISc/Vaani-LID_v0

Feature Extraction • 0.8B • Updated Apr 2 • 412 • 1

Hguimaraes/biome_edge_bio

Feature Extraction • 5.91M • Updated Feb 17 • 17

abedir/emotion-detector

Audio Classification • 94.6M • Updated Feb 5 • 17 • 1

scrappylabs/narrator-tts

Text-to-Speech • Updated Feb 6 • 16 • 2

phtran/stt_en_conformer_ctc_small

Automatic Speech Recognition • Updated Feb 6 • 15

Aratako/MioTTS-1.7B

Text-to-Speech • 2B • Updated Feb 10 • 719 • 13

SiddharthaGolu/Qwen3-TTS-12Hz-1.7B-Base-bf16

Text-to-Speech • 2B • Updated Feb 6 • 9

Hguimaraes/biome_small_bio

Feature Extraction • 26.4M • Updated Feb 17 • 33

Hguimaraes/biome_base_bio

Feature Extraction • 76.1M • Updated Feb 17 • 15

shreyask/voxtral-mini-4b-realtime-mlx-mixed-4-6

1B • Updated Feb 7 • 7

Pomni/kotoba-whisper-v2.2-ggml-allquants

Automatic Speech Recognition • Updated Feb 9

Aratako/MioTTS-0.6B

Text-to-Speech • 0.6B • Updated Feb 10 • 618 • 8

Aratako/MioTTS-0.4B

Text-to-Speech • 0.4B • Updated Feb 10 • 664 • 5

Aratako/MioTTS-0.1B

Text-to-Speech • 0.1B • Updated Feb 13 • 184 • 23

Pomni/whisper-large-v3-arabic-ggml-allquants

Automatic Speech Recognition • Updated Feb 7 • 1

Pomni/whisper-medium-GermanMed-full-ggml-allquants

Automatic Speech Recognition • Updated Feb 7

alphacep/vosk-model-small-streaming-bn

Automatic Speech Recognition • Updated Feb 7

xnpx/wav2vec2-large-xlsr-ipa-phonemes

Automatic Speech Recognition • 0.3B • Updated Feb 7 • 7 • 1

smdesai/Qwen3-TTS-12Hz-1.7B-Base-4bit

Text-to-Speech • 0.6B • Updated Feb 8 • 5

Aratako/MioTTS-1.2B

Text-to-Speech • 1B • Updated Feb 10 • 347 • 7

OzLabs/Caspi-1.7B

Automatic Speech Recognition • 2B • Updated Mar 8 • 197 • 3

huper29/huper_recognizer

0.3B • Updated 11 days ago • 5.19k • 6