Kimani James
IrieDinamik
AI & ML interests
None yet
Recent Activity
- IrieDinamik/ocr-tessdata-best (updated a model 1 day ago)
- IrieDinamik/ocr-nemotron-ocr-v2 (updated a model 1 day ago)
- IrieDinamik/ocr-qwen2-5-vl-3b (updated a model 1 day ago)
Organizations
None yet
ML Models
Machine Learning Models
Vox Jot – File ASR Verified
Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.
Vox Jot – LLM Verified
Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.
- IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF
  1B • Updated • 208
- IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF
  Text Generation • 1B • Updated • 171
- bartowski/Llama-3.2-3B-Instruct-GGUF
  Text Generation • 3B • Updated • 293k • 206
- bartowski/Llama-3.2-1B-Instruct-GGUF
  Text Generation • 1B • Updated • 97.6k • 163
Vox Jot – STT Verified
Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,
- Systran/faster-whisper-tiny
  Automatic Speech Recognition • Updated • 655k • 20
- Systran/faster-whisper-tiny.en
  Automatic Speech Recognition • Updated • 1.17M • 9
- Systran/faster-whisper-base
  Automatic Speech Recognition • Updated • 920k • 26
- Systran/faster-whisper-base.en
  Automatic Speech Recognition • Updated • 86.1k • 4
Vox Jot – Speech Analysis Runtime
Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.
Vox Jot – Speaker Isolation Verified
Curated speaker diarization and isolation models verified for Vox Jot file transcription.
- pyannote/speaker-diarization-community-1
  Automatic Speech Recognition • Updated • 2.89M • 374
- pyannote/speaker-diarization-3.1
  Automatic Speech Recognition • Updated • 10.6M • 1.88k
- BUT-FIT/diarizen-wavlm-large-s80-md-v2
  Voice Activity Detection • Updated • 1.21k • 12
- nvidia/diar_sortformer_4spk-v1
  Automatic Speech Recognition • 0.1B • Updated • 14.8k • 138
Vox Jot – OCR Verified
Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.
Vox Jot – TTS Verified
Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.
Vox Jot - TTS Candidates
Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.