Kimani James

IrieDinamik

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

IrieDinamik/vox-jot-releases

updated a model 23 days ago

IrieDinamik/vox-jot-ocr-runtime

updated a model 23 days ago

IrieDinamik/vox-jot-models

View all activity

Organizations

None yet

IrieDinamik 's collections 10

Vox Jot - Creative Audio Verified

Verified Story Studio creative-audio models and runtimes for Vox Jot.

IrieDinamik/vox-jot-creative-audio-runtime

Updated May 25
stabilityai/stable-audio-3-optimized

Text-to-Audio • Updated about 4 hours ago • 99 • 20
cvssp/audioldm2-music

Updated Apr 16, 2024 • 943 • 29
cvssp/audioldm2

Updated Apr 16, 2024 • 17.4k • 70

Vox Jot – Speech Analysis Runtime

Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.

IrieDinamik/vox-jot-speech-analysis-runtime

Automatic Speech Recognition • Updated about 1 month ago

Vox Jot – Speaker Isolation Verified

Curated speaker diarization and isolation models verified for Vox Jot file transcription.

pyannote/speaker-diarization-community-1

Automatic Speech Recognition • Updated Sep 29, 2025 • 4.07M • 698
pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 8.42M • 2.62k
BUT-FIT/diarizen-wavlm-large-s80-md-v2

Voice Activity Detection • Updated Dec 9, 2025 • 1.96k • 18
nvidia/diar_sortformer_4spk-v1

Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 7.87k • 144

Vox Jot – OCR Verified

Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.

IrieDinamik/ocr-tessdata-best

Updated May 15
IrieDinamik/ocr-nemotron-ocr-v2

Updated May 15 • 7
IrieDinamik/ocr-qwen2-5-vl-3b

Image-Text-to-Text • 4B • Updated May 15 • 3
IrieDinamik/ocr-glm-ocr

Image-Text-to-Text • 1B • Updated May 15 • 4

Vox Jot – TTS Verified

Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.

rhasspy/piper-voices

Updated 22 days ago • 589
microsoft/speecht5_tts

Text-to-Speech • Updated Nov 8, 2023 • 78.8k • 836
onnx-community/Kokoro-82M-v1.0-ONNX

Text-to-Speech • Updated Feb 8, 2025 • 647k • 234
hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 13.5M • • 6.46k

Vox Jot - TTS Candidates

Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.

Supertone/supertonic-3

Text-to-Speech • Updated May 18 • 66.1k • 870
mlx-community/kitten-tts-nano-0.8

Text-to-Speech • 14.6M • Updated Feb 24 • 44
mlx-community/kitten-tts-micro-0.8

Text-to-Speech • 35.5M • Updated Feb 24 • 14
mlx-community/kitten-tts-mini-0.8

Text-to-Speech • 73.8M • Updated Feb 24 • 51 • 2

ML Models

Machine Learning Models

AngelSlim/Hy-MT1.5-1.8B-1.25bit

Translation • 2B • Updated May 26 • 114 • 194
Supertone/supertonic-3

Text-to-Speech • Updated May 18 • 66.1k • 870

Vox Jot – File ASR Verified

Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.

ibm-granite/granite-speech-4.1-2b

Automatic Speech Recognition • 2B • Updated 26 days ago • 470k • 147
CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • 2B • Updated 28 days ago • 863k • • 1.03k
Systran/faster-whisper-large-v3

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.08M • 614
mlx-community/nemotron-3.5-asr-streaming-0.6b

Automatic Speech Recognition • 0.6B • Updated Jun 5 • 1.65k • 10

Vox Jot – LLM Verified

Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.

IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF

1B • Updated Apr 28 • 384
IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF

Text Generation • 1B • Updated Apr 28 • 41
bartowski/Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Oct 8, 2024 • 210k • 221
bartowski/Llama-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated Oct 8, 2024 • 439k • 170

Vox Jot – STT Verified

Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,

Systran/faster-whisper-tiny

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.23M • 23
Systran/faster-whisper-tiny.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.1M • 10
Systran/faster-whisper-base

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.44M • 32
Systran/faster-whisper-base.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 244k • 6

Vox Jot - Creative Audio Verified

Verified Story Studio creative-audio models and runtimes for Vox Jot.

IrieDinamik/vox-jot-creative-audio-runtime

Updated May 25
stabilityai/stable-audio-3-optimized

Text-to-Audio • Updated about 4 hours ago • 99 • 20
cvssp/audioldm2-music

Updated Apr 16, 2024 • 943 • 29
cvssp/audioldm2

Updated Apr 16, 2024 • 17.4k • 70

Vox Jot - TTS Candidates

Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.

Supertone/supertonic-3

Text-to-Speech • Updated May 18 • 66.1k • 870
mlx-community/kitten-tts-nano-0.8

Text-to-Speech • 14.6M • Updated Feb 24 • 44
mlx-community/kitten-tts-micro-0.8

Text-to-Speech • 35.5M • Updated Feb 24 • 14
mlx-community/kitten-tts-mini-0.8

Text-to-Speech • 73.8M • Updated Feb 24 • 51 • 2

Vox Jot – Speech Analysis Runtime

Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.

IrieDinamik/vox-jot-speech-analysis-runtime

Automatic Speech Recognition • Updated about 1 month ago

ML Models

Machine Learning Models

AngelSlim/Hy-MT1.5-1.8B-1.25bit

Translation • 2B • Updated May 26 • 114 • 194
Supertone/supertonic-3

Text-to-Speech • Updated May 18 • 66.1k • 870

Vox Jot – Speaker Isolation Verified

Curated speaker diarization and isolation models verified for Vox Jot file transcription.

pyannote/speaker-diarization-community-1

Automatic Speech Recognition • Updated Sep 29, 2025 • 4.07M • 698
pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 8.42M • 2.62k
BUT-FIT/diarizen-wavlm-large-s80-md-v2

Voice Activity Detection • Updated Dec 9, 2025 • 1.96k • 18
nvidia/diar_sortformer_4spk-v1

Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 7.87k • 144

Vox Jot – File ASR Verified

Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.

ibm-granite/granite-speech-4.1-2b

Automatic Speech Recognition • 2B • Updated 26 days ago • 470k • 147
CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • 2B • Updated 28 days ago • 863k • • 1.03k
Systran/faster-whisper-large-v3

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.08M • 614
mlx-community/nemotron-3.5-asr-streaming-0.6b

Automatic Speech Recognition • 0.6B • Updated Jun 5 • 1.65k • 10

Vox Jot – OCR Verified

Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.

IrieDinamik/ocr-tessdata-best

Updated May 15
IrieDinamik/ocr-nemotron-ocr-v2

Updated May 15 • 7
IrieDinamik/ocr-qwen2-5-vl-3b

Image-Text-to-Text • 4B • Updated May 15 • 3
IrieDinamik/ocr-glm-ocr

Image-Text-to-Text • 1B • Updated May 15 • 4

Vox Jot – LLM Verified

Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.

IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF

1B • Updated Apr 28 • 384
IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF

Text Generation • 1B • Updated Apr 28 • 41
bartowski/Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Oct 8, 2024 • 210k • 221
bartowski/Llama-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated Oct 8, 2024 • 439k • 170

Vox Jot – TTS Verified

Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.

rhasspy/piper-voices

Updated 22 days ago • 589
microsoft/speecht5_tts

Text-to-Speech • Updated Nov 8, 2023 • 78.8k • 836
onnx-community/Kokoro-82M-v1.0-ONNX

Text-to-Speech • Updated Feb 8, 2025 • 647k • 234
hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 13.5M • • 6.46k

Vox Jot – STT Verified

Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,

Systran/faster-whisper-tiny

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.23M • 23
Systran/faster-whisper-tiny.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.1M • 10
Systran/faster-whisper-base

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.44M • 32
Systran/faster-whisper-base.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 244k • 6