How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull Promptengineering/whryte-models:Q4_K_M
Run and chat with the model
lemonade run user.whryte-models-Q4_K_M
List all available models
lemonade list
Quick Links

Whryte Models

Model artifacts downloaded by the Whryte desktop dictation app for Windows. This repository mirrors upstream releases so the app has a stable, owner-controlled download source. These are not original works โ€” see LICENSES.md for the license and origin of every file.

Path Model Used for License
parakeet/ NVIDIA Parakeet TDT 0.6B v3 int8 (sherpa-onnx export) Batch dictation, file transcription CC-BY-4.0
nemotron-en/ NVIDIA Nemotron Speech Streaming EN 0.6B int8, 4 chunk sizes (sherpa-onnx exports) Live dictation (English) OpenMDW-1.1
nemotron35/ NVIDIA Nemotron 3.5 ASR Streaming Multilingual 0.6B int8 (community ONNX export) Live dictation (multilingual) OpenMDW-1.1
llm/ Qwen3-4B-Instruct-2507, Qwen2.5-1.5B/0.5B-Instruct (GGUF Q4_K_M) Transcript enhancement Apache-2.0
diarization/ pyannote segmentation-3.0, 3D-Speaker ERes2Net (sherpa-onnx exports) Speaker identification MIT / Apache-2.0
Downloads last month
19
GGUF
Model size
0.5B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support