Silero-VAD-v5 β€” CoreML

CoreML conversion of Silero VAD v5 for Apple Neural Engine.

Model Details

Detail Value
Architecture STFT β†’ Conv1d encoder β†’ LSTM β†’ decoder
Parameters ~309K
Input 512 samples (32ms @ 16kHz)
Output Speech probability (0.0–1.0)
Size ~4.2 MB

Usage

let vad = try await SileroVADModel.fromPretrained(backend: .coreML)
let prob = vad.processChunk(samples)

Variants

Variant Backend Model ID
MLX GPU aufklarer/Silero-VAD-v5-MLX
CoreML Neural Engine aufklarer/Silero-VAD-v5-CoreML

Links


Links

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support