silero-vad / README.md

vccarvalho11

Upload silero-vad ONNX model

5af90a9 verified 25 days ago

preview code

raw

history blame contribute delete

1.49 kB

metadata

library_name: onnx
tags:
  - silero
  - voice-activity-detection
  - vad
  - audio
  - onnx
  - inference4j
license: mit
pipeline_tag: voice-activity-detection

Silero VAD — ONNX

ONNX export of Silero VAD, a lightweight and fast voice activity detection model. Detects speech segments in audio with high accuracy and low latency.

Mirrored for use with inference4j, an inference-only AI library for Java.

Original Source

Repository: Silero Team (snakers4)
License: mit

Usage with inference4j

try (SileroVAD vad = SileroVAD.fromPretrained("models/silero-vad")) {
    List<VoiceSegment> segments = vad.detect(Path.of("meeting.wav"));
    for (VoiceSegment segment : segments) {
        System.out.printf("Speech: %.2fs - %.2fs%n", segment.start(), segment.end());
    }
}

Model Details

Property	Value
Architecture	Silero VAD (lightweight CNN + LSTM)
Task	Voice activity detection
Input	16kHz mono audio (float32 waveform, 512-sample chunks)
Output	Speech probability per chunk
Model size	~2 MB
Original source	snakers4/silero-vad

License

This model is licensed under the MIT License. Original model by Silero Team.