Commit 5af90a9 (verified) by vccarvalho11
1 Parent(s): 026b001

Upload silero-vad ONNX model

Files changed (2)
  1. README.md +49 -0
  2. model.onnx +3 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ ---
+ library_name: onnx
+ tags:
+ - silero
+ - voice-activity-detection
+ - vad
+ - audio
+ - onnx
+ - inference4j
+ license: mit
+ pipeline_tag: voice-activity-detection
+ ---
+
+ # Silero VAD — ONNX
+
+ ONNX export of [Silero VAD](https://github.com/snakers4/silero-vad), a lightweight, fast voice activity detection (VAD) model. It detects speech segments in audio with high accuracy and low latency.
+
+ Mirrored for use with [inference4j](https://github.com/inference4j/inference4j), an inference-only AI library for Java.
+
+ ## Original Source
+
+ - **Repository:** [Silero Team (snakers4)](https://github.com/snakers4/silero-vad)
+ - **License:** MIT
+
+ ## Usage with inference4j
+
+ ```java
+ try (SileroVAD vad = SileroVAD.fromPretrained("models/silero-vad")) {
+     List<VoiceSegment> segments = vad.detect(Path.of("meeting.wav"));
+     for (VoiceSegment segment : segments) {
+         System.out.printf("Speech: %.2fs - %.2fs%n", segment.start(), segment.end());
+     }
+ }
+ ```
+
+ ## Model Details
+
+ | Property | Value |
+ |----------|-------|
+ | Architecture | Silero VAD (lightweight CNN + LSTM) |
+ | Task | Voice activity detection |
+ | Input | 16 kHz mono audio (float32 waveform, 512-sample chunks) |
+ | Output | Speech probability per chunk |
+ | Model size | ~2 MB |
+ | Original source | [snakers4/silero-vad](https://github.com/snakers4/silero-vad) |
+
+ ## License
+
+ This model is licensed under the [MIT License](https://opensource.org/licenses/MIT). Original model by [Silero Team](https://github.com/snakers4/silero-vad).
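
The Model Details table in the README above says the model takes 512-sample chunks of 16 kHz audio and returns one speech probability per chunk, so each chunk covers 512 / 16000 = 0.032 s. The sketch below shows one simple way such per-chunk probabilities could be turned into time spans: threshold each probability and merge consecutive speech chunks. The class `ChunkSegments`, the record `Span`, and the threshold value are hypothetical names and choices for illustration only; this is not the inference4j API, and it may not match how inference4j or Silero VAD post-process probabilities. It assumes Java 16 or later (for records).

```java
import java.util.ArrayList;
import java.util.List;

public class ChunkSegments {
    static final int SAMPLE_RATE = 16_000; // from the Model Details table
    static final int CHUNK_SAMPLES = 512;  // from the Model Details table

    /** Hypothetical holder for a speech span, in seconds (not an inference4j type). */
    record Span(double start, double end) {}

    /** Duration of one chunk in seconds: 512 / 16000 = 0.032. */
    static double chunkSeconds() {
        return (double) CHUNK_SAMPLES / SAMPLE_RATE;
    }

    /** Merges consecutive chunks whose probability is at or above the threshold. */
    static List<Span> toSpans(float[] probs, float threshold) {
        List<Span> spans = new ArrayList<>();
        int startChunk = -1; // index of the first chunk of the open span, or -1 if none
        for (int i = 0; i < probs.length; i++) {
            boolean speech = probs[i] >= threshold;
            if (speech && startChunk < 0) {
                startChunk = i; // a span opens at this chunk
            } else if (!speech && startChunk >= 0) {
                spans.add(new Span(startChunk * chunkSeconds(), i * chunkSeconds()));
                startChunk = -1; // the span closed just before this chunk
            }
        }
        if (startChunk >= 0) { // audio ended while still in speech
            spans.add(new Span(startChunk * chunkSeconds(), probs.length * chunkSeconds()));
        }
        return spans;
    }

    public static void main(String[] args) {
        // Made-up probabilities; 0.5 is an assumed threshold, not a documented default.
        float[] probs = {0.1f, 0.9f, 0.8f, 0.2f, 0.7f};
        for (Span s : toSpans(probs, 0.5f)) {
            System.out.printf("Speech: %.3fs - %.3fs%n", s.start(), s.end());
        }
    }
}
```

A production pipeline would likely add smoothing, such as minimum speech and silence durations, which this sketch omits.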
model.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1a153a22f4509e292a94e67d6f9b85e8deb25b4988682b7e174c65279d8788e3
+ size 2327524
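
The Git LFS pointer above records the SHA-256 digest and byte size of the real `model.onnx`, so a downloaded copy can be checked against it. The following is a minimal sketch of such a check, assuming Java 17 or later (for `java.util.HexFormat`) and assuming the file was saved locally as `model.onnx`; the class name `LfsPointerCheck` and its methods are hypothetical and not part of Git LFS or inference4j.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HexFormat;

public class LfsPointerCheck {
    // Values copied from the LFS pointer in this commit.
    static final String EXPECTED_SHA256 =
            "1a153a22f4509e292a94e67d6f9b85e8deb25b4988682b7e174c65279d8788e3";
    static final long EXPECTED_SIZE = 2_327_524L;

    /** Streams the file through SHA-256 and returns the digest as lowercase hex. */
    static String sha256Hex(Path file) throws IOException, NoSuchAlgorithmException {
        MessageDigest digest = MessageDigest.getInstance("SHA-256");
        try (InputStream in = Files.newInputStream(file)) {
            byte[] buffer = new byte[8192];
            int read;
            while ((read = in.read(buffer)) != -1) {
                digest.update(buffer, 0, read);
            }
        }
        return HexFormat.of().formatHex(digest.digest());
    }

    /** True only if both the byte size and the SHA-256 digest match. */
    static boolean matches(Path file, String expectedSha256, long expectedSize)
            throws IOException, NoSuchAlgorithmException {
        return Files.size(file) == expectedSize
                && sha256Hex(file).equals(expectedSha256);
    }

    public static void main(String[] args) throws Exception {
        // "model.onnx" is an assumed local path; pass another path as the first argument.
        Path model = Path.of(args.length > 0 ? args[0] : "model.onnx");
        System.out.println(matches(model, EXPECTED_SHA256, EXPECTED_SIZE) ? "OK" : "MISMATCH");
    }
}
```

Checking the size first is cheap and catches the common case where the pointer file itself, rather than the model it refers to, was downloaded.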