Joker5514 commited on
Commit
7102e14
·
verified ·
1 Parent(s): 2677c0d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -0
README.md CHANGED
@@ -1,3 +1,51 @@
1
  ---
2
  license: mit
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ language:
4
+ - en
5
+ tags:
6
+ - onnx
7
+ - audio
8
+ - voice-isolation
9
+ - speech
10
+ - demucs
11
+ - bsrnn
12
+ pipeline_tag: audio-to-audio
13
+ library_name: onnxruntime
14
  ---
15
+
16
+ # VoiceIsolate-Models
17
+
18
+ Quantized ONNX models used by [VoiceIsolate Pro](https://github.com/Joker5514/VoiceIsolate-Pro) for on-device, GPU-accelerated voice isolation and audio enhancement.
19
+
20
+ All inference runs 100% client-side in the browser via ONNX Runtime Web (WebGPU / WASM fallback). No server required.
21
+
22
+ ## Models in this Repository
23
+
24
+ | File | Description | Size | Source |
25
+ |---|---|---|---|
26
+ | `demucs_v4_quantized.onnx` | Demucs v4 HTDemucs int8-quantized — stem-level voice isolation | ~83 MB | [facebookresearch/demucs](https://github.com/facebookresearch/demucs) |
27
+ | `bsrnn_vocals.onnx` | BSRNN Band-Split RNN vocals separator | ~45 MB | [crlandsc/bsrnn](https://github.com/crlandsc/bsrnn) |
28
+
29
+ ## Usage
30
+
31
+ These models are fetched automatically by `ml-worker-fetch-cache.js` in VoiceIsolate Pro. They are cached in IndexedDB after the first download and never re-fetched.
32
+
33
+ ```js
34
+ // MODEL_REGISTRY entry in ml-worker-fetch-cache.js
35
+ demucs_v4: {
36
+ path: 'models/demucs_v4_quantized.onnx',
37
+ sizeBytes: 87_031_808,
38
+ cdnUrls: ['https://huggingface.co/Joker5514/VoiceIsolate-Models/resolve/main/demucs_v4_quantized.onnx']
39
+ },
40
+ bsrnn_vocals: {
41
+ path: 'models/bsrnn_vocals.onnx',
42
+ sizeBytes: 3_870_554,
43
+ cdnUrls: ['https://huggingface.co/Joker5514/VoiceIsolate-Models/resolve/main/bsrnn_vocals.onnx']
44
+ }
45
+ ```
46
+
47
+ ## License
48
+
49
+ MIT. Model weights inherit the licenses of their respective upstream projects:
50
+ - Demucs: MIT
51
+ - BSRNN: MIT