suwon commited on
Commit
9d06510
·
verified ·
1 Parent(s): 981c60c

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. README.md +64 -3
  2. last_best_checkpoint +1 -0
  3. last_best_checkpoint.pt +3 -0
README.md CHANGED
@@ -1,3 +1,64 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ # Introduction
6
+
7
+ The MossFormer2_SE_48K model weights for 48 kHz speech enhancement in [ClearerVoice-Studio](https://github.com/modelscope/ClearerVoice-Studio/tree/main) repo.
8
+
9
+ This model is trained on large scale datasets inclduing open-sourced and private data.
10
+
11
+ It enhances speech audios by removing background noise.
12
+
13
+ # Install
14
+
15
+ **Clone the Repository**
16
+
17
+ ``` sh
18
+ git clone https://github.com/modelscope/ClearerVoice-Studio.git
19
+ ```
20
+
21
+ **Create Conda Environment**
22
+
23
+ ``` sh
24
+ cd ClearerVoice-Studio
25
+ conda create -n clearvoice python=3.8
26
+ conda activate clearvoice
27
+ pip install -r requirements.txt
28
+ ```
29
+
30
+ **Run Script**
31
+
32
+ Go to `clearvoice/` and use the following examples. The MossFormer2_SE_48K model will be downloaded from huggingface automatically.
33
+
34
+ Sample example 1: use speech enhancement model `MossFormer2_SE_48K` to process one wave file of `samples/input.wav` and save the output wave file to `samples/output_MossFormer2_SE_48K.wav`
35
+
36
+ ```python
37
+ from clearvoice import ClearVoice
38
+
39
+ myClearVoice = ClearVoice(task='speech_enhancement', model_names=['MossFormer2_SE_48K'])
40
+
41
+ output_wav = myClearVoice(input_path='samples/input.wav', online_write=False)
42
+
43
+ myClearVoice.write(output_wav, output_path='samples/output_MossFormer2_SE_48K.wav')
44
+ ```
45
+
46
+ Sample example 2: use speech enhancement model `MossFormer2_SE_48K` to process all input wave files in `samples/path_to_input_wavs/` and save all output files to `samples/path_to_output_wavs`
47
+
48
+ ```python
49
+ from clearvoice import ClearVoice
50
+
51
+ myClearVoice = ClearVoice(task='speech_enhancement', model_names=['MossFormer2_SE_48K'])
52
+
53
+ myClearVoice(input_path='samples/path_to_input_wavs', online_write=True, output_path='samples/path_to_output_wavs')
54
+ ```
55
+
56
+ Sample example 3: use speech enhancement model `MossFormer2_SE_48K` to process wave files listed in `samples/audio_samples.scp' file, and save all output files to 'samples/path_to_output_wavs_scp/'
57
+
58
+ ```python
59
+ from clearvoice import ClearVoice
60
+
61
+ myClearVoice = ClearVoice(task='speech_enhancement', model_names=['MossFormer2_SE_48K'])
62
+
63
+ myClearVoice(input_path='samples/scp/audio_samples.scp', online_write=True, output_path='samples/path_to_output_wavs_scp')
64
+ ```
last_best_checkpoint ADDED
@@ -0,0 +1 @@
 
 
1
+ last_best_checkpoint.pt
last_best_checkpoint.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:03692b9f773bbd6bb43b9c5a41f96b1e28affd66e13796b7bec66ad3d8b227c6
3
+ size 221552019