ghxjiks JSWOOK committed on
Commit d740475 · 0 Parent(s)

Duplicate from JSWOOK/pyannote_3_fine_tuning

Co-authored-by: Jeon Sang Wook <JSWOOK@users.noreply.huggingface.co>

Files changed (7)
  1. .gitattributes +35 -0
  2. README.md +78 -0
  3. config.json +18 -0
  4. config.yaml +19 -0
  5. model.safetensors +3 -0
  6. pytorch_model.bin +3 -0
  7. training_args.bin +3 -0
.gitattributes ADDED
@@ -0,0 +1,35 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
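
To illustrate what these rules imply for the seven files in this commit, the sketch below checks file names against a handful of the patterns using Python's `fnmatch`. This is only an approximation: Git's attribute matching has additional rules (for example for path patterns such as `saved_model/**/*`) that `fnmatch` does not reproduce. The names `LFS_PATTERNS` and `is_lfs_tracked` are hypothetical and not part of the repository.

```python
from fnmatch import fnmatch

# A subset of the patterns added to .gitattributes in this commit.
LFS_PATTERNS = ["*.bin", "*.safetensors", "*.pt", "*.onnx", "*tfevents*"]


def is_lfs_tracked(filename: str) -> bool:
    """Return True if the name matches any pattern (fnmatch approximation of Git's matching)."""
    return any(fnmatch(filename, pattern) for pattern in LFS_PATTERNS)


# The seven files listed under "Files changed".
files = [
    ".gitattributes", "README.md", "config.json", "config.yaml",
    "model.safetensors", "pytorch_model.bin", "training_args.bin",
]
tracked = [name for name in files if is_lfs_tracked(name)]
print(tracked)
```

Under this approximation the three binary artifacts are routed through LFS, which is consistent with their diffs showing three-line pointer files rather than file contents.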
README.md ADDED
@@ -0,0 +1,78 @@
+ ---
+ library_name: transformers
+ language:
+ - en
+ license: mit
+ base_model: pyannote/speaker-diarization-3.1
+ tags:
+ - speaker-diarization
+ - speaker-segmentation
+ - generated_from_trainer
+ datasets:
+ - diarizers-community/voxconverse
+ model-index:
+ - name: JSWOOK/pyannote_3_fine_tuning
+ results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # JSWOOK/pyannote_3_fine_tuning
+
+ This model is a fine-tuned version of [pyannote/speaker-diarization-3.1](https://huggingface.co/pyannote/speaker-diarization-3.1) on the diarizers-community/voxconverse dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.3134
+ - Model Preparation Time: 0.0048
+ - Der: 0.0888
+ - False Alarm: 0.0134
+ - Missed Detection: 0.0337
+ - Confusion: 0.0417
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 32
+ - eval_batch_size: 32
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: cosine
+ - num_epochs: 10
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Der | False Alarm | Missed Detection | Confusion |
+ |:-------------:|:-----:|:----:|:---------------:|:----------------------:|:------:|:-----------:|:----------------:|:---------:|
+ | No log | 1.0 | 24 | 0.3180 | 0.0048 | 0.0915 | 0.0119 | 0.0385 | 0.0410 |
+ | 0.1903 | 2.0 | 48 | 0.3116 | 0.0048 | 0.0903 | 0.0125 | 0.0369 | 0.0409 |
+ | 0.1839 | 3.0 | 72 | 0.3089 | 0.0048 | 0.0896 | 0.0128 | 0.0357 | 0.0411 |
+ | 0.1825 | 4.0 | 96 | 0.3176 | 0.0048 | 0.0896 | 0.0131 | 0.0352 | 0.0413 |
+ | 0.1797 | 5.0 | 120 | 0.3148 | 0.0048 | 0.0892 | 0.0132 | 0.0346 | 0.0413 |
+ | 0.1801 | 6.0 | 144 | 0.3141 | 0.0048 | 0.0890 | 0.0133 | 0.0342 | 0.0415 |
+ | 0.1735 | 7.0 | 168 | 0.3137 | 0.0048 | 0.0887 | 0.0134 | 0.0338 | 0.0416 |
+ | 0.1705 | 8.0 | 192 | 0.3133 | 0.0048 | 0.0887 | 0.0134 | 0.0337 | 0.0416 |
+ | 0.1796 | 9.0 | 216 | 0.3133 | 0.0048 | 0.0887 | 0.0134 | 0.0337 | 0.0417 |
+ | 0.1644 | 10.0 | 240 | 0.3134 | 0.0048 | 0.0888 | 0.0134 | 0.0337 | 0.0417 |
+
+
+ ### Framework versions
+
+ - Transformers 4.44.2
+ - Pytorch 2.5.0+cu121
+ - Datasets 3.1.0
+ - Tokenizers 0.19.1
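
The `Der` column in the README table can be sanity-checked against the other three error columns. Under the standard definition, diarization error rate is the sum of false alarm, missed detection, and speaker confusion. The sketch below recomputes it for a few rows copied from the table. The helper name `der_from_components` and the rounding tolerance are assumptions made for illustration, not part of the training code.

```python
import math

# epoch -> (false alarm, missed detection, confusion, reported DER), copied from the README table.
rows = {
    1: (0.0119, 0.0385, 0.0410, 0.0915),
    5: (0.0132, 0.0346, 0.0413, 0.0892),
    10: (0.0134, 0.0337, 0.0417, 0.0888),
}


def der_from_components(false_alarm: float, missed_detection: float, confusion: float) -> float:
    """Diarization error rate as the sum of its three error components."""
    return false_alarm + missed_detection + confusion


agreement = {}
for epoch, (fa, md, conf, reported) in rows.items():
    recomputed = der_from_components(fa, md, conf)
    # Each column is rounded to four decimals, so allow a small absolute tolerance.
    agreement[epoch] = math.isclose(recomputed, reported, abs_tol=2e-4)
    print(f"epoch {epoch}: recomputed={recomputed:.4f} reported={reported:.4f}")
```

For the final epoch the components add up to the reported 0.0888 exactly. Earlier rows can differ in the last digit because every column was rounded independently.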
config.json ADDED
@@ -0,0 +1,18 @@
+ {
+   "architectures": [
+     "SegmentationModel"
+   ],
+   "chunk_duration": 10.0,
+   "max_speakers_per_chunk": 3,
+   "max_speakers_per_frame": 2,
+   "min_duration": null,
+   "model_type": "pyannet",
+   "sample_rate": 16000,
+   "torch_dtype": "float32",
+   "transformers_version": "4.44.2",
+   "warm_up": [
+     0.0,
+     0.0
+   ],
+   "weigh_by_cardinality": false
+ }
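
Two quantities follow directly from this configuration: the number of audio samples in one chunk, and, assuming the model uses the powerset encoding of pyannote/segmentation-3.0 (an assumption, since the config does not state it), the number of output classes. The sketch below derives both from the fields copied out of `config.json`. The variable names are illustrative only.

```python
import json
from math import comb

# Fields copied from the config.json added in this commit.
config = json.loads("""
{
  "chunk_duration": 10.0,
  "sample_rate": 16000,
  "max_speakers_per_chunk": 3,
  "max_speakers_per_frame": 2
}
""")

# Samples in one chunk: duration in seconds times samples per second.
samples_per_chunk = int(config["chunk_duration"] * config["sample_rate"])

# Powerset classes (assumed encoding): every subset of the chunk's speakers
# with at most `max_speakers_per_frame` members, including the empty set (non-speech).
num_powerset_classes = sum(
    comb(config["max_speakers_per_chunk"], k)
    for k in range(config["max_speakers_per_frame"] + 1)
)

print(samples_per_chunk, num_powerset_classes)
```

With these values each 10-second chunk holds 160000 samples, and the subset count is 1 + 3 + 3 = 7.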
config.yaml ADDED
@@ -0,0 +1,19 @@
+ version: 3.0.0
+
+ pipeline:
+   name: pyannote.audio.pipelines.SpeakerDiarization
+   params:
+     clustering: AgglomerativeClustering
+     embedding: hbredin/wespeaker-voxceleb-resnet34-LM
+     embedding_batch_size: 16
+     embedding_exclude_overlap: true
+     segmentation: pyannote/segmentation-3.0
+     segmentation_batch_size: 32
+
+ params:
+   clustering:
+     method: centroid
+     min_cluster_size: 12
+     threshold: 0.7045654963945799
+   segmentation:
+     min_duration_off: 0.0
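
The effect of `min_duration_off: 0.0` can be pictured with a small stand-alone sketch. It assumes the parameter has its usual pyannote meaning, namely that non-speech gaps shorter than this many seconds are filled. The function `fill_short_gaps` is a hypothetical pure-Python illustration and not pyannote code.

```python
def fill_short_gaps(segments, min_duration_off):
    """Merge (start, end) segments whose separating gap is at most `min_duration_off` seconds."""
    merged = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] <= min_duration_off:
            # Gap is short enough: extend the previous segment instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Made-up speech regions for one speaker, in seconds.
speech = [(0.0, 1.0), (1.2, 2.0), (3.5, 4.0)]

unchanged = fill_short_gaps(speech, 0.0)  # value from config.yaml: positive gaps are kept
smoothed = fill_short_gaps(speech, 0.5)   # a larger value would fill the 0.2 s gap
print(unchanged)
print(smoothed)
```

With the configured value of 0.0 the segments pass through untouched, so any smoothing of short pauses would have to come from elsewhere in the pipeline.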
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0851ed0984dbbc22a62ba602a1cc18eeb3fbbf2a1deb515812f41056a04b9303
+ size 5899124
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:511ceac6e3c008f19fbe4b6b944cada16c75921dc5b1f3d4d2cc01ebc87b0206
+ size 5905907
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:677f6b93b8287b812971afc0a22f66d80d0d008b82848cf17084db5a42d37f5f
+ size 5240
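
The three binary files above are stored as Git LFS pointers: short text files with `version`, `oid`, and `size` lines. A minimal sketch of reading such a pointer is shown below, using the `training_args.bin` pointer from this commit as input. The function name `parse_lfs_pointer` and the returned dictionary layout are assumptions made for illustration, and the sketch handles only the three keys that appear here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dictionary."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    # The oid line has the form '<hash algorithm>:<hex digest>'.
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents copied from the training_args.bin diff.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:677f6b93b8287b812971afc0a22f66d80d0d008b82848cf17084db5a42d37f5f
size 5240
"""

info = parse_lfs_pointer(pointer)
print(info["algorithm"], info["size"])
```

The `size` field gives the byte count of the actual artifact, so the two weight files (5899124 and 5905907 bytes) are each a little under 6 MB, while the pointer itself stays a few lines long.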