Hervé Bredin
commited on
Commit
·
e87ecc5
1
Parent(s):
df706b9
doc: add thresholds
Browse files
README.md
CHANGED
|
@@ -49,9 +49,9 @@ In order to reproduce results of the paper, one should use the following hyper-p
|
|
| 49 |
|
| 50 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 51 |
----------------|---------|----------|-------------------|-------------------
|
| 52 |
-
AMI Mix-Headset |
|
| 53 |
-
DIHARD3 |
|
| 54 |
-
VoxConverse |
|
| 55 |
|
| 56 |
### Overlapped speech detection
|
| 57 |
|
|
@@ -66,9 +66,9 @@ In order to reproduce results of the paper, one should use the following hyper-p
|
|
| 66 |
|
| 67 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 68 |
----------------|---------|----------|-------------------|-------------------
|
| 69 |
-
AMI Mix-Headset |
|
| 70 |
-
DIHARD3 |
|
| 71 |
-
VoxConverse |
|
| 72 |
|
| 73 |
|
| 74 |
### Segmentation
|
|
@@ -84,9 +84,9 @@ In order to reproduce results of the paper, one should use the following hyper-p
|
|
| 84 |
|
| 85 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 86 |
----------------|---------|----------|-------------------|-------------------
|
| 87 |
-
AMI Mix-Headset |
|
| 88 |
-
DIHARD3 |
|
| 89 |
-
VoxConverse |
|
| 90 |
|
| 91 |
### Resegmentation
|
| 92 |
|
|
@@ -110,9 +110,9 @@ In order to reproduce (VBx) results of the paper, one should use the following h
|
|
| 110 |
|
| 111 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 112 |
----------------|---------|----------|-------------------|-------------------
|
| 113 |
-
AMI Mix-Headset |
|
| 114 |
-
DIHARD3 |
|
| 115 |
-
VoxConverse |
|
| 116 |
|
| 117 |
## Citations
|
| 118 |
|
|
|
|
| 49 |
|
| 50 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 51 |
----------------|---------|----------|-------------------|-------------------
|
| 52 |
+
AMI Mix-Headset | 0.851 | 0.430 | 0.115 | 0.146
|
| 53 |
+
DIHARD3 | 0.855 | 0.292 | 0.036 | 0.001
|
| 54 |
+
VoxConverse | 0.883 | 0.688 | 0.106 | 0.526
|
| 55 |
|
| 56 |
### Overlapped speech detection
|
| 57 |
|
|
|
|
| 66 |
|
| 67 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 68 |
----------------|---------|----------|-------------------|-------------------
|
| 69 |
+
AMI Mix-Headset | 0.552 | 0.311 | 0.131 | 0.180
|
| 70 |
+
DIHARD3 | 0.564 | 0.264 | 0.158 | 0.080
|
| 71 |
+
VoxConverse | 0.617 | 0.387 | 0.367 | 0.334
|
| 72 |
|
| 73 |
|
| 74 |
### Segmentation
|
|
|
|
| 84 |
|
| 85 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 86 |
----------------|---------|----------|-------------------|-------------------
|
| 87 |
+
AMI Mix-Headset | 0.784 | 0.661 | 0.127 | 0.003
|
| 88 |
+
DIHARD3 | 0.848 | 0.495 | 0.056 | 0.000
|
| 89 |
+
VoxConverse | 0.882 | 0.779 | 0.304 | 0.484
|
| 90 |
|
| 91 |
### Resegmentation
|
| 92 |
|
|
|
|
| 110 |
|
| 111 |
Dataset | `onset` | `offset` | `min_duration_on` | `min_duration_off`
|
| 112 |
----------------|---------|----------|-------------------|-------------------
|
| 113 |
+
AMI Mix-Headset | 0.542 | 0.527 | 0.044 | 0.705
|
| 114 |
+
DIHARD3 | 0.592 | 0.489 | 0.163 | 0.182
|
| 115 |
+
VoxConverse | 0.537 | 0.724 | 0.410 | 0.563
|
| 116 |
|
| 117 |
## Citations
|
| 118 |
|