Update README.md
Browse files
README.md
CHANGED
|
@@ -69,6 +69,7 @@ The following table describes the available features:
|
|
| 69 |
| attention | Same as the pooled statistics but with the attention weights applied.
|
| 70 |
| embedding | The standard ECAPA2 speaker embedding.
|
| 71 |
|
|
|
|
| 72 |
The following table describes the available features:
|
| 73 |
|
| 74 |
| Feature Type| Description | Usage | Labels |
|
|
@@ -76,7 +77,7 @@ The following table describes the available features:
|
|
| 76 |
| Local Feature | Non-uniform effective receptive field in the frequency dimension of each frame-level feature.| Abstract features, probably usefull in tasks less related to speaker characteristics. | lfe1, lfe2, lfe3, lfe4
|
| 77 |
| Global Feature | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Generally capture intra-speaker variance better then speaker embeddings. E.g. speaker profiling, emotion recognition. | gfe1, gfe2, gfe3, pool
|
| 78 |
| Speaker Embedding | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Best for tasks directly depending on the speaker identity (as opposed to speaker characteristics). E.g. speaker verification, speaker diarization. | embedding
|
| 79 |
-
|
| 80 |
|
| 81 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 82 |
|
|
|
|
| 69 |
| attention | Same as the pooled statistics but with the attention weights applied.
|
| 70 |
| embedding | The standard ECAPA2 speaker embedding.
|
| 71 |
|
| 72 |
+
<!--
|
| 73 |
The following table describes the available features:
|
| 74 |
|
| 75 |
| Feature Type| Description | Usage | Labels |
|
|
|
|
| 77 |
| Local Feature | Non-uniform effective receptive field in the frequency dimension of each frame-level feature.| Abstract features, probably usefull in tasks less related to speaker characteristics. | lfe1, lfe2, lfe3, lfe4
|
| 78 |
| Global Feature | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Generally capture intra-speaker variance better then speaker embeddings. E.g. speaker profiling, emotion recognition. | gfe1, gfe2, gfe3, pool
|
| 79 |
| Speaker Embedding | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Best for tasks directly depending on the speaker identity (as opposed to speaker characteristics). E.g. speaker verification, speaker diarization. | embedding
|
| 80 |
+
-->
|
| 81 |
|
| 82 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 83 |
|