Jenthe/ECAPA2

Jenthe committed on Oct 16, 2023

Commit

80c97ed

1 Parent(s): 5ab3c24

Update README.md

Files changed (1)

README.md CHANGED

@@ -69,6 +69,7 @@ The following table describes the available features:
 | attention | Same as the pooled statistics but with the attention weights applied.
 | embedding | The standard ECAPA2 speaker embedding.
 The following table describes the available features:
 | Feature Type| Description | Usage | Labels |
@@ -76,7 +77,7 @@ The following table describes the available features:
 | Local Feature | Non-uniform effective receptive field in the frequency dimension of each frame-level feature.| Abstract features, probably usefull in tasks less related to speaker characteristics. | lfe1, lfe2, lfe3, lfe4
 | Global Feature | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Generally capture intra-speaker variance better then speaker embeddings. E.g. speaker profiling, emotion recognition. | gfe1, gfe2, gfe3, pool
 | Speaker Embedding | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Best for tasks directly depending on the speaker identity (as opposed to speaker characteristics). E.g. speaker verification, speaker diarization. | embedding
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->

 | attention | Same as the pooled statistics but with the attention weights applied.
 | embedding | The standard ECAPA2 speaker embedding.
+<!--
 The following table describes the available features:
 | Feature Type| Description | Usage | Labels |
 | Local Feature | Non-uniform effective receptive field in the frequency dimension of each frame-level feature.| Abstract features, probably usefull in tasks less related to speaker characteristics. | lfe1, lfe2, lfe3, lfe4
 | Global Feature | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Generally capture intra-speaker variance better then speaker embeddings. E.g. speaker profiling, emotion recognition. | gfe1, gfe2, gfe3, pool
 | Speaker Embedding | Uniform effective receptive field of each frame-level feature in the frequency dimension.| Best for tasks directly depending on the speaker identity (as opposed to speaker characteristics). E.g. speaker verification, speaker diarization. | embedding
+-->
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
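
Review note: the table that this commit wraps in an HTML comment is the only place where the feature labels are grouped by type, so a small lookup may help readers who still need that mapping. The sketch below copies the labels verbatim from the table in the diff. The names `FEATURE_LABELS` and `feature_type` are hypothetical and are not part of the ECAPA2 repository. The `attention` label belongs to the other table in the README and has no feature type listed in the diff, so it is deliberately left out.

```python
# Hypothetical helper: maps an ECAPA2 feature label to the feature type
# listed in the (now commented-out) README table. Labels are taken from
# the diff; the dict and function names are assumptions for illustration.
FEATURE_LABELS = {
    "Local Feature": ("lfe1", "lfe2", "lfe3", "lfe4"),
    "Global Feature": ("gfe1", "gfe2", "gfe3", "pool"),
    "Speaker Embedding": ("embedding",),
}


def feature_type(label: str) -> str:
    """Return the feature type for a label, or raise KeyError if unlisted."""
    for ftype, labels in FEATURE_LABELS.items():
        if label in labels:
            return ftype
    raise KeyError(f"unknown feature label: {label!r}")


if __name__ == "__main__":
    for name in ("lfe1", "pool", "embedding"):
        print(name, "->", feature_type(name))
```

Labels that the table does not classify, such as `attention`, raise `KeyError` rather than being guessed.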