marlenezw
/

AutoVC_Voice_Conversion

Model card Files Files and versions

marlenezw commited on Feb 15, 2023

Commit

86a0b16

·

1 Parent(s): bf0121b

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -1,3 +1,12 @@
 ---
 license: cc-by-2.0
 ---

 ---
 license: cc-by-2.0
+language:
+- en
+tags:
+- code
+- audio
+- voice
 ---
+This model is used to extract the speaker-agnostic content representation of an audio file. The model leverages AutoVC from Qian et al. [2019]..
+ The AutoVC network utilizes an LSTM-based encoder that compresses the input audio into a compact representation (bottleneck) trained to abandon the original speaker identity but preserve content. In our case, we extract a content embedding A ∈ R𝑇×𝐷 from AutoVC network, where 𝑇 is the total number of input audio frames, and 𝐷 is the content dimension.