Jeff Xie
myjeffxie
Β·
AI & ML interests
education, ai, bigdata
Organizations
T2V
Virtual
-
Running2.78k
OutfitAnyone
π’2.78kGenerate virtual tryβon images for any person and clothing
-
Tann-dev/sex-chat-dirty-girlfriend
Text Generation β’ 7B β’ Updated β’ 80 β’ 43 -
baicai1145/GPT-SoVITS-STAR
Updated β’ 59 -
Runtime error150
Multi Voice TTS(English/Chinese/Japanese)
π150[δΈζ/English/ζ₯ζ¬θͺ]multilingual text-to-speech
ASR
text-gen
audio_deepfake
-
facebook/wav2vec2-base-960h
Automatic Speech Recognition β’ 94.4M β’ Updated β’ 1.29M β’ 387 -
microsoft/wavlm-base
Feature Extraction β’ Updated β’ 83.2k β’ 11 -
microsoft/wavlm-large
Feature Extraction β’ Updated β’ 946k β’ 95 -
facebook/hubert-large-ls960-ft
Automatic Speech Recognition β’ Updated β’ 243k β’ 76
TTS
-
coqui/XTTS-v2
Text-to-Speech β’ Updated β’ 6.64M β’ 3.39k -
Runtime error150
Multi Voice TTS(English/Chinese/Japanese)
π150[δΈζ/English/ζ₯ζ¬θͺ]multilingual text-to-speech
-
alphacep/vosk-tts-ru-gpt-sovits
Updated β’ 3 -
metavoiceio/metavoice-1B-v0.1
Text-to-Speech β’ Updated β’ 80 β’ 790
RAG
dataset
text_to_image
audio_deepfake
-
facebook/wav2vec2-base-960h
Automatic Speech Recognition β’ 94.4M β’ Updated β’ 1.29M β’ 387 -
microsoft/wavlm-base
Feature Extraction β’ Updated β’ 83.2k β’ 11 -
microsoft/wavlm-large
Feature Extraction β’ Updated β’ 946k β’ 95 -
facebook/hubert-large-ls960-ft
Automatic Speech Recognition β’ Updated β’ 243k β’ 76
T2V
TTS
-
coqui/XTTS-v2
Text-to-Speech β’ Updated β’ 6.64M β’ 3.39k -
Runtime error150
Multi Voice TTS(English/Chinese/Japanese)
π150[δΈζ/English/ζ₯ζ¬θͺ]multilingual text-to-speech
-
alphacep/vosk-tts-ru-gpt-sovits
Updated β’ 3 -
metavoiceio/metavoice-1B-v0.1
Text-to-Speech β’ Updated β’ 80 β’ 790
Virtual
-
Running2.78k
OutfitAnyone
π’2.78kGenerate virtual tryβon images for any person and clothing
-
Tann-dev/sex-chat-dirty-girlfriend
Text Generation β’ 7B β’ Updated β’ 80 β’ 43 -
baicai1145/GPT-SoVITS-STAR
Updated β’ 59 -
Runtime error150
Multi Voice TTS(English/Chinese/Japanese)
π150[δΈζ/English/ζ₯ζ¬θͺ]multilingual text-to-speech
RAG
ASR
dataset
text-gen