qwen
yuhiri
talentestors
AI & ML interests
None yet
Recent Activity
updated a collection 13 days ago
VLM updated a collection 13 days ago
DataSet liked a dataset 13 days ago
stepfun-ai/Step-3.5-Flash-SFTOrganizations
VLM
-
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text • 3B • Updated • 2.6M • 3.2k -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 8.69k • 1.58k -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • Updated • 3.63k • 378 -
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 346k • • 385
Application
DataSet
LLM
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language
embedding
TTS
- Runtime error216
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
🎙216Generate speech from text using a reference audio
-
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Text-to-Speech • 2B • Updated • 974k • 1.36k - Running on ZeroFeatured1.78k
Qwen3-TTS Demo
🎙1.78kGenerate speech audio via voice design, cloning, or preset speakers
-
IndexTeam/IndexTTS-2
Text-to-Speech • Updated • 18.4k • 666
Generate-Image
-
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image • Updated • 2.03M • • 7.57k -
black-forest-labs/FLUX.1-Kontext-dev
Image-to-Image • Updated • 84.1k • • 2.58k -
meituan-longcat/LongCat-Image-Edit
Image-to-Image • Updated • 26.1k • • 166 -
meituan-longcat/LongCat-Image
Text-to-Image • Updated • 20.7k • • 239
Generate-3D
qwen
qwen
embedding
VLM
-
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text • 3B • Updated • 2.6M • 3.2k -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 8.69k • 1.58k -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • Updated • 3.63k • 378 -
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 346k • • 385
TTS
- Runtime error216
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
🎙216Generate speech from text using a reference audio
-
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Text-to-Speech • 2B • Updated • 974k • 1.36k - Running on ZeroFeatured1.78k
Qwen3-TTS Demo
🎙1.78kGenerate speech audio via voice design, cloning, or preset speakers
-
IndexTeam/IndexTTS-2
Text-to-Speech • Updated • 18.4k • 666
Application
Generate-Image
-
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image • Updated • 2.03M • • 7.57k -
black-forest-labs/FLUX.1-Kontext-dev
Image-to-Image • Updated • 84.1k • • 2.58k -
meituan-longcat/LongCat-Image-Edit
Image-to-Image • Updated • 26.1k • • 166 -
meituan-longcat/LongCat-Image
Text-to-Image • Updated • 20.7k • • 239
DataSet
Generate-3D
LLM
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language