qwen
yuhiri
talentestors
AI & ML interests
None yet
Recent Activity
liked a model 11 days ago
XiaomiMiMo/MiMo-V2.5-Pro liked a model 23 days ago
deepseek-ai/DeepSeek-V4-Pro updated a collection 23 days ago
LLMOrganizations
VLM
-
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text • 3B • Updated • 2.97M • 3.24k -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 11.2k • 1.61k -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • Updated • 5.72k • 381 -
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 14.9k • • 394
Application
DataSet
LLM
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language
embedding
TTS
- Runtime errorAgents216
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
🎙216Generate speech from text using a reference audio
-
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Text-to-Speech • 2B • Updated • 1.51M • 1.48k - Runtime errorAgentsFeatured1.92k
Qwen3-TTS Demo
🎙1.92kGenerate speech audio from text with custom or cloned voices
-
IndexTeam/IndexTTS-2
Text-to-Speech • Updated • 16.1k • 711
Generate-Image
-
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image • Updated • 2.01M • • 7.71k -
black-forest-labs/FLUX.1-Kontext-dev
Image-to-Image • Updated • 80.7k • • 2.62k -
meituan-longcat/LongCat-Image-Edit
Image-to-Image • Updated • 27.2k • • 173 -
meituan-longcat/LongCat-Image
Text-to-Image • Updated • 20.9k • • 241
Generate-3D
qwen
qwen
embedding
VLM
-
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text • 3B • Updated • 2.97M • 3.24k -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 11.2k • 1.61k -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • Updated • 5.72k • 381 -
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 14.9k • • 394
TTS
- Runtime errorAgents216
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
🎙216Generate speech from text using a reference audio
-
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Text-to-Speech • 2B • Updated • 1.51M • 1.48k - Runtime errorAgentsFeatured1.92k
Qwen3-TTS Demo
🎙1.92kGenerate speech audio from text with custom or cloned voices
-
IndexTeam/IndexTTS-2
Text-to-Speech • Updated • 16.1k • 711
Application
Generate-Image
-
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image • Updated • 2.01M • • 7.71k -
black-forest-labs/FLUX.1-Kontext-dev
Image-to-Image • Updated • 80.7k • • 2.62k -
meituan-longcat/LongCat-Image-Edit
Image-to-Image • Updated • 27.2k • • 173 -
meituan-longcat/LongCat-Image
Text-to-Image • Updated • 20.9k • • 241
DataSet
Generate-3D
LLM
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language