Chethan Kumar D A
chethan62
AI & ML interests
tech
Recent Activity
liked a model, about 7 hours ago: rootsautomation/GutenOCR-3B
liked a model, 3 days ago: sweepai/sweep-next-edit-1.5B
liked a Space, 4 days ago: Qwen/Qwen3-TTS
Organizations
None yet
TTS
spaces
- XTTS (Runtime error • Featured • 2.77k): Generate speech from text using a reference voice
- Moonshine ASR (Runtime error • 35): Fast & efficient ASR outperforming Whisper!
- Edge TTS Text To Speech (Running • 1.04k): Generate speech from text using Microsoft Edge TTS
- Video Dubbing (SoniTranslate) (Paused • 848): Video Dubbing with Open Source Projects
papers
- The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
  Paper • 2311.10093 • Published • 58
- NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
  Paper • 2311.12229 • Published • 26
- Diffusion Model Alignment Using Direct Preference Optimization
  Paper • 2311.12908 • Published • 49
- VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
  Paper • 2312.00845 • Published • 39
STT
TTS
Ai
webgpu
models