Running on L40S Agents 601 MinerU Document Extraction Tools π 601 Easy converting PDF and Office docs into Markdown and JSON
Running on L4 Agents Featured 731 StyleTTS 2 π£ 731 Efficient, fast, and natural text to speech with StyleTTS 2!
Running on T4 Agents 242 MassivelyMultilingualTTS π 242 Generate natural speech in 7000+ languages
Running on CPU Upgrade Agents Featured 1.34k Open ASR Leaderboard π 1.34k Explore and compare speech recognition model benchmarks
KillerShoaib/RakibulAI-Utub-Bangla-Transcription Viewer β’ Updated Jan 24, 2025 β’ 302 β’ 10 β’ 3
Running Agents 23 Facial Recognition With Sentiment Detector π 23 Detect faces and analyze emotions/sentiments in images
Running Featured 552 Open Source Ai Year In Review 2024 π» 552 What happened in open-source AI this year, and whatβs next?
firdhokk/speech-emotion-recognition-with-openai-whisper-large-v3 Audio Classification β’ 0.6B β’ Updated Nov 1, 2025 β’ 19.8k β’ 109