Kumarmanas Nethil
AI & ML interests
speech, dialog systems and low-resource ML
Recent Activity
new activity about 6 hours ago
skit-ai/emotion-tts:Reformat dataset for Dataset Viewer + add Dataset Card reacted to kavyamanohar's post with š about 7 hours ago
Releasing Vividh-ASR ā an open benchmark and models for Hindi and Malayalam ASR.
Vividh-ASR is built from public data, stratified by complexity:
ā Clean recordings
ā Noisy and accented speech
ā Spontaneous, conversational audio
Alongside the benchmark, we release:
ā Open models for Hindi and Malayalam
ā A training recipe with two counterintuitive choices that moved the needle
ā What failed, not just what worked
The stratified evaluation methodology transfers directly to any low-resource language setup ā beyond Hindi and Malayalam.
Built at @adalatai, where we build speech tech for Indian courts. This is our first open contribution back to the community. @janaab @Kush0610 @orgh0
Link: https://huggingface.co/blog/adalat-ai/vividh-benchmark liked a dataset 2 days ago
L-NLProc/VidhikDastaavej