Releasing Vividh-ASR — an open benchmark and models for Hindi and Malayalam ASR.
Vividh-ASR is built from public data, stratified by complexity: → Clean recordings → Noisy and accented speech → Spontaneous, conversational audio
Alongside the benchmark, we release: → Open models for Hindi and Malayalam → A training recipe with two counterintuitive choices that moved the needle → What failed, not just what worked
The stratified evaluation methodology transfers directly to any low-resource language setup — beyond Hindi and Malayalam.
Built at @adalatai, where we build speech tech for Indian courts. This is our first open contribution back to the community. @janaab@Kush0610@orgh0
Just published: how we built production Sango (Central African Republic) translation without fine-tuning, parallel corpus, or training compute.
The method — vocabulary-augmented prompting with a 581-entry native-speaker-verified lexicon — generalizes to any of the ~2,000 African languages at the same data-poverty level. Recipe, dataset, and code template all included.
I wanted to share a cool feature from my open source AI native web browser, Vessel: Persistent highlights!
You can highlight anything on the page and the context is provided to the agent. It's kind of a fun way to learn about new stuff, synthesize info, or just deepen your comprehension/understanding.
Since highlights are persistent, you can close the page, come back later - and your highlights will be exactly where you left them. I've found this particularly useful when reviewing technical blogs, model cards, etc.