AI & ML interests
None defined yet.
Recent Activity
Josh Talks AI šļø
On a mission to make machines talk like humans.
Welcome to the official Hugging Face organization for Josh Talks AI ā a data-centric initiative focused on creating high-quality datasets.
š Learn more: https://ai.joshtalks.com
š§ Problems with ASR
Conversational multi speaker speech
Dialect and Accented speech
Child Speech
Small models for real time low latency
š§ Our Mission
We are building open datasets to power the next generation of Speech AI ā across languages, domains, and communities.
š§ Our Work
Datasets - Highest quality scientifically designed datasets for training models.
Benchmarks - We can only fix what we can measure. Our benchmarks tell researchers exactly where their models fail.
š¦ Coming Soon
We are currently preparing a series of open datasets, including:
Multilingual speech datasets
Transcribed Indian language audio from real-world conversations and talks.Cultural & regional text corpora
Clean, annotated text datasets in Hindi, Tamil, Bengali, Marathi, and more.Media-rich data from Indian contexts
Video metadata, subtitles, and emotion-rich labels for AI-driven storytelling.
Stay tuned ā our first releases are just around the corner!
š¤ Let's Collaborate
We welcome partnerships with researchers, universities, NGOs, and AI developers working on ASR challenges. If you're interested in contributing or using our datasets, reach out below.
š¬ Contact
š Website: https://ai.joshtalks.com
š¼ LinkedIn: Josh Talks