---
title: Josh Talks AI
emoji: 🎙️
colorFrom: indigo
colorTo: pink
sdk: streamlit
sdk_version: 1.45.1
app_file: app.py
pinned: false
---
# Josh Talks AI 🎙️
*On a mission to make machines talk like humans.*
Welcome to the official Hugging Face organization for **Josh Talks AI**, a data-centric initiative focused on creating high-quality datasets.
🌐 **Learn more**: [https://ai.joshtalks.com](https://ai.joshtalks.com)
---
## 🧭 Problems with ASR
- *Conversational multi speaker speech*
- *Dialect and Accented speech*
- *Child Speech*
- *Small models for real time low latency*
## 🧭 Our Mission
We are building open datasets to power the next generation of Speech AI β€” across languages, domains, and communities.
## 🧭 Our Work
**Datasets** - Highest quality scientifically designed datasets for training models.
**Benchmarks** - We can only fix what we can measure. Our benchmarks tell researchers exactly where their models fail.
## 📦 Coming Soon
We are currently preparing a series of open datasets, including:
- **Multilingual speech datasets**
Transcribed Indian language audio from real-world conversations and talks.
- **Cultural & regional text corpora**
Clean, annotated text datasets in Hindi, Tamil, Bengali, Marathi, and more.
- **Media-rich data from Indian contexts**
Video metadata, subtitles, and emotion-rich labels for AI-driven storytelling.
Stay tuned: our first releases are just around the corner!
## 🀝 Let's Collaborate
We welcome partnerships with researchers, universities, NGOs, and AI developers working on ASR challenges. If you're interested in contributing to or using our datasets, reach out through the channels below.
---
## 📬 Contact
🌐 **Website**: [https://ai.joshtalks.com](https://ai.joshtalks.com)
💼 **LinkedIn**: [Josh Talks](https://www.linkedin.com/company/joshtalks/)