title: Josh Talks AI
emoji: 🎙️
colorFrom: indigo
colorTo: pink
sdk: streamlit
sdk_version: 1.45.1
app_file: app.py
pinned: false

Josh Talks AI 🎙️

On a mission to make machines talk like humans.

Welcome to the official Hugging Face organization for Josh Talks AI, a data-centric initiative focused on creating high-quality datasets.

🌐 Learn more: https://ai.joshtalks.com


🧭 Problems with ASR

  • Conversational, multi-speaker speech

  • Dialectal and accented speech

  • Child speech

  • Small models for real-time, low-latency inference

🧭 Our Mission

We are building open datasets to power the next generation of Speech AI across languages, domains, and communities.

🧭 Our Work

Datasets - High-quality, scientifically designed datasets for training models.

Benchmarks - We can only fix what we can measure. Our benchmarks tell researchers exactly where their models fail.

📦 Coming Soon

We are currently preparing a series of open datasets, including:

  • Multilingual speech datasets
    Transcribed Indian-language audio from real-world conversations and talks.

  • Cultural & regional text corpora
    Clean, annotated text datasets in Hindi, Tamil, Bengali, Marathi, and more.

  • Media-rich data from Indian contexts
    Video metadata, subtitles, and emotion-rich labels for AI-driven storytelling.

Stay tuned: our first releases are just around the corner!

🀝 Let's Collaborate

We welcome partnerships with researchers, universities, NGOs, and AI developers working on ASR challenges. If you're interested in contributing to or using our datasets, reach out below.


📬 Contact

🌐 Website: https://ai.joshtalks.com
💼 LinkedIn: Josh Talks