🌍 Pukhto/Pashto Open Language Project

Community-led open-source project to make Pashto a first-class language in AI speech and language tooling.

πŸ”— Project Links

🎯 Core Goal

  • Build open datasets, benchmarks, and models for Pashto ASR, TTS, and NLP.
  • Keep work reproducible, transparent, and contribution-friendly.
  • Focus on public good and broad accessibility.

πŸ“š Featured External Dataset

πŸ™Œ Contribute Through Mozilla Common Voice

🌐 Community Resource Profiles

  • Hugging Face (external Pashto resource profile): https://huggingface.co/ihanif
  • Use this profile as a reference point for Pashto ASR/TTS datasets, models, and community experiments.

πŸš€ Start Here

  • πŸ“˜ Purpose: PROJECT_PURPOSE.md
  • 🀝 Contributing: CONTRIBUTING.md
  • πŸ—ΊοΈ Roadmap: ROADMAP.md
  • πŸ›οΈ Governance: GOVERNANCE.md
  • πŸ’¬ Community coordination: community/COMMUNICATION.md

🧩 Initial Workstreams

  • data/ Pashto data collection, cleaning, metadata
  • asr/ speech-to-text baselines and experiments
  • tts/ text-to-speech baselines and experiments
  • benchmarks/ fixed test sets and evaluation scripts
  • apps/desktop/ app integration references
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support