AI & ML interests

Natural Language Processing for low-resource indigenous languages of Meghalaya

Recent Activity

toiar  updated a collection 17 days ago
Pnar Datasets
toiar  updated a collection 17 days ago
Khasi Datasets
toiar  updated a collection 22 days ago
OCR Dataset
View all activity

Organization Card

Tynrai

Tynrai (Tynrai-AI) is an initiative dedicated to the preservation of language through technology. We focus on digitizing, documenting, and revitalizing the indigenous languages of Meghalaya, India.

We build, curate, and release datasets and models including conversational agents that prioritize real-world impact for the Khasic and Garo languages.

Mission

  • Preserve and digitize indigenous languages
  • Research in low-resource NLP
  • Build high-quality datasets and reproducible models

Areas of Focus

  • Neural Machine Translation (NMT): Specializing in Khasi, Garo, and Pnar
  • Automatic Speech Recognition (ASR): Speech-to-text for indigenous languages
  • Text-to-Speech (TTS): Natural speech generation for local languages
  • Conversational AI: Chat Bots and dialogue systems
  • Language Preservation: Documentation & corpus creation

What You’ll Find Here

  • Chat Bots: Interactive conversational agents for learning and assistance
  • Datasets: Parallel corpora, annotated text, speech resources, and QA Datasets
  • Models: Fine-tuned and experimental NLP models
  • Spaces: Demos and interactive experiments

Contact

For collaboration or questions: - Hugging Face Discussions

Low-resource does not mean low-impact.

models 0

None public yet

datasets 0

None public yet