Anuj Diwan

ajd12342
https://ajd12342.github.io/
  • anuj_diwan
  • ajd12342

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago
ajd12342/paraspeechcaps
published a dataset about 1 month ago
ajd12342/final-test-dataset-splits-v5-with-all-prompts-with-negative-prompts
upvoted an article 3 months ago
There is no such thing as a tokenizer-free lunch

Organizations

University of Texas at Austin

ajd12342's collections (1)

ParaSpeechCaps: Rich Style Prompted TTS
The ParaSpeechCaps dataset and models trained on it
  • Scaling Rich Style-Prompted Text-to-Speech Datasets

    Paper • 2503.04713 • Published Mar 6 • 1
  • ajd12342/paraspeechcaps

    Viewer • Updated Nov 22 • 1.07M • 369 • 17
  • ajd12342/parler-tts-mini-v1-paraspeechcaps

    Text-to-Speech • 0.9B • Updated Sep 17 • 52 • 5
  • ajd12342/parler-tts-mini-v1-paraspeechcaps-only-base

    Text-to-Speech • 0.9B • Updated Sep 17 • 20 • 1