Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Duplicated from  Mayank022/Audio-Language-Model

teamvizuara
/
Vocal-LLM

Audio-Text-to-Text
Transformers
Safetensors
Hindi
English
audio
speech
audio-language-model
whisper
sarvam-m
lora
projector
indic
hindi
Model card Files Files and versions
xet
Community
Vocal-LLM
48.8 GB
  • 1 contributor
History: 5 commits
Mayank022's picture
Mayank022
Update README.md
14f64da verified 6 days ago
  • latest_checkpoint
    Duplicate from Mayank022/Audio-Language-Model 6 days ago
  • .gitattributes
    1.59 kB
    Duplicate from Mayank022/Audio-Language-Model 6 days ago
  • README.md
    4.08 kB
    Update README.md 6 days ago
  • config.py
    2.14 kB
    Duplicate from Mayank022/Audio-Language-Model 6 days ago
  • data.py
    3.78 kB
    Duplicate from Mayank022/Audio-Language-Model 6 days ago
  • inference.py
    1.69 kB
    Duplicate from Mayank022/Audio-Language-Model 6 days ago
  • model.py
    5.26 kB
    Duplicate from Mayank022/Audio-Language-Model 6 days ago
  • train.py
    10.7 kB
    Duplicate from Mayank022/Audio-Language-Model 6 days ago