Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

soham97
/
mellow

small audio-language model
ALM
audio
music
sound events
audio reasoning
audio captioning
audio question answering
zero-shot
audio-text
Model card Files Files and versions
xet
Community
mellow
1.35 GB
  • 1 contributor
History: 13 commits
soham97's picture
soham97
v0_s checkpoint
83672db 9 months ago
  • resource
    first 10 months ago
  • .gitattributes
    1.56 kB
    first 10 months ago
  • README.md
    6.54 kB
    v0_s checkpoint 9 months ago
  • config.json
    0 Bytes
    readme update 10 months ago
  • v0.ckpt
    670 MB
    xet
    first 10 months ago
  • v0.yaml
    373 Bytes
    first 10 months ago
  • v0_s.ckpt
    670 MB
    xet
    v0_s checkpoint 9 months ago