Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nvidia
/
audio-flamingo-next-captioner-hf

Audio-Text-to-Text
Transformers
Safetensors
English
audioflamingonext
text-generation
audio
speech
sound
music
reasoning
audio understanding
audio captioning
long-context
long-form-captioning
audio-language-model
long-audio
timestamp-grounding
Model card Files Files and versions
xet
Community
audio-flamingo-next-captioner-hf
16.5 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 14 commits
SreyanG-NVIDIA's picture
SreyanG-NVIDIA
Add AF-Skills tag
37b580a 2 days ago
  • static
    Add license files 4 days ago
  • .gitattributes
    1.57 kB
    Upload processor 12 days ago
  • README.md
    12.3 kB
    Add AF-Skills tag 2 days ago
  • chat_template.jinja
    1.11 kB
    Upload processor 12 days ago
  • config.json
    2.4 kB
    Upload AudioFlamingoNextForConditionalGeneration 9 days ago
  • generation_config.json
    147 Bytes
    Upload AudioFlamingoNextForConditionalGeneration 12 days ago
  • model.safetensors
    16.5 GB
    xet
    Upload AudioFlamingoNextForConditionalGeneration 4 days ago
  • processor_config.json
    548 Bytes
    Upload processor 12 days ago
  • tokenizer.json
    11.4 MB
    xet
    Upload processor 12 days ago
  • tokenizer_config.json
    514 Bytes
    Upload processor 12 days ago