Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
audio-flamingo-next-captioner-hf
like
6
Follow
NVIDIA
55k
Audio-Text-to-Text
Transformers
Safetensors
4 datasets
English
audioflamingonext
text-generation
audio
speech
sound
music
reasoning
audio understanding
audio captioning
long-context
long-form-captioning
audio-language-model
long-audio
timestamp-grounding
arxiv:
2604.10905
License:
other
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
audio-flamingo-next-captioner-hf
16.5 GB
Ctrl+K
Ctrl+K
1 contributor
History:
14 commits
SreyanG-NVIDIA
Add AF-Skills tag
37b580a
2 days ago
static
Add license files
4 days ago
.gitattributes
Safe
1.57 kB
Upload processor
12 days ago
README.md
12.3 kB
Add AF-Skills tag
2 days ago
chat_template.jinja
Safe
1.11 kB
Upload processor
12 days ago
config.json
Safe
2.4 kB
Upload AudioFlamingoNextForConditionalGeneration
9 days ago
generation_config.json
Safe
147 Bytes
Upload AudioFlamingoNextForConditionalGeneration
12 days ago
model.safetensors
16.5 GB
xet
Upload AudioFlamingoNextForConditionalGeneration
4 days ago
processor_config.json
Safe
548 Bytes
Upload processor
12 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload processor
12 days ago
tokenizer_config.json
Safe
514 Bytes
Upload processor
12 days ago