Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
audio-flamingo-next-hf
like
27
Follow
NVIDIA
55k
Audio-Text-to-Text
Transformers
Safetensors
4 datasets
English
audioflamingonext
text-generation
audio
speech
sound
music
reasoning
audio understanding
ASR
audio captioning
long-context
audio-language-model
long-audio
timestamp-grounding
instruction-tuned
arxiv:
2604.10905
License:
other
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
main
audio-flamingo-next-hf
16.5 GB
Ctrl+K
Ctrl+K
1 contributor
History:
16 commits
SreyanG-NVIDIA
Add AF-Skills tag
14ea129
1 day ago
static
Add license files
3 days ago
.gitattributes
Safe
1.57 kB
Upload processor
11 days ago
README.md
12.4 kB
Add AF-Skills tag
1 day ago
chat_template.jinja
Safe
1.11 kB
Upload processor
11 days ago
config.json
Safe
2.4 kB
Upload AudioFlamingoNextForConditionalGeneration
8 days ago
generation_config.json
Safe
147 Bytes
Upload AudioFlamingoNextForConditionalGeneration
11 days ago
model.safetensors
Safe
16.5 GB
xet
Upload AudioFlamingoNextForConditionalGeneration
3 days ago
processor_config.json
Safe
548 Bytes
Upload processor
11 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload processor
11 days ago
tokenizer_config.json
Safe
514 Bytes
Upload processor
8 days ago