Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
51
Follow
Electronic Engineering @Tsinghua University
66
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
Copy to bucket
new
main
SALMONN
/
qformer
53 kB
Ctrl+K
Ctrl+K
6 contributors
History:
1 commit
Changli
chore: release v1
0bf5005
over 2 years ago
LICENSE_Lavis
Safe
1.5 kB
chore: release v1
over 2 years ago
LICENSE_MiniGPT4
Safe
1.5 kB
chore: release v1
over 2 years ago
LICENSE_VideoLlama
Safe
1.53 kB
chore: release v1
over 2 years ago
Qformer.py
48.5 kB
chore: release v1
over 2 years ago