tsinghua-ee/SALMONN
Electronic Engineering @Tsinghua University
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv: 2310.13289
arxiv: 2406.15704
License: apache-2.0
main
SALMONN/resource
3.64 MB
6 contributors
History: 3 commits
Changli
chore: release v1
0bf5005
over 2 years ago
Name            Size            Scan   Last commit          Updated
audio_demo      -               -      chore: release v1    over 2 years ago
response_demo   -               -      chore: release v1    over 2 years ago
salmon.png      1.49 MB (xet)   Safe   chore: release v1    over 2 years ago
structure.png   67.7 kB         Safe   chore: release v1    over 2 years ago