tsinghua-ee/SALMONN

Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
SALMONN
404 MB
  • 6 contributors
History: 19 commits
Changli
Update README.md
59f41c2 over 2 years ago
  • beats
    chore: release v1 over 2 years ago
  • other_third-party_licenses
    chore: release v1 over 2 years ago
  • qformer
    chore: release v1 over 2 years ago
  • resource
    chore: release v1 over 2 years ago
  • .gitattributes
    56 Bytes
    chore: release v1 over 2 years ago
  • .gitignore
    3.1 kB
    chore: release v1 over 2 years ago
  • LICENSE
    11.3 kB
    chore: release v1 over 2 years ago
  • README.md
    5.08 kB
    Update README.md over 2 years ago
  • cli_inference.py
    1.88 kB
    chore: release v1 over 2 years ago
  • index.html
    540 Bytes
    chore: release v1 over 2 years ago
  • model.py
    9.79 kB
    chore: release v1 over 2 years ago
  • salmonn_v1.pth

    Detected Pickle imports (4)

    • "torch._utils._rebuild_tensor_v2",
    • "torch.FloatStorage",
    • "collections.OrderedDict",
    • "torch.LongStorage"

    400 MB
    Upload salmonn_v1.pth over 2 years ago
  • web_demo.py
    7.2 kB
    chore: release v1 over 2 years ago
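
The "Detected Pickle imports" entry for salmonn_v1.pth lists the names that the checkpoint's pickle stream would import when loaded. As a rough illustration of how such a list can be derived without executing the pickle, the sketch below walks the opcodes with the standard-library `pickletools` module. The function name `list_pickle_imports` and the demo payload are hypothetical, and this is not necessarily how the Hub's scanner works.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set[str]:
    """Return the dotted names a pickle stream would import when loaded.

    Hypothetical helper: it only reads opcodes and never unpickles anything.
    """
    found = set()
    strings = []  # recent string arguments, needed for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", space-separated.
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two most recent strings.
            module, name = strings[-2], strings[-1]
            found.add(f"{module}.{name}")
        elif isinstance(arg, str):
            strings.append(arg)
    return found


# Demo payload (an assumption for illustration, not the real checkpoint).
demo = pickle.dumps(OrderedDict(weight=[1.0, 2.0]), protocol=2)
print(sorted(list_pickle_imports(demo)))
```

Note that checkpoints written by recent versions of `torch.save` are typically zip archives with the pickle stream stored inside, so the stream would likely need to be extracted (for example with `zipfile`) before a scan like this could be applied to a `.pth` file.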