Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ayuto Tsutsumi's picture
3 56

Ayuto Tsutsumi

Atotti
dkakaie's profile picture cocoa2525's profile picture shylockasr's profile picture
·
  • Atotti

AI & ML interests

None yet

Recent Activity

new activity 7 days ago
Atotti/Kimi-Audio-Whisper-Encoder:What is the difference between the encoders for Qwen2 audio, Qwen2.5 Omni, Audio Flamingo 3 and Kimi audio?
liked a model 9 days ago
cyberagent/layerd-birefnet
liked a model 13 days ago
Aratako/T5Gemma-TTS-2b-2b
View all activity

Organizations

CyberAgent's profile picture

Atotti 's collections 1

ALM Audio Encoders
I'm currently in the process of preparing the inference code.
  • Atotti/Google-USM

    Feature Extraction • 0.7B • Updated Aug 12 • 541 • 18
  • Atotti/Qwen3-Omni-AudioTransformer

    0.6B • Updated Oct 4 • 1.68k • 32
  • Atotti/google-usm-bf16

    Feature Extraction • 0.7B • Updated Jul 9 • 64 • 1
  • Atotti/Qwen3-Omni-Captioner-AudioTransformer

    0.6B • Updated 20 days ago • 38
ALM Audio Encoders
I'm currently in the process of preparing the inference code.
  • Atotti/Google-USM

    Feature Extraction • 0.7B • Updated Aug 12 • 541 • 18
  • Atotti/Qwen3-Omni-AudioTransformer

    0.6B • Updated Oct 4 • 1.68k • 32
  • Atotti/google-usm-bf16

    Feature Extraction • 0.7B • Updated Jul 9 • 64 • 1
  • Atotti/Qwen3-Omni-Captioner-AudioTransformer

    0.6B • Updated 20 days ago • 38
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs