Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

augmem
/
AIST-95M

Feature Extraction
PyTorch
English
multimodal
embedding
trimodal
dual-audio
retrieval
cross-modal
image-text-audio
Model card Files Files and versions
xet
Community
AIST-95M
382 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
gcoderw's picture
gcoderw
Publish AIST-95M
789accf verified 13 days ago
  • .gitattributes
    93 Bytes
    Publish AIST-95M 13 days ago
  • AIST-95M.safetensors
    382 MB
    xet
    Publish AIST-95M 13 days ago
  • README.md
    4.3 kB
    Publish AIST-95M 13 days ago
  • parameter_breakdown.json
    1.09 kB
    Publish AIST-95M 13 days ago
  • te_mn20_whisper_d2_validaudio.yaml
    1.11 kB
    Publish AIST-95M 13 days ago
  • teacher_dual_mn20whisper_exact_gate_baseline_20260424T155324Z.json
    6.38 kB
    Publish AIST-95M 13 days ago