Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

espnet
/
DCASE23.AudioCaptioning.PreTrained

Automatic Speech Recognition
ESPnet
English
audio
audio_captioning
Model card Files Files and versions
xet
Community
DCASE23.AudioCaptioning.PreTrained / exp /asr_pt /images
416 kB
  • 1 contributor
History: 1 commit
shikhar7ssu's picture
shikhar7ssu
Upload 19 files
f8585bb verified about 1 year ago
  • acc.png
    30.9 kB
    Upload 19 files about 1 year ago
  • backward_time.png
    32 kB
    Upload 19 files about 1 year ago
  • cer.png
    30.8 kB
    Upload 19 files about 1 year ago
  • clip.png
    14.4 kB
    Upload 19 files about 1 year ago
  • forward_time.png
    31.4 kB
    Upload 19 files about 1 year ago
  • gpu_max_cached_mem_GB.png
    20.8 kB
    Upload 19 files about 1 year ago
  • grad_norm.png
    27.9 kB
    Upload 19 files about 1 year ago
  • iter_time.png
    28.3 kB
    Upload 19 files about 1 year ago
  • loss.png
    29.7 kB
    Upload 19 files about 1 year ago
  • loss_att.png
    30.6 kB
    Upload 19 files about 1 year ago
  • loss_scale.png
    17.1 kB
    Upload 19 files about 1 year ago
  • optim0_lr0.png
    33.6 kB
    Upload 19 files about 1 year ago
  • optim_step_time.png
    30.3 kB
    Upload 19 files about 1 year ago
  • train_time.png
    30.5 kB
    Upload 19 files about 1 year ago
  • wer.png
    28 kB
    Upload 19 files about 1 year ago