Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
espnet
/
DCASE23.AudioCaptioning.FineTuned
like
0
Follow
ESPnet
340
ESPnet
clotho_v2
slseanwu/clotho-chatgpt-mixup-50K
audiocaps
English
audio
audio_captioning
License:
cc-by-4.0
Model card
Files
Files and versions
xet
Community
Use this model
main
DCASE23.AudioCaptioning.FineTuned
/
exp
/
asr_ft
/
images
415 kB
1 contributor
History:
1 commit
shikhar7ssu
Upload 19 files
e7d1aab
verified
over 1 year ago
acc.png
Safe
29.2 kB
Upload 19 files
over 1 year ago
backward_time.png
Safe
34.2 kB
Upload 19 files
over 1 year ago
cer.png
Safe
28.3 kB
Upload 19 files
over 1 year ago
clip.png
Safe
13.9 kB
Upload 19 files
over 1 year ago
forward_time.png
Safe
34.9 kB
Upload 19 files
over 1 year ago
gpu_max_cached_mem_GB.png
Safe
21.1 kB
Upload 19 files
over 1 year ago
grad_norm.png
Safe
24.4 kB
Upload 19 files
over 1 year ago
iter_time.png
Safe
34.3 kB
Upload 19 files
over 1 year ago
loss.png
Safe
32.9 kB
Upload 19 files
over 1 year ago
loss_att.png
Safe
33.8 kB
Upload 19 files
over 1 year ago
loss_scale.png
Safe
16.6 kB
Upload 19 files
over 1 year ago
optim0_lr0.png
Safe
28.5 kB
Upload 19 files
over 1 year ago
optim_step_time.png
Safe
29.3 kB
Upload 19 files
over 1 year ago
train_time.png
Safe
29.4 kB
Upload 19 files
over 1 year ago
wer.png
Safe
24 kB
Upload 19 files
over 1 year ago