Initial release: sound-effect captioning Whisper (stage-2 best, val loss 1.494) 8440d36 verified ChristophSchuhmann commited on about 23 hours ago