Instructions to use espnet/DCASE23.AudioCaptioning.FineTuned with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- ESPnet
How to use espnet/DCASE23.AudioCaptioning.FineTuned with ESPnet:
unknown model type (must be text-to-speech or automatic-speech-recognition)
- Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -2,9 +2,11 @@
|
|
| 2 |
tags:
|
| 3 |
- espnet
|
| 4 |
- audio
|
| 5 |
-
-
|
| 6 |
language: en
|
| 7 |
datasets:
|
| 8 |
- clotho_v2
|
|
|
|
|
|
|
| 9 |
license: cc-by-4.0
|
| 10 |
---
|
|
|
|
| 2 |
tags:
|
| 3 |
- espnet
|
| 4 |
- audio
|
| 5 |
+
- audio_captioning
|
| 6 |
language: en
|
| 7 |
datasets:
|
| 8 |
- clotho_v2
|
| 9 |
+
- slseanwu/clotho-chatgpt-mixup-50K
|
| 10 |
+
- audiocaps
|
| 11 |
license: cc-by-4.0
|
| 12 |
---
|