moonshotai
/

Kimi-Audio-7B-Instruct

audio-language-model

speech-recognition

audio-understanding

audio-generation

Model card Files Files and versions

Resources

View closed (6)

Free studio vocal data for Kimi Audio vocal pipeline benchmarking

#21 opened 3 months ago by

Add Kimi-Audio EOS and pad token ids

#20 opened 5 months ago by

Kaggle code needs update

#19 opened 12 months ago by

Fix incorrect unk_id assignment

#16 opened about 1 year ago by

Request: DOI

#14 opened about 1 year ago by

supported languages?

#12 opened about 1 year ago by

nononameneeded2001

About the weight files of the Whisper Encoder

#11 opened about 1 year ago by

how can I fine tune this for farsi?

#10 opened about 1 year ago by

Cannot Run Model in Hugging Face Spaces: AutoProcessor/Processor Not Found

#9 opened about 1 year ago by

Будет ли поддержка Русского языка?

#8 opened about 1 year ago by

A video on how to set up this in a Colab notebook

#7 opened about 1 year ago by

Vocoder Architecture?

#6 opened about 1 year ago by

Base model?

#4 opened about 1 year ago by

Issue with long audio (~1 min) output, or prompt instruct following

#2 opened about 1 year ago by

Update correct task tag

#1 opened about 1 year ago by