Is there a official script for inference or finetuning with audio modality?

#50

by JasonLee996 - opened Jul 16, 2025

Jul 16, 2025

As title. Meanwhile, I find it said in README at github repo that the model could be tuned with Align-Anything, but in the funetune.py in offical repo, MiniCPM-o/finetune/finetune.py set "init_audio=False", so will it be OK if I use this script to tuning the audio pathway with this parameter "True"?

DennisHuang648

OpenBMB org Aug 5, 2025

We have supported audio modality fine-tuning for MiniCPM-o on LLaMA-Factory. You can refer to the LLaMA-Factory documentation for processing audio datasets.😊

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment