Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
openbmb
/
MiniCPM-o-2_6
like
1.28k
Follow
OpenBMB
2.42k
Any-to-Any
Transformers
Safetensors
openbmb/RLAIF-V-Dataset
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
arxiv:
2405.17220
arxiv:
2408.01800
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
57
Deploy
Use this model
refs/pr/35
MiniCPM-o-2_6
/
assets
/
input_examples
5.67 MB
14 contributors
History:
3 commits
bokesyo
Upload assistant_female_voice.wav
a323bd6
verified
11 months ago
Trump_WEF_2018_10s.mp3
161 kB
update readme, add audio input examples
about 1 year ago
assistant_default_female_voice.wav
224 kB
xet
add usage samples to readme
about 1 year ago
assistant_female_voice.wav
235 kB
xet
Upload assistant_female_voice.wav
11 months ago
assistant_male_voice.wav
144 kB
xet
add usage samples to readme
about 1 year ago
audio_understanding.mp3
321 kB
update readme, add audio input examples
about 1 year ago
chi-english-1.wav
492 kB
update readme, add audio input examples
about 1 year ago
cxk_original.wav
384 kB
update readme, add audio input examples
about 1 year ago
exciting-emotion.wav
696 kB
update readme, add audio input examples
about 1 year ago
fast-pace.wav
986 kB
update readme, add audio input examples
about 1 year ago
icl_20.wav
619 kB
xet
add usage samples to readme
about 1 year ago
indian-accent.wav
1.41 MB
xet
update readme, add audio input examples
about 1 year ago