Web server demo Int4
#3
by lktinhtemp - opened
Can I replace the code in README.md for the web_server demo so that it uses the int4 model?
Yes. Replace the model initialization in the demo with AutoGPTQForCausalLM.from_quantized, pointing it at openbmb/MiniCPM-o-2_6-int4.
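As a rough sketch of what that swap could look like (not the demo's actual code): the helper names build_load_kwargs and load_int4_model are hypothetical, and the device, dtype, and exllama settings are assumptions that may need adjusting for your GPU and your installed auto_gptq build. The imports are kept inside the loader so the file can be imported on a machine without a GPU.

```python
MODEL_ID = "openbmb/MiniCPM-o-2_6-int4"


def build_load_kwargs(device="cuda:0"):
    """Keyword arguments for from_quantized (assumed values, adjust as needed)."""
    return {
        "device": device,
        "trust_remote_code": True,   # the model ships custom code
        "disable_exllama": True,     # assumption: exllama kernels turned off
        "disable_exllamav2": True,
    }


def load_int4_model(model_id=MODEL_ID, device="cuda:0"):
    """Hypothetical replacement for the demo's model initialization."""
    # Imported lazily: these need torch and auto_gptq installed.
    import torch
    from auto_gptq import AutoGPTQForCausalLM
    from transformers import AutoTokenizer

    model = AutoGPTQForCausalLM.from_quantized(
        model_id,
        torch_dtype=torch.bfloat16,  # assumption: bf16 activations
        **build_load_kwargs(device),
    )
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    return model, tokenizer
```

In the web server script you would then call load_int4_model() where the demo currently builds its model, and keep the rest of the demo code unchanged if it only relies on the returned model and tokenizer.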