--- license: apache-2.0 base_model: AIDC-AI/Ovis2-4B tags: - quantized - awq - vision-language-model - vlm --- # AIDC-AI__Ovis2-4B__awq_w4_complete This is a **AWQ** (4-bit) quantized version of [AIDC-AI/Ovis2-4B](https://huggingface.co/AIDC-AI/Ovis2-4B). ## Quantization Details - **Method**: AWQ - **Bits**: 4 - **Base model**: AIDC-AI/Ovis2-4B - **Group size**: 128 - **Quantized portion**: LLM backbone (Qwen2ForCausalLM inside Ovis2) - **Vision tokenizer + visual embedding**: FP16 (unchanged) - **Loading**: AutoModelForCausalLM.from_pretrained(..., trust_remote_code=True) ## Usage ```python from transformers import AutoProcessor, AutoModelForImageTextToText import torch model = AutoModelForImageTextToText.from_pretrained( "{REPO_ID}", torch_dtype=torch.float16, device_map="auto", trust_remote_code=True, ) processor = AutoProcessor.from_pretrained("{REPO_ID}", trust_remote_code=True) ``` Replace `{REPO_ID}` with the repo ID of this model. ## Original Model See [AIDC-AI/Ovis2-4B](https://huggingface.co/AIDC-AI/Ovis2-4B) for the original FP16 model.