runninglsy commited on
Commit
44beb8e
·
verified ·
1 Parent(s): 011ccee

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -2
README.md CHANGED
@@ -38,10 +38,33 @@ We introduce **Ovis2.6-30B-A3B**, the latest advancement in the Ovis series of M
38
 
39
  ## Performance
40
 
41
- ![table](https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/rZUzeRVCLg0GPaMOZj3KZ.png)
42
 
 
43
 
44
- ## Quick Inference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
45
 
46
  Below is a simple example demonstrating how to run Ovis2.6 with a single image input.
47
 
 
38
 
39
  ## Performance
40
 
41
+ The following table presents a detailed performance comparison. Please note that superscripted results are sourced from external technical reports, and Qwen scores represent the highest value between its Think and Instruct versions. For quick reference, the best results are highlighted in red, and the second-best results are underlined. All values are rounded to one decimal place.
42
 
43
+ ![bench](https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/k1TQ0r-qrPRZqvKsv_cdG.png)
44
 
45
+ ## Quick Inference (vLLM)
46
+
47
+ ```bash
48
+ uv pip install -U vllm --torch-backend=auto --extra-index-url https://wheels.vllm.ai/nightly
49
+ vllm serve AIDC-AI/Ovis2.6-30B-A3B --trust-remote-code --tensor-parallel-size 4
50
+ curl http://localhost:8000/v1/chat/completions \
51
+ -H "Content-Type: application/json" \
52
+ -H "Authorization: Bearer EMPTY" \
53
+ -d '{
54
+ "model": "AIDC-AI/Ovis2.6-30B-A3B",
55
+ "messages": [
56
+ {
57
+ "role": "user",
58
+ "content": [
59
+ {"type": "image_url", "image_url": {"url": "https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/TIlymOb86R6_Mez3bpmcB.png"}},
60
+ {"type": "text", "text": "Calculate the sum of the numbers in the middle box in figure (c)."}
61
+ ]
62
+ }
63
+ ]
64
+ }'
65
+ ```
66
+
67
+ ## Quick Inference (transformers)
68
 
69
  Below is a simple example demonstrating how to run Ovis2.6 with a single image input.
70