yongqiang committed · Commit d5f9fc5 · Parent: b30c13a
Update readme
README.md CHANGED
---
library_name: transformers
license: bsd-3-clause
base_model:
- OpenGVLab/InternVL3_5-2B
tags:
- InternVL3
- InternVL3_5-2B
- Int8
- VLM
pipeline_tag: image-text-to-text
language:
- en
---

# InternVL3_5-2B

This version of InternVL3_5-2B has been converted to run on the Axera NPU using **w8a16** quantization.

This model has been optimized with the following LoRA:

Compatible with Pulsar2 version: 5.1-patch1.

Please note that the context length of the model is 2k tokens and the maximum prefill length is 1k tokens.

## Convert tools links:

For those interested in model conversion, you can try to export the axmodel through the original repo:

https://huggingface.co/OpenGVLab/InternVL3_5-2B

[How to Convert LLM from Huggingface to axmodel](https://github.com/AXERA-TECH/InternVL3_5-2B.axera/tree/main/model_convert)

[AXera NPU HOST LLM Runtime](https://github.com/AXERA-TECH/ax-llm/tree/ax-internvl)
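Because prefill is capped at 1k tokens while the context window is 2k, a long prompt has to be truncated to the context window and fed in prefill-sized chunks. The repo's runtime handles this internally; the sketch below is a generic illustration of the bookkeeping, and `MAX_PREFILL`/`chunk_for_prefill` are hypothetical names, not part of this repo:

```python
# Illustrative only; limits taken from the model card: 2k context, 1k prefill.
MAX_CONTEXT = 2048
MAX_PREFILL = 1024

def chunk_for_prefill(token_ids, max_prefill=MAX_PREFILL, max_context=MAX_CONTEXT):
    """Split a prompt into prefill-sized chunks, dropping the oldest
    tokens if the prompt alone would overflow the context window."""
    if len(token_ids) > max_context:
        token_ids = token_ids[-max_context:]  # keep the most recent tokens
    return [token_ids[i:i + max_prefill]
            for i in range(0, len(token_ids), max_prefill)]

# A 1500-token prompt becomes one full 1024-token chunk plus a 476-token tail.
chunks = chunk_for_prefill(list(range(1500)))
print([len(c) for c in chunks])  # [1024, 476]
```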
Run the following command on the Axera board to start a chat conversation:

```sh
$ cd InternVL3_5-2B.axera/python
$ python3 infer_axmodel.py --hf_model internvl3-5_tokenizer/ --axmodel_path internvl3-5_axmodel/ --question "请计算函数[y=2x^2+2]的导数, 并提供 markdown 格式的推理过程"
```

The question asks the model to compute the derivative of y = 2x^2 + 2 and present the reasoning in markdown format.
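The expected answer to the prompt above is y' = 4x. As a quick sanity check on that result, independent of the model, a polynomial can be differentiated directly from its coefficients (`poly_derivative` is an illustrative helper, not part of the repo):

```python
def poly_derivative(coeffs):
    """Differentiate a polynomial given as coefficients [c0, c1, c2, ...]
    for c0 + c1*x + c2*x^2 + ..., returning the derivative's coefficients."""
    return [i * c for i, c in enumerate(coeffs)][1:]

# y = 2x^2 + 2  ->  coefficients [2, 0, 2]
print(poly_derivative([2, 0, 2]))  # [0, 4], i.e. y' = 4x
```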
Enter the following command to perform the single-image understanding task:

```sh
$ cd InternVL3_5-2B.axera/python
$ python3 infer_axmodel.py --hf_model internvl3-5_tokenizer/ --axmodel_path internvl3-5_axmodel/ --question "请描述这幅图" -i examples/image_0.jpg --vit_model vit-models/internvl_vit_model_1x3x448x448.axmodel
```

The question "请描述这幅图" asks the model to describe the supplied image.
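The ViT axmodel filename indicates a fixed 1x3x448x448 (NCHW) input, so images are resized to 448x448 and reordered to channel-first layout before encoding. A rough numpy-only sketch of that layout transform; the nearest-neighbor resize and [0, 1] scaling here are assumptions standing in for whatever interpolation and normalization the real pipeline uses:

```python
import numpy as np

def to_nchw_448(img_hwc):
    """Nearest-neighbor resize an HxWx3 uint8 image to 448x448 and
    reorder it to a 1x3x448x448 float32 tensor scaled to [0, 1]."""
    h, w, _ = img_hwc.shape
    ys = np.arange(448) * h // 448          # nearest source row per output row
    xs = np.arange(448) * w // 448          # nearest source column per output column
    resized = img_hwc[ys][:, xs]            # (448, 448, 3)
    chw = resized.transpose(2, 0, 1)        # (3, 448, 448)
    return chw[None].astype(np.float32) / 255.0  # (1, 3, 448, 448)

# e.g. a dummy 600x800 image
tensor = to_nchw_448(np.zeros((600, 800, 3), dtype=np.uint8))
print(tensor.shape)  # (1, 3, 448, 448)
```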