Update README.md
README.md CHANGED

@@ -133,7 +133,7 @@ processor.save_pretrained(SAVE_DIR)
 
 ## Evaluation
 
-The model was evaluated on the OpenLLMv1 leaderboard task, using [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness), on reasoning tasks using [lighteval](https://github.com/huggingface/lighteval).
+The model was evaluated on the OpenLLMv1 leaderboard tasks using [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness), on reasoning tasks using [lighteval](https://github.com/huggingface/lighteval), and on vision tasks using [lmms-eval](https://github.com/EvolvingLMMs-Lab/lmms-eval).
 [vLLM](https://docs.vllm.ai/en/stable/) was used for all evaluations.
 
 <details>
@@ -173,6 +173,15 @@ The model was evaluated on the OpenLLMv1 leaderboard task, using [lm-evaluation-
   --tasks lighteval|aime25|0 \
 ```
 
+**lmms-eval**
+```
+python3 -m lmms_eval \
+  --model vllm \
+  --model_args model=RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic,tensor_parallel_size=4,max_model_len=8192,gpu_memory_utilization=0.9 \
+  --tasks mmmu_val,chartqa \
+  --batch_size 1
+```
+
 </details>
 
 ### Accuracy
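The `--tasks lighteval|aime25|0` argument in the hunk above uses lighteval's pipe-separated task-spec string. As a minimal sketch of how such a spec breaks down, assuming the three fields are suite name, task name, and few-shot count (an illustrative reading, not stated in this diff):

```python
# Illustrative parser for a lighteval-style task spec such as "lighteval|aime25|0".
# The field meanings (suite, task, few-shot count) are an assumption for
# illustration, not taken from this README.
def parse_task_spec(spec: str) -> dict:
    suite, task, few_shot = spec.split("|")
    return {"suite": suite, "task": task, "few_shot": int(few_shot)}

print(parse_task_spec("lighteval|aime25|0"))
# {'suite': 'lighteval', 'task': 'aime25', 'few_shot': 0}
```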