RedHatAI
/

Pixtral-Large-Instruct-2411-hf-FP8-dynamic

Image-Text-to-Text

compressed-tensors

Model card Files Files and versions

shubhrapandit commited on Feb 26, 2025

Commit

d618b87

·

verified ·

1 Parent(s): a70c24d

Update README.md

Files changed (1) hide show

README.md +54 -1

README.md CHANGED Viewed

@@ -532,7 +532,60 @@ outputs = llm.chat(messages, sampling_params=sampling_params)
 print(outputs[0].outputs[0].text)
 ```
-## Accuracy
 <table>
   <thead>
     <tr>

 print(outputs[0].outputs[0].text)
 ```
+## Evaluation
+The model was evaluated using [mistral-evals](https://github.com/neuralmagic/mistral-evals) for vision-related tasks and using [lm_evaluation_harness](https://github.com/neuralmagic/lm-evaluation-harness) for select text-based benchmarks. The evaluations were conducted using the following commands:
+<details>
+<summary>Evaluation Commands</summary>
+### Vision Tasks
+- vqav2
+- docvqa
+- mathvista
+- mmmu
+- chartqa
+```
+vllm serve neuralmagic/pixtral-12b-quantized.w8a8 --tensor_parallel_size 1 --max_model_len 25000 --trust_remote_code --max_num_seqs 8 --gpu_memory_utilization 0.9 --dtype float16 --limit_mm_per_prompt image=7
+python -m eval.run eval_vllm \
+        --model_name neuralmagic/pixtral-12b-quantized.w8a8 \
+        --url http://0.0.0.0:8000 \
+        --output_dir ~/tmp \
+        --eval_name <vision_task_name>
+```
+### Text-based Tasks
+#### MMLU
+```
+lm_eval \
+  --model vllm \
+  --model_args pretrained="<model_name>",dtype=auto,add_bos_token=True,max_model_len=4096,tensor_parallel_size=<n>,gpu_memory_utilization=0.8,enable_chunked_prefill=True,trust_remote_code=True \
+  --tasks mmlu \
+  --num_fewshot 5 \
+  --batch_size auto \
+  --output_path output_dir
+```
+#### MGSM
+```
+lm_eval \
+  --model vllm \
+  --model_args pretrained="<model_name>",dtype=auto,max_model_len=4096,max_gen_toks=2048,max_num_seqs=128,tensor_parallel_size=<n>,gpu_memory_utilization=0.9 \
+  --tasks mgsm_cot_native \
+  --num_fewshot 0 \
+  --batch_size auto \
+  --output_path output_dir
+```
+</details>
+### Accuracy
 <table>
   <thead>
     <tr>