RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV



robgreenberg3 committed on Sep 15, 2025

Commit 6ab84bb · verified · 1 Parent(s): 4b1f72a

Update README.md

Files changed (1)

README.md +3 -2


@@ -2,6 +2,8 @@
 tags:
 - fp8
 - vllm
+base_model:
+- meta-llama/Meta-Llama-3-8B-Instruct
 ---
 # Meta-Llama-3-8B-Instruct-FP8-KV
@@ -52,5 +54,4 @@ model.save_quantized(quantized_model_dir)
 ### Open LLM Leaderboard evaluation scores
 |                      | Meta-Llama-3-8B-Instruct | Meta-Llama-3-8B-Instruct-FP8 | Meta-Llama-3-8B-Instruct-FP8-KV<br>(this model) |
 | :------------------: | :----------------------: | :--------------------------: | :---------------------------------------------: |
-| gsm8k<br>5-shot      | 75.44                    | 74.37                        | 74.98                                           |
+| gsm8k<br>5-shot      | 75.44                    | 74.37                        | 74.98                                           |