clowman
/

Llama-3.1-8B-Instruct-Dynamic-F8

Text Generation

compressed-tensors

Model card Files Files and versions

clowman commited on Apr 2, 2025

Commit

a3d10ba

·

verified ·

1 Parent(s): b5b869c

Update README.md

Files changed (1) hide show

README.md +19 -16

README.md CHANGED Viewed

@@ -1,19 +1,3 @@
-# Quantization
-Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov  6 2024, 20:22:13) [GCC 11.4.0]`
-Base Model: [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
-Quantized using [llmcompressor==0.4.1](https://github.com/vllm-project/llm-compressor)
-Steps to create:
-1. `git clone https://github.com/LambdaLabsML/lambda-quant`
-2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
-3. `python quantize.py -m meta-llama/Llama-3.1-8B-Instruct -q Dynamic-F8`
-## Evaluation
-TODO
-## Benchmarks
-TODO
-# Base Model README.md
 ---
 language:
 - en
@@ -204,6 +188,25 @@ extra_gated_description: The information you provide will be collected, stored,
   and shared in accordance with the [Meta Privacy Policy](https://www.facebook.com/privacy/policy/).
 extra_gated_button_content: Submit
 ---
 ## Model Information

 ---
 language:
 - en
   and shared in accordance with the [Meta Privacy Policy](https://www.facebook.com/privacy/policy/).
 extra_gated_button_content: Submit
 ---
+# Quantization
+Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov  6 2024, 20:22:13) [GCC 11.4.0]`
+Base Model: [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
+Quantized using [llmcompressor==0.4.1](https://github.com/vllm-project/llm-compressor)
+Steps to create:
+1. `git clone https://github.com/LambdaLabsML/lambda-quant`
+2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
+3. `python quantize.py -m meta-llama/Llama-3.1-8B-Instruct -q Dynamic-F8`
+## Evaluation
+TODO
+## Benchmarks
+TODO
+# Base Model README.md
 ## Model Information