Update README.md
README.md CHANGED
```diff
@@ -1,19 +1,3 @@
-# Quantization
-Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
-
-Base Model: [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
-
-Quantized using [gptqmodel==2.1.0](https://github.com/ModelCloud/GPTQModel)
-
-Steps to create:
-1. `git clone https://github.com/LambdaLabsML/lambda-quant`
-2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
-3. `python quantize.py -m meta-llama/Llama-3.1-8B-Instruct -q GPTQ-Int4`
-## Evaluation
-TODO
-## Benchmarks
-TODO
-# Base Model README.md
 ---
 language:
 - en
@@ -204,6 +188,26 @@ extra_gated_description: The information you provide will be collected, stored,
 and shared in accordance with the [Meta Privacy Policy](https://www.facebook.com/privacy/policy/).
 extra_gated_button_content: Submit
 ---
+# Quantization
+Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
+
+Base Model: [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
+
+Quantized using [gptqmodel==2.1.0](https://github.com/ModelCloud/GPTQModel)
+
+Steps to create:
+1. `git clone https://github.com/LambdaLabsML/lambda-quant`
+2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
+3. `python quantize.py -m meta-llama/Llama-3.1-8B-Instruct -q GPTQ-Int4`
+
+## Evaluation
+TODO
+
+## Benchmarks
+TODO
+
+# Base Model README.md
+
 
 ## Model Information
 
```
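For context on what the `GPTQ-Int4` target in step 3 produces: GPTQ-style checkpoints store weights as 4-bit integers plus a float scale per group of weights. The sketch below is a toy round-to-nearest illustration of symmetric 4-bit group quantization only; the real GPTQ algorithm used by gptqmodel additionally compensates quantization error layer by layer, and the `group_size=128` default here is just a common convention, not read from lambda-quant's code.

```python
def quantize_int4(weights, group_size=128):
    """Quantize a flat list of floats to int4 codes, one scale per group.

    Toy round-to-nearest sketch of the storage format; NOT the actual
    error-compensating GPTQ algorithm.
    """
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Symmetric int4 range is [-8, 7]; map the group's max magnitude to 7.
        scale = max(abs(w) for w in group) / 7 or 1.0  # avoid scale 0 for all-zero groups
        scales.append(scale)
        codes.append([max(-8, min(7, round(w / scale))) for w in group])
    return codes, scales


def dequantize_int4(codes, scales):
    """Reconstruct approximate float weights from int4 codes and per-group scales."""
    return [c * s for group, s in zip(codes, scales) for c in group]
```

At 4 bits plus a shared scale per 128-weight group, this storage layout is what shrinks an 8B-parameter model's weights to roughly a quarter of their fp16 size, at the cost of the small reconstruction error visible when round-tripping through `dequantize_int4`.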