Updated Readme

Files changed (1)

README.md

@@ -30,7 +30,7 @@ This repo contains 4-bit quantized (using ExLlamaV2) model of Meta's meta-llama/
 - Original model: [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
-### About 4 bit quantization using bitsandbytes
+### About 4 bit quantization using ExLlamaV2
 - ExLlamaV2 github repo: [ExLlamaV2 github repo](https://github.com/turboderp/exllamav2)
@@ -39,9 +39,10 @@ This repo contains 4-bit quantized (using ExLlamaV2) model of Meta's meta-llama/
 # How to Get Started with the Model
 Use the code below to get started with the model.
-## How to run from Python code
+I will update how to inference using Python code later.
+## How to run using ExLlamaV2
 #### First install the package
 ```shell