Update README.md
|
# LLAMA-GGML-v2

This is a repo for LLaMA models quantised down to 4bit in the latest [llama.cpp](https://github.com/ggerganov/llama.cpp) GGML v2 format.

## THE FILES REQUIRE THE LATEST LLAMA.CPP (May 12th 2023 - commit b9fd7ee)!

llama.cpp recently made a breaking change to its quantisation methods.

I have quantised the GGML files in this repo with the latest version.

Therefore you will require llama.cpp compiled on May 12th or later (commit `b9fd7ee` or later) to use them.
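As a sketch of what that means in practice (the model filename below is illustrative, not necessarily a file in this repo), you would check out llama.cpp at or after that commit, build it, and point it at one of the quantised files:

```shell
# Build llama.cpp at or after commit b9fd7ee (May 12th 2023)
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
git checkout b9fd7ee      # or any later commit
make

# Run inference on a quantised GGML v2 file (filename is illustrative)
./main -m ./models/llama-7b.ggmlv2.q4_0.bin -p "Hello, my name is" -n 128
```

An older llama.cpp build will fail to load these files, because the GGML v2 quantisation format is not backwards compatible.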
## How to run in `text-generation-webui`