Update README.md
README.md CHANGED

@@ -38,13 +38,13 @@ quantized_by: TheBloke
 <!-- header end -->
 
 # Athena V2 - AWQ
-- Model creator: [IkariDev](https://huggingface.co/IkariDev)
+- Model creator: [IkariDev and Undi95](https://huggingface.co/IkariDev)
 - Original model: [Athena V2](https://huggingface.co/IkariDev/Athena-v2)
 
 <!-- description start -->
 ## Description
 
-This repo contains AWQ model files for [IkariDev's Athena V2](https://huggingface.co/IkariDev/Athena-v2).
+This repo contains AWQ model files for [IkariDev and Undi95's Athena V2](https://huggingface.co/IkariDev/Athena-v2).
 
 
 ### About AWQ
@@ -59,7 +59,7 @@ It is also now supported by continuous batching server [vLLM](https://github.com
 * [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/Athena-v2-AWQ)
 * [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/Athena-v2-GPTQ)
 * [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/Athena-v2-GGUF)
-* [IkariDev's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/IkariDev/Athena-v2)
+* [IkariDev and Undi95's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/IkariDev/Athena-v2)
 <!-- repositories-available end -->
 
 <!-- prompt-template start -->
@@ -258,7 +258,7 @@ And thank you again to a16z for their generous grant.
 
 <!-- footer end -->
 
-# Original model card: IkariDev's Athena V2
+# Original model card: IkariDev and Undi95's Athena V2
 
 
 