Update README.md
README.md CHANGED
```diff
@@ -4,6 +4,8 @@ license: other
 quantized_by: bartowski
 ---
 
+Update Jan 27: This has been redone with the proper token mappings and rope scaling, performance seems improved, please comment if not
+
 ## Exllama v2 Quantizations of internlm2-chat-7b-llama
 
 Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.12">turboderp's ExLlamaV2 v0.0.12</a> for quantization.
```