SerialKicked/ModelTestingBed


SerialKicked committed on May 24, 2024

Commit 8cf827b (verified) · 1 parent: 03508cf

Update README.md

Files changed (1)

README.md +2 -6


@@ -14,15 +14,11 @@ In the meantime, you can check [this topic](https://huggingface.co/LWDCLS/LLM-Di
 # Testing Environment
-All models are loaded in Q8_0 (GGUF) using KoboldCPP 1.65 for Windows using CUDA 12. Using CuBLAS but not using mmq.
-All layers are on the GPU (NVidia RTX3060 12GB).
 Frontend is staging version of Silly Tavern.
-All models are extended to 16K context length (auto rope from KCPP) with Flash Attention enabled.
-Response size set to 1024 tokens max.
 Fixed Seed for all tests: 123

 # Testing Environment
+All models are loaded in Q8_0 (GGUF) using KoboldCPP 1.65 for Windows using CUDA 12. Using CuBLAS but not using mmq. All layers are on the GPU (NVidia RTX3060 12GB).
 Frontend is staging version of Silly Tavern.
+All models are extended to 16K context length (auto rope from KCPP) with Flash Attention enabled. Response size set to 1024 tokens max.
 Fixed Seed for all tests: 123
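
For anyone trying to reproduce these runs through an API instead of the Silly Tavern frontend, the per-request part of the environment above (16K context, 1024-token responses, seed 123) could be captured roughly as below. This is only a sketch: `EnvSettings` and `build_generate_payload` are hypothetical helpers, and the field names `max_context_length`, `max_length` and `sampler_seed` are assumed to match what the KoboldCPP generate endpoint accepts, so check them against the API docs of your KoboldCPP version. Loader-side options (Q8_0 quant, GPU layers, CuBLAS, Flash Attention) are set when launching KoboldCPP and are not covered here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EnvSettings:
    """Per-request values taken from the Testing Environment section (hypothetical helper)."""
    context_tokens: int = 16 * 1024   # "16K context length"
    max_response_tokens: int = 1024   # "Response size set to 1024 tokens max"
    seed: int = 123                   # "Fixed Seed for all tests: 123"


def build_generate_payload(prompt: str, settings: EnvSettings = EnvSettings()) -> dict:
    """Build a request body for a text-generation call.

    The key names below are an assumption about the backend's generate
    endpoint; they are not taken from the README itself.
    """
    return {
        "prompt": prompt,
        "max_context_length": settings.context_tokens,
        "max_length": settings.max_response_tokens,
        "sampler_seed": settings.seed,
    }


if __name__ == "__main__":
    # Only builds and prints the payload; no network request is made.
    print(build_generate_payload("Hello"))
```

Keeping the values in one frozen dataclass makes it harder to change the seed or response size by accident between test runs.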