turboderp commited on
Commit
c2ec1cb
·
verified ·
1 Parent(s): fb99c38

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -0
README.md ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model: karpathy/nanochat-d34
4
+ base_model_relation: quantized
5
+ quantized_by: turboderp
6
+ tags:
7
+ - exl3
8
+ ---
9
+
10
+ EXL3 quants of [nanochat-d34](https://huggingface.co/karpathy/nanochat-d34)
11
+
12
+ ⚠️ Requires ExLlamaV3 v0.0.19 (or v0.0.18 `dev` branch)
13
+
14
+ Base bitrates:
15
+
16
+ [2.00 bits per weight](https://huggingface.co/turboderp/nanochat-d34-exl3/tree/2.00bpw)
17
+ [3.00 bits per weight](https://huggingface.co/turboderp/nanochat-d34-exl3/tree/3.00bpw)
18
+ [4.00 bits per weight](https://huggingface.co/turboderp/nanochat-d34-exl3/tree/4.00bpw)
19
+ [5.00 bits per weight](https://huggingface.co/turboderp/nanochat-d34-exl3/tree/5.00bpw)
20
+ [6.00 bits per weight](https://huggingface.co/turboderp/nanochat-d34-exl3/tree/6.00bpw)