huggingkot committed
Commit 7ae3be7 · 1 Parent(s): 508a978
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ *.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,17 @@
+
+ ---
+ base_model:
+ - Seeker38/gemma-2-9b-it-abc-notation
+ ---
+
+ These are the weights of the [gemma-2-9b-it-abc-notation](https://huggingface.co/Seeker38/gemma-2-9b-it-abc-notation) model, converted to the [Unsloth 4-bit dynamic quant](https://archive.is/EFz7P) format using this [Colab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
+
+ ## About this Conversion
+
+ This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
+
+ ### How 4-bit Quantization Works
+ - The actual **4-bit quantization** is performed by **bitsandbytes (bnb)**, a library that integrates with **PyTorch**.
+ - **Unsloth** acts as a wrapper that simplifies and optimizes the process.
+
+ This reduces memory usage and can speed up inference, while keeping the model compact.
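To make the idea of 4-bit storage in the README above concrete, here is a minimal pure-Python sketch of blockwise absmax quantization. It is a simplified illustration only: it is not the NF4 format that bitsandbytes uses, nor Unsloth's dynamic quant, and the names `quantize_4bit`, `dequantize_4bit`, and `block_size` are hypothetical.

```python
from typing import List, Tuple


def quantize_4bit(values: List[float], block_size: int = 64) -> Tuple[bytes, List[float]]:
    """Quantize floats to signed 4-bit codes in [-7, 7], one scale per block."""
    codes: List[int] = []
    scales: List[float] = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        absmax = max(abs(v) for v in block)
        # Map the largest magnitude in the block to code 7; avoid dividing by zero.
        scale = absmax / 7 if absmax > 0 else 1.0
        scales.append(scale)
        for v in block:
            q = max(-7, min(7, round(v / scale)))
            codes.append(q + 8)  # shift to an unsigned nibble in 1..15
    if len(codes) % 2:
        codes.append(8)  # pad with the code for zero so nibbles pair up
    # Pack two 4-bit codes into each byte.
    packed = bytes((codes[i] << 4) | codes[i + 1] for i in range(0, len(codes), 2))
    return packed, scales


def dequantize_4bit(packed: bytes, scales: List[float], count: int,
                    block_size: int = 64) -> List[float]:
    """Unpack nibbles and rescale them back to approximate floats."""
    out: List[float] = []
    for byte in packed:
        for nibble in (byte >> 4, byte & 0x0F):
            if len(out) == count:
                break  # skip the padding nibble, if any
            scale = scales[len(out) // block_size]
            out.append((nibble - 8) * scale)
    return out


weights = [0.5, -1.0, 0.25, 0.0, 0.75]
packed, scales = quantize_4bit(weights, block_size=4)
restored = dequantize_4bit(packed, scales, len(weights), block_size=4)
print(len(packed), "bytes for", len(weights), "weights")
```

Each weight costs half a byte plus a share of its block's scale, and the round trip is lossy: every restored value is within about half a quantization step of the original.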
config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cb932335e68f3b6579d066e3767ae19d1b4ba010efb85fb39955d276144bf435
+ size 1294
generation_config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9afda47f28cc959f49d3cfc2ecd9ff893f108a13cdeaa585dc0339cf4ded2d59
+ size 190
model-00001-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:790f72db4ce9e595eaada3845c26aa0889a5140b5324d271bab0454efaf0ef78
+ size 4979101152
model-00002-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7477003e9d5e40054441cbc131ef7fd0c9688acf796ba1656e1ff2bbd03de67c
+ size 1151605126
model.safetensors.index.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ce6c1bf5d8b4c206013efb736d592cc98afdc291285293b45988909a8db148b6
+ size 181353
special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:baec30ea10906f16adb8c18af7a34023002c1746542612b8b41c9f09e1351351
+ size 636
tokenizer.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5f7eee611703c5ce5d1eee32d9cdcfe465647b8aff0c1dfb3bed7ad7dbb05060
+ size 34362873
tokenizer.model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:61a7b147390c64585d6c3543dd6fc636906c9af3865a5548f27f31aee1d4c8e2
+ size 4241003
tokenizer_config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:89d63bf32272ecb82d2bbcf8f90bbf1ebc2b7aa4fedfc1dfee099541628726fe
+ size 47025
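
Every file added in this commit is stored as a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. The sketch below parses the two weight-shard pointers from this diff and sums their sizes. The helper name `parse_lfs_pointer` and the `SHARD_POINTERS` dictionary are hypothetical names introduced for illustration.

```python
from typing import Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer file into a dict."""
    fields: Dict[str, str] = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the two safetensors shards in this commit.
SHARD_POINTERS = {
    "model-00001-of-00002.safetensors": (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:790f72db4ce9e595eaada3845c26aa0889a5140b5324d271bab0454efaf0ef78\n"
        "size 4979101152\n"
    ),
    "model-00002-of-00002.safetensors": (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:7477003e9d5e40054441cbc131ef7fd0c9688acf796ba1656e1ff2bbd03de67c\n"
        "size 1151605126\n"
    ),
}

total_bytes = sum(int(parse_lfs_pointer(p)["size"]) for p in SHARD_POINTERS.values())
print(f"{total_bytes} bytes = {total_bytes / 1e9:.2f} GB")
```

The two shards add up to 6130706278 bytes, or about 6.13 GB of weights. The small JSON files appear as pointers too because this commit adds the `*.json` rule to `.gitattributes`.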