Kearm committed on
Commit ed522a2 · verified · 1 Parent(s): ecd9ca2

Initial upload: GLM-4.7-Flash MMFP4 quantization (GPTQ + actorder)

This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. .gitattributes +1 -0
  2. README.md +182 -0
  3. config.json +45 -0
  4. generation_config.json +11 -0
  5. model-00001-of-00048.safetensors +3 -0
  6. model-00002-of-00048.safetensors +3 -0
  7. model-00003-of-00048.safetensors +3 -0
  8. model-00004-of-00048.safetensors +3 -0
  9. model-00005-of-00048.safetensors +3 -0
  10. model-00006-of-00048.safetensors +3 -0
  11. model-00007-of-00048.safetensors +3 -0
  12. model-00008-of-00048.safetensors +3 -0
  13. model-00009-of-00048.safetensors +3 -0
  14. model-00010-of-00048.safetensors +3 -0
  15. model-00011-of-00048.safetensors +3 -0
  16. model-00012-of-00048.safetensors +3 -0
  17. model-00013-of-00048.safetensors +3 -0
  18. model-00014-of-00048.safetensors +3 -0
  19. model-00015-of-00048.safetensors +3 -0
  20. model-00016-of-00048.safetensors +3 -0
  21. model-00017-of-00048.safetensors +3 -0
  22. model-00018-of-00048.safetensors +3 -0
  23. model-00019-of-00048.safetensors +3 -0
  24. model-00020-of-00048.safetensors +3 -0
  25. model-00021-of-00048.safetensors +3 -0
  26. model-00022-of-00048.safetensors +3 -0
  27. model-00023-of-00048.safetensors +3 -0
  28. model-00024-of-00048.safetensors +3 -0
  29. model-00025-of-00048.safetensors +3 -0
  30. model-00026-of-00048.safetensors +3 -0
  31. model-00027-of-00048.safetensors +3 -0
  32. model-00028-of-00048.safetensors +3 -0
  33. model-00029-of-00048.safetensors +3 -0
  34. model-00030-of-00048.safetensors +3 -0
  35. model-00031-of-00048.safetensors +3 -0
  36. model-00032-of-00048.safetensors +3 -0
  37. model-00033-of-00048.safetensors +3 -0
  38. model-00034-of-00048.safetensors +3 -0
  39. model-00035-of-00048.safetensors +3 -0
  40. model-00036-of-00048.safetensors +3 -0
  41. model-00037-of-00048.safetensors +3 -0
  42. model-00038-of-00048.safetensors +3 -0
  43. model-00039-of-00048.safetensors +3 -0
  44. model-00040-of-00048.safetensors +3 -0
  45. model-00041-of-00048.safetensors +3 -0
  46. model-00042-of-00048.safetensors +3 -0
  47. model-00043-of-00048.safetensors +3 -0
  48. model-00044-of-00048.safetensors +3 -0
  49. model-00045-of-00048.safetensors +3 -0
  50. model-00046-of-00048.safetensors +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,182 @@
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ - zh
6
+ base_model: zai-org/GLM-4.7-Flash
7
+ pipeline_tag: text-generation
8
+ tags:
9
+ - quantized
10
+ - Mixture of Experts
11
+ - 4-bit
12
+ - GPTQ
13
+ - MMFP4
14
+ - glm
15
+ - metal-marlin
16
+ - moe
17
+ library_name: transformers
18
+ arxiv: "2508.06471"
19
+ ---
20
+
21
+ # GLM-4.7-Flash-Marlin-MMFP4
22
+
23
+ ![](https://raw.githubusercontent.com/zai-org/GLM-4.5/refs/heads/main/resources/logo.svg)
24
+
25
+ **MMFP4-quantized GLM-4.7-Flash** — a 30B-A3B MoE model compressed to **4 bits per weight** using GPTQ with actorder and Metal Marlin's E2M1 FP4 format.
26
+
27
+ | Metric | Value |
28
+ |--------|-------|
29
+ | **Effective bits** | 4.0 bpw nominal (FP16 scales at group size 128 add 16/128 = 0.125 bpw) |
29
+ | **Compression** | ~4× vs FP16 |
31
+ | **Model size** | ~16 GB (vs ~60 GB FP16) |
32
+ | **Parameters** | 29.3B |
33
+ | **Format** | HuggingFace sharded safetensors |
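The figures in the table can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it assumes every one of the 29.3B parameters is stored at the same bit width, which overstates the savings slightly because the README keeps embeddings, the LM head, norms, and router weights in FP16. The helper names are hypothetical.

```python
def effective_bits_per_weight(weight_bits: int, scale_bits: int, group_size: int) -> float:
    """Bits per weight once the per-group scale is amortised over its group."""
    return weight_bits + scale_bits / group_size


def size_gb(num_params: float, bits_per_weight: float) -> float:
    """Storage in decimal gigabytes for num_params weights at the given width."""
    return num_params * bits_per_weight / 8 / 1e9


NUM_PARAMS = 29.3e9  # parameter count from the table above

# 4-bit weights plus one FP16 scale per group of 128 weights.
bpw = effective_bits_per_weight(weight_bits=4, scale_bits=16, group_size=128)

print(f"effective bits per weight: {bpw}")
print(f"FP16 size: {size_gb(NUM_PARAMS, 16):.1f} GB")
print(f"FP4 size:  {size_gb(NUM_PARAMS, bpw):.1f} GB (upper bound on savings)")
```

Under these assumptions the quantized size comes out slightly below the ~16 GB quoted, which is consistent with some tensors staying in FP16.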
34
+
35
+ ## Model Description
36
+
37
+ This is a quantized version of [zai-org/GLM-4.7-Flash](https://huggingface.co/zai-org/GLM-4.7-Flash), which Z.AI describes as the strongest model in the 30B class, balancing performance and efficiency.
38
+
39
+ GLM-4.7-Flash features:
40
+
41
+ - **30B-A3B MoE architecture** (64 routed experts + 1 shared expert, 4 routed experts active per token)
42
+ - **Multi-head Latent Attention (MLA)** for 8× KV cache compression
43
+ - **State-of-the-art reasoning** (91.6% on AIME 2025, 59.2% on SWE-bench Verified)
44
+ - **Bilingual** (English + Chinese)
45
+
46
+ ## Quantization Details
47
+
48
+ Quantized using **MR-GPTQ** (Metal Marlin GPTQ) with CUDA acceleration:
49
+
50
+ ### Method
51
+
52
+ - **Format**: MMFP4 (E2M1 FP4) — Metal Marlin's native FP4 format
53
+ - **Quantization**: GPTQ with actorder (activation-order column permutation)
54
+ - **Hessian calibration**: Pre-computed Hessians for attention layers
55
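+ - **Expert quantization**: Identity Hessian with actorder (no calibration data for MoE experts). With an identity Hessian the GPTQ update propagates no error between columns, so this step is equivalent to per-group round-to-nearest.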
56
+ - **Group size**: 128
57
+ - **Hardware**: NVIDIA RTX 3090 Ti (CUDA-accelerated Cholesky factorization)
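As a rough illustration of the actorder step listed above, the sketch below orders weight columns by descending Hessian diagonal and shows how to undo the permutation afterwards. It is a minimal pure-Python sketch, not the Metal Marlin implementation: the function names are hypothetical, and treating the input dimension as matrix columns is an assumption.

```python
def actorder_permutation(hessian_diag):
    """Column indices sorted by descending Hessian diagonal (stable for ties)."""
    return sorted(range(len(hessian_diag)), key=lambda i: -hessian_diag[i])


def invert_permutation(perm):
    """Permutation that restores the original column order."""
    inverse = [0] * len(perm)
    for new_pos, old_pos in enumerate(perm):
        inverse[old_pos] = new_pos
    return inverse


def permute_columns(rows, perm):
    """Reorder the columns of a row-major matrix."""
    return [[row[j] for j in perm] for row in rows]


# Toy example: 2 output rows, 4 input columns.
diag = [0.5, 3.0, 1.0, 2.0]        # stand-in for diag(H) from calibration
W = [[10, 11, 12, 13], [20, 21, 22, 23]]

perm = actorder_permutation(diag)  # most "important" columns first
W_perm = permute_columns(W, perm)  # quantize in this order
restored = permute_columns(W_perm, invert_permutation(perm))

print(perm)
print(restored == W)
```

One consequence visible in this sketch: when every diagonal entry is equal, as with an identity Hessian, the stable sort leaves the column order unchanged.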
58
+
59
+ ### Quantization Statistics
60
+
61
+ | Component | Bit Width | Notes |
62
+ |-----------|-----------|-------|
63
+ | Embeddings | FP16 | Full precision |
64
+ | LM Head | FP16 | Full precision |
65
+ | Attention (q/k/v/o) | 4-bit | GPTQ with Hessians |
66
+ | MoE Experts (64×) | 4-bit | GPTQ with actorder |
67
+ | Layer Norms | FP16 | Full precision |
68
+ | Router Weights | FP16 | Full precision |
69
+
70
+ - **Total tensors**: 19,066
71
+ - **Shards**: 48 safetensors files
72
+ - **Quantization time**: ~20 minutes (RTX 3090 Ti)
73
+
74
+ ## Files
75
+
76
+ ```
77
+ GLM-4.7-Flash-Marlin-MMFP4/
78
+ ├── model-00001-of-00048.safetensors # Layer 0 (embeddings)
79
+ ├── model-00002-of-00048.safetensors # Layer 1
80
+ ├── ...
81
+ ├── model-00048-of-00048.safetensors # Layer 47 + lm_head
82
+ ├── model.safetensors.index.json # Weight map
83
+ ├── config.json # Model config
84
+ ├── generation_config.json
85
+ ├── tokenizer.json # Tokenizer
86
+ └── tokenizer_config.json
87
+ ```
88
+
89
+ ## Usage
90
+
91
+ ### With Metal Marlin (Apple Silicon)
92
+
93
+ ```python
94
+ from metal_marlin import MarlinForCausalLM
95
+ from transformers import AutoTokenizer
96
+
97
+ model = MarlinForCausalLM.from_pretrained(
98
+ "RESMP-DEV/GLM-4.7-Flash-Marlin-MMFP4",
99
+ device="mps"
100
+ )
101
+ tokenizer = AutoTokenizer.from_pretrained("zai-org/GLM-4.7-Flash")
102
+
103
+ prompt = "<|user|>\nExplain quantum computing in simple terms.\n<|assistant|>\n"
104
+ input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("mps")
105
+ output = model.generate(input_ids, max_new_tokens=256, temperature=0.7)
106
+ print(tokenizer.decode(output[0], skip_special_tokens=True))
107
+ ```
108
+
109
+ ### Tensor Format
110
+
111
+ Each quantized weight tensor has corresponding scale factors:
112
+
113
+ - `{name}.weight`: Packed FP4 weights (uint8)
114
+ - `{name}.scales`: FP16 per-group scales (group_size=128)
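To make the tensor layout concrete, here is a minimal sketch of dequantizing packed E2M1 values with per-group scales. The E2M1 value table (1 sign bit, 2 exponent bits, 1 mantissa bit) is standard; the nibble order within each byte (low nibble first) and the flat, row-wise grouping are assumptions, since the README does not specify them. Check the Metal Marlin source for the real layout.

```python
# Magnitudes representable by E2M1 for the 3 low bits (exponent, mantissa).
E2M1_MAGNITUDES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)


def decode_e2m1(code: int) -> float:
    """Decode one 4-bit E2M1 code: bit 3 is the sign, bits 0-2 the magnitude."""
    sign = -1.0 if code & 0x8 else 1.0
    return sign * E2M1_MAGNITUDES[code & 0x7]


def unpack_byte(byte: int):
    """Split a uint8 into two 4-bit codes. ASSUMPTION: low nibble comes first."""
    return byte & 0xF, (byte >> 4) & 0xF


def dequantize(packed: bytes, scales, group_size: int = 128):
    """Expand packed FP4 codes and multiply each by its group's scale."""
    values = []
    for byte in packed:
        for code in unpack_byte(byte):
            scale = scales[len(values) // group_size]
            values.append(decode_e2m1(code) * scale)
    return values


# Tiny example with group_size=4 so a single scale covers all four weights.
print(dequantize(bytes([0x21, 0xF7]), scales=[2.0], group_size=4))
```

Real tensors would be loaded from the safetensors shards and processed with vectorised operations; the loop here only documents the arithmetic.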
115
+
116
+ ## Hardware Requirements
117
+
118
+ | Device | Memory | Notes |
119
+ |--------|--------|-------|
120
+ | Apple M4 Max | 36 GB+ | Via Metal Marlin |
121
+ | Apple M2 Ultra | 64 GB+ | Via Metal Marlin |
122
+ | NVIDIA RTX 3090 | 24 GB | With offloading |
123
+ | NVIDIA RTX 4090 | 24 GB | Native |
124
+
125
+ ## Benchmarks
126
+
127
+ ### Original Model Performance (from Z.AI)
128
+
129
+ | Benchmark | GLM-4.7-Flash | Qwen3-30B-A3B | GPT-OSS-20B |
130
+ |-----------|---------------|---------------|-------------|
131
+ | AIME 2025 | 91.6 | 85.0 | **91.7** |
132
+ | GPQA | **75.2** | 73.4 | 71.5 |
133
+ | SWE-bench Verified | **59.2** | 22.0 | 34.0 |
134
+ | τ²-Bench | **79.5** | 49.0 | 47.7 |
135
+ | BrowseComp | **42.8** | 2.29 | 28.3 |
136
+
137
+ ### Quantized Model Notes
138
+
139
+ - GPTQ with actorder minimizes quality loss vs RTN
140
+ - Expected degradation: ~1-2% on benchmarks vs FP16
141
+ - E2M1 FP4 format optimized for Metal Performance Shaders
142
+
143
+ ## Comparison with Trellis Quant
144
+
145
+ | Model | Format | Size | Bits | Method |
146
+ |-------|--------|------|------|--------|
147
+ | [GLM-4.7-Flash-Trellis-MM](https://huggingface.co/RESMP-DEV/GLM-4.7-Flash-Trellis-MM) | Trellis | 14 GB | 3.78 bpw | EXL3-style mixed precision |
148
+ | **This model** | MMFP4 | 16 GB | 4.0 bpw | GPTQ + actorder |
149
+
150
+ Choose **Trellis** for smaller size, **MMFP4** for simpler tensor format and potentially better compatibility.
151
+
152
+ ## Limitations
153
+
154
+ - **Metal Marlin required** for optimal inference on Apple Silicon
155
+ - **No speculative decoding** yet
156
+ - **Quality loss**: ~1-2% on benchmarks vs FP16 (typical for 4-bit quantization)
157
+
158
+ ## Credits
159
+
160
+ - **Original model**: [Z.AI / GLM Team](https://huggingface.co/zai-org/GLM-4.7-Flash)
161
+ - **Quantization method**: GPTQ with actorder
162
+ - **Quantization toolkit**: [Metal Marlin](https://github.com/RESMP-DEV/metal-marlin)
163
+
164
+ ## Citation
165
+
166
+ If you use this model, please cite the original GLM-4.5 paper:
167
+
168
+ ```bibtex
169
+ @misc{glm2025glm45,
170
+ title={GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models},
171
+ author={GLM Team and Aohan Zeng and Xin Lv and others},
172
+ year={2025},
173
+ eprint={2508.06471},
174
+ archivePrefix={arXiv},
175
+ primaryClass={cs.CL},
176
+ url={https://arxiv.org/abs/2508.06471},
177
+ }
178
+ ```
179
+
180
+ ## License
181
+
182
+ This quantized model inherits the **MIT License** from the original GLM-4.7-Flash model.
config.json ADDED
@@ -0,0 +1,45 @@
1
+ {
2
+ "architectures": [
3
+ "Glm4MoeLiteForCausalLM"
4
+ ],
5
+ "attention_bias": false,
6
+ "attention_dropout": 0.0,
7
+ "pad_token_id": 154820,
8
+ "eos_token_id": [
9
+ 154820,
10
+ 154827,
11
+ 154829
12
+ ],
13
+ "hidden_act": "silu",
14
+ "hidden_size": 2048,
15
+ "intermediate_size": 10240,
16
+ "max_position_embeddings": 202752,
17
+ "model_type": "glm4_moe_lite",
18
+ "moe_intermediate_size": 1536,
19
+ "topk_method": "noaux_tc",
20
+ "norm_topk_prob": true,
21
+ "num_attention_heads": 20,
22
+ "n_group": 1,
23
+ "topk_group": 1,
24
+ "n_routed_experts": 64,
25
+ "n_shared_experts": 1,
26
+ "routed_scaling_factor": 1.8,
27
+ "num_experts_per_tok": 4,
28
+ "first_k_dense_replace": 1,
29
+ "num_hidden_layers": 47,
30
+ "num_key_value_heads": 20,
31
+ "num_nextn_predict_layers": 1,
32
+ "partial_rotary_factor": 1.0,
33
+ "rms_norm_eps": 1e-05,
34
+ "rope_scaling": null,
35
+ "rope_theta": 1000000,
36
+ "tie_word_embeddings": false,
37
+ "dtype": "bfloat16",
38
+ "transformers_version": "5.0.0rc0",
39
+ "q_lora_rank": 768,
40
+ "kv_lora_rank": 512,
41
+ "qk_nope_head_dim": 192,
42
+ "qk_rope_head_dim": 64,
43
+ "v_head_dim": 256,
44
+ "vocab_size": 154880
45
+ }
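The MLA fields in this config allow a rough estimate of KV cache size per token. The sketch below assumes a DeepSeek-V2-style MLA cache, where each layer stores one compressed KV latent of width `kv_lora_rank` plus one shared RoPE key of width `qk_rope_head_dim`. Whether a given runtime for `glm4_moe_lite` caches exactly this is an assumption, and the helper name is hypothetical.

```python
# Values copied from the config.json added in this commit.
config = {
    "num_hidden_layers": 47,
    "kv_lora_rank": 512,
    "qk_rope_head_dim": 64,
}


def mla_cache_bytes_per_token(cfg: dict, bytes_per_value: int = 2) -> int:
    """Cache bytes per token, ASSUMING latent + RoPE key are cached per layer."""
    per_layer_values = cfg["kv_lora_rank"] + cfg["qk_rope_head_dim"]
    return per_layer_values * cfg["num_hidden_layers"] * bytes_per_value


per_token = mla_cache_bytes_per_token(config)  # 16-bit cache entries
print(f"{per_token} bytes per token")
print(f"{per_token * 32768 / 1e9:.2f} GB at a 32,768-token context")
```

The 32,768-token context is only an example length chosen for the printout; the config allows up to `max_position_embeddings` tokens.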
generation_config.json ADDED
@@ -0,0 +1,11 @@
1
+ {
2
+ "_from_model_config": true,
3
+ "eos_token_id": [
4
+ 154820,
5
+ 154827,
6
+ 154829
7
+ ],
8
+ "pad_token_id": 154820,
9
+ "temperature": 1.0,
10
+ "transformers_version": "5.0.0.dev0"
11
+ }
model-00001-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d9c75eac1933aefdf174d6248f129c47b8f824380d67ac16d7cb5a6147926d69
3
+ size 43672248
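Each `.safetensors` entry in this diff is a Git LFS pointer (a `version` line, an `oid` line, and a `size` line), not the weights themselves. As a minimal sketch, a downloaded shard can be checked against its pointer as follows; the function names are hypothetical, and the local file path is whatever your download location happens to be.

```python
import hashlib

# Pointer text for model-00001-of-00048.safetensors, copied from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d9c75eac1933aefdf174d6248f129c47b8f824380d67ac16d7cb5a6147926d69
size 43672248
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Stream the file and compare its byte size and hash with the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model-00001-of-00048.safetensors", pointer)` on a local copy would then report whether the download is intact.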
model-00002-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3f1b7a84f2090d15ee804ef8d548177f8b82b6ec7965ced8188988f3bb3a87d
3
+ size 327833240
model-00003-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5ecdc9ce4adb8c055d01b87d64076ad728ff68e666dc928e03da78bac377c920
3
+ size 327833240
model-00004-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b006af82949a3315aef90b7baf1a2a9882bb3f43f9ce3c8cd65b4c76c9e38207
3
+ size 327833240
model-00005-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac3e9ef0665b550cb3372a710f8821dc0a1a92064659d387638a5ecb40ca9884
3
+ size 327833240
model-00006-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b8a5e3764b1084fd6b83adf38607d33f086c3940f9fa058b0e45db98f47fda0c
3
+ size 327833240
model-00007-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4fe2dc6014f82667508eb8a83a3e784507d1a9848326777dbf93b3c8b9cf64d5
3
+ size 327833240
model-00008-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a7aeadd95910d968c8865af1512c273b50ae3b52487c5a8844d198ed8ed1c94
3
+ size 327833240
model-00009-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a37a52a176a1055b54499d69790be9e6d002d292a1f279e191b0bc0e78ebf6d4
3
+ size 327833240
model-00010-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fc713771ef57f3f1a2ddddbc8975840b039009211af61283f317fe82f3b0f803
3
+ size 327833240
model-00011-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5c9d3ea9e470875dc56792322a46722b6512fd3297cb6cd278f3946e72dc1b57
3
+ size 327833648
model-00012-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ed7dd635f55f86789b8ad28da9b2f7e35720711e22b90c9a3b6d5238beb9e10
3
+ size 327833648
model-00013-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:826e321f91601aa166993da7a792b2e1e31b9de8eac98ea3c629bec6edae15bc
3
+ size 327833648
model-00014-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:25b088ccb049f761d50f94ed9688929a046990b4ff9b9728fe4e4f6a8afe99c8
3
+ size 327833648
model-00015-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bf329d81710a99098bebd05de1d809174a9c7ad2ef508845b07416f1ba8a938b
3
+ size 327833648
model-00016-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07bcea67819f6c8d448ad70dfd65ee40cdcf025732a94c20cf8c845013a4fd90
3
+ size 327833648
model-00017-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8136af0e71dd04218a927dbf44b415c93160df2fa9299db978bb0d82e0c82e5b
3
+ size 327833648
model-00018-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0b27e105f72ddf7a01b1ab864ce664a8d3d232880e3ff380a626e20fd63dafe3
3
+ size 327833648
model-00019-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f6f39fcfccbc3e9a4333bd484ac12cd816dc2c6fc266fc357e42a4a773ae051
3
+ size 327833648
model-00020-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:660343ac01528ac8d18ca2d2e1af3158c23adb215284a724c619b074980ae42f
3
+ size 327833648
model-00021-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:279237957148596abb9ce53e783e4583b3ac99ff6270cbe2242202f18891cc5d
3
+ size 327833648
model-00022-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:350d4635ff553896c5a193f12721042083d2741baa34e4cb7fc2e4dd608e144d
3
+ size 327833648
model-00023-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc0cd17264aeac8156ff8b620f759d5a6c4fa9fb376b31950f28935720320859
3
+ size 327833648
model-00024-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7e1bb3f7ecfb31ad86e0ca08eb7272b79041750a3d37e3fa53406746ffaed4fd
3
+ size 327833648
model-00025-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:636c23bd06de9f265e7bd504de64f6b6f52d129a425dbab7eeaffa7854119ada
3
+ size 327833648
model-00026-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd142bfc41b48bcf30f173febf3b425aa640aa2a87a88dc418948b0686437fb3
3
+ size 327833648
model-00027-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:74d684936b534f4da73488c37df32177b5f19fd7a349431bdb0b0edfe6f6567c
3
+ size 327833648
model-00028-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:17abe4e3ef81c14065300e33198a44237ac51880f383912f1cf3200d69e2dee6
3
+ size 327833648
model-00029-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0a2311764b9408632af65ae0dc21b5f0353986c3c47a0031a626098df65fc05d
3
+ size 327833648
model-00030-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b2b03130258c4c6c60026ceb61e0190020da76a0b9aef48e7f761932d7c656ce
3
+ size 327833648
model-00031-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ae64f3e55603f1a1e3a9d2fce50fb88a547d66f27acb31f8043a93eb97dbb2ee
3
+ size 327833648
model-00032-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c21366f8b23ed73d2bcf75bdbab9c0122f17abed308b6ba3319060b9bc8f0adc
3
+ size 327833648
model-00033-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:daa7c2cfe969844ff866c80c7c46dee90de2abac06b886b246550a2b53bbbc25
3
+ size 327833648
model-00034-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd7c93ce6893a3cc825987ca92c197261464e903506af51779ae5df5d50972f3
3
+ size 327833648
model-00035-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:328c8c578a97cd6d37518f953613c22123f2f4de1475d7777fd3ea8b6b7f5163
3
+ size 327833648
model-00036-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:92cce91528e2230b446321c5aff6d543812512a3eb00acb80944c82aee588752
3
+ size 327833648
model-00037-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cdb2b3661d4333703c1e7d6ed30ecdc94a24e9c7a45ea390166c24cceb01ce9a
3
+ size 327833648
model-00038-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bfce66c57e056ff78615e0ee624bd49bc88c1dba1eb23b9c64bbb732b48d6a7f
3
+ size 327833648
model-00039-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:771f8155c915eeeb5c1f1f38c44e3f3cb37be4b9be5c287bb3db5f8cca95f932
3
+ size 327833648
model-00040-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9369d29ce83807e45840032a764473a8d4e6338497351b97b5a98ad01fbb9fea
3
+ size 327833648
model-00041-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f5fb20256de1f464af76e118ba3da9344f464f54164dec7179c3e5517ccb1ba6
3
+ size 327833648
model-00042-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd23a8ca75ce66417c60e2b562476fc85d73d80f61034f1e178fc4594e37f8dc
3
+ size 327833648
model-00043-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:11bafe4c66fd086814c5c0427f3699d7a1b1f2749a563853cd54bfcf4c2b1bc2
3
+ size 327833648
model-00044-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:608d8caec230802251d6283475815be2939140e969d559c63f37502da7f28d9b
3
+ size 327833648
model-00045-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:256ee66546a588a35ff7d67ef48e1766989e53f579b6db736fa6ae40098bf7d0
3
+ size 327833648
model-00046-of-00048.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8b0b38005a15ee13d81fa77971c50ff541d18db75e20d948eed4484a26689bf3
3
+ size 327833648