catalystsec
/

MiniMax-M2.5-3bit-DWQ

Text Generation

Model card Files Files and versions

kernelpool commited on Feb 18

Commit

37cf26d

·

verified ·

1 Parent(s): 1b773a2

Update README.md

Files changed (1) hide show

README.md +36 -0

README.md CHANGED Viewed

@@ -8,3 +8,39 @@ base_model: MiniMaxAI/MiniMax-M2.5
 tags:
 - mlx
 ---

 tags:
 - mlx
 ---
+# catalystsec/MiniMax-M2.5-3bit-DWQ
+This model was quantized to 3-bit using DWQ with mlx-lm version **0.30.7**.
+| Parameter                 | Value                          |
+|---------------------------|--------------------------------|
+| DWQ learning rate         | 3e-7                           |
+| Batch size                | 1                              |
+| Dataset                   | `allenai/tulu-3-sft-mixture`   |
+| Initial validation loss   | 0.183                          |
+| Final validation loss     | 0.110                          |
+| Relative KL reduction     | ≈40 %                          |
+| Tokens processed          | ≈1.11 M                        |
+## Use with mlx
+```bash
+pip install mlx-lm
+```
+```python
+from mlx_lm import load, generate
+model, tokenizer = load("catalystsec/MiniMax-M2.5-3bit-DWQ")
+prompt = "hello"
+if tokenizer.chat_template is not None:
+    prompt = tokenizer.apply_chat_template(
+        [{"role": "user", "content": prompt}],
+        add_generation_prompt=True,
+    )
+response = generate(model, tokenizer, prompt=prompt, verbose=True)
+print(response)
+```