---
license: apache-2.0
base_model: stepfun-ai/Step-3.5-Flash
tags:
- mlx
pipeline_tag: text-generation
---

# mlx-community/Step-3.5-Flash-8bit

The Model [mlx-community/Step-3.5-Flash-8bit](https://huggingface.co/mlx-community/Step-3.5-Flash-8bit) was converted to MLX format from [stepfun-ai/Step-3.5-Flash](https://huggingface.co/stepfun-ai/Step-3.5-Flash) using mlx-lm version **0.30.6**.
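
For reference, a conversion along these lines can be reproduced with mlx-lm's `mlx_lm.convert` tool. The command below is a sketch assuming 8-bit quantization (per the repo name), not a record of the exact invocation used:

```bash
# Quantize the base model to 8-bit MLX weights (flags assumed, not the exact command used).
mlx_lm.convert \
    --hf-path stepfun-ai/Step-3.5-Flash \
    --mlx-path Step-3.5-Flash-8bit \
    -q --q-bits 8
```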

## Use with mlx

```bash
pip install mlx-lm
```

```python
from mlx_lm import load, generate

# Download the 8-bit weights from the Hub and load the model and tokenizer.
model, tokenizer = load("mlx-community/Step-3.5-Flash-8bit")

prompt = "hello"

# Wrap the raw prompt in the model's chat template when one is available.
if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```
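
Alternatively, mlx-lm ships a command-line entry point; a minimal sketch (the prompt is a placeholder):

```bash
mlx_lm.generate --model mlx-community/Step-3.5-Flash-8bit --prompt "hello"
```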