MLX
Safetensors
Russian
llama
8-bit precision
steamdroid commited on
Commit
e5372a1
·
verified ·
1 Parent(s): ad6c81d

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - ru
4
+ datasets:
5
+ - IlyaGusev/saiga_scored
6
+ - IlyaGusev/saiga_preferences
7
+ license: other
8
+ license_name: yandexgpt-5-lite-8b-pretrain
9
+ license_link: LICENSE
10
+ base_model: IlyaGusev/saiga_yandexgpt_8b
11
+ tags:
12
+ - mlx
13
+ ---
14
+
15
+ # steamdroid/saiga_yandexgpt_8b-mlx-8Bit
16
+
17
+ The Model [steamdroid/saiga_yandexgpt_8b-mlx-8Bit](https://huggingface.co/steamdroid/saiga_yandexgpt_8b-mlx-8Bit) was converted to MLX format from [IlyaGusev/saiga_yandexgpt_8b](https://huggingface.co/IlyaGusev/saiga_yandexgpt_8b) using mlx-lm version **0.26.3**.
18
+
19
+ ## Use with mlx
20
+
21
+ ```bash
22
+ pip install mlx-lm
23
+ ```
24
+
25
+ ```python
26
+ from mlx_lm import load, generate
27
+
28
+ model, tokenizer = load("steamdroid/saiga_yandexgpt_8b-mlx-8Bit")
29
+
30
+ prompt="hello"
31
+
32
+ if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
33
+ messages = [{"role": "user", "content": prompt}]
34
+ prompt = tokenizer.apply_chat_template(
35
+ messages, tokenize=False, add_generation_prompt=True
36
+ )
37
+
38
+ response = generate(model, tokenizer, prompt=prompt, verbose=True)
39
+ ```