Commit bb165d9 (verified), committed by ssdataanalysis
1 parent: ba97f7d

Upload README.md with huggingface_hub

  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
+---
+license: other
+license_name: nvidia-open-model-license
+license_link: https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
+pipeline_tag: text-generation
+language:
+- en
+- he
+tags:
+- pretrained
+- mlx
+- mlx-my-repo
+inference:
+  parameters:
+    temperature: 0.6
+base_model: dicta-il/DictaLM-3.0-Nemotron-12B-Instruct
+---
+
+# ssdataanalysis/DictaLM-3.0-Nemotron-12B-Instruct-mlx-8Bit
+
+The model [ssdataanalysis/DictaLM-3.0-Nemotron-12B-Instruct-mlx-8Bit](https://huggingface.co/ssdataanalysis/DictaLM-3.0-Nemotron-12B-Instruct-mlx-8Bit) was converted to MLX format from [dicta-il/DictaLM-3.0-Nemotron-12B-Instruct](https://huggingface.co/dicta-il/DictaLM-3.0-Nemotron-12B-Instruct) using mlx-lm version **0.29.1**.
+
+## Use with mlx
+
+```bash
+pip install mlx-lm
+```
+
+```python
+from mlx_lm import load, generate
+
+# Download (on first use) and load the converted weights and tokenizer
+model, tokenizer = load("ssdataanalysis/DictaLM-3.0-Nemotron-12B-Instruct-mlx-8Bit")
+
+prompt = "hello"
+
+# Wrap the prompt in the model's chat template when one is defined
+if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
+    messages = [{"role": "user", "content": prompt}]
+    prompt = tokenizer.apply_chat_template(
+        messages, tokenize=False, add_generation_prompt=True
+    )
+
+response = generate(model, tokenizer, prompt=prompt, verbose=True)
+```
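
The README's usage snippet calls `generate` with default sampling, while its metadata lists `temperature: 0.6` as an inference parameter. The sketch below is not part of the commit. It shows one way the two could be combined, assuming that `mlx_lm.sample_utils.make_sampler` and the `sampler` / `max_tokens` keyword arguments of `generate` behave as in recent mlx-lm releases. The helper names `build_prompt` and `ask`, and the `max_tokens=256` default, are hypothetical choices made for illustration.

```python
# Sketch only: build_prompt and ask are hypothetical helpers, not mlx-lm APIs.
REPO_ID = "ssdataanalysis/DictaLM-3.0-Nemotron-12B-Instruct-mlx-8Bit"


def build_prompt(tokenizer, user_text):
    """Wrap user_text in the chat template when the tokenizer defines one."""
    if getattr(tokenizer, "chat_template", None) is not None:
        messages = [{"role": "user", "content": user_text}]
        return tokenizer.apply_chat_template(
            messages, tokenize=False, add_generation_prompt=True
        )
    # No chat template available: fall back to the raw text
    return user_text


def ask(user_text, temperature=0.6, max_tokens=256):
    """Load the model and generate one reply at the given temperature."""
    # Imported lazily so the helpers above can be used without mlx-lm installed
    from mlx_lm import load, generate
    from mlx_lm.sample_utils import make_sampler

    model, tokenizer = load(REPO_ID)
    prompt = build_prompt(tokenizer, user_text)
    # Assumption: 0.6 mirrors the `temperature` value in the README metadata
    sampler = make_sampler(temp=temperature)
    return generate(
        model, tokenizer, prompt=prompt, sampler=sampler, max_tokens=max_tokens
    )
```

Calling `ask("hello")` on an Apple silicon machine with mlx-lm installed would download the weights on first use and return the generated text. Each call reloads the model, so a longer session would load it once and reuse it.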