tgetsov commited on
Commit
28898c8
·
verified ·
1 Parent(s): 06e451b

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -17,16 +17,16 @@ tags:
17
  - mlx
18
  ---
19
 
20
- # marvy-14B-lora
21
 
22
- **LoRA adapter for marvy-14B — the first open model for the full ServiceNow delivery lifecycle. Compose on top of Qwen2.5-14B-Instruct.**
23
 
24
  This is the **adapter-only** release (~175 MB). Apply it on
25
  [`Qwen/Qwen2.5-14B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
26
  to specialize the base for end-to-end ServiceNow delivery work. For ready-to-run
27
  weights see the merged model
28
- [`MainStack/marvy-14B`](https://huggingface.co/MainStack/marvy-14B) or the
29
- quantized [`MainStack/marvy-14B-GGUF`](https://huggingface.co/MainStack/marvy-14B-GGUF).
30
 
31
  > Released under **Apache-2.0**. Built with Qwen — see `NOTICE`.
32
 
@@ -63,7 +63,7 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
63
  base = "Qwen/Qwen2.5-14B-Instruct"
64
  tok = AutoTokenizer.from_pretrained(base)
65
  model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="auto", device_map="auto")
66
- model = PeftModel.from_pretrained(model, "MainStack/marvy-14B-lora")
67
  ```
68
 
69
  > Note: the adapter was trained with MLX-LM. The MLX `adapter_config.json` /
 
17
  - mlx
18
  ---
19
 
20
+ # marvy-1-14B-lora
21
 
22
+ **LoRA adapter for marvy-1-14B — the first open model for the full ServiceNow delivery lifecycle. Compose on top of Qwen2.5-14B-Instruct.**
23
 
24
  This is the **adapter-only** release (~175 MB). Apply it on
25
  [`Qwen/Qwen2.5-14B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
26
  to specialize the base for end-to-end ServiceNow delivery work. For ready-to-run
27
  weights see the merged model
28
+ [`MainStack/marvy-1-14B`](https://huggingface.co/MainStack/marvy-1-14B) or the
29
+ quantized [`MainStack/marvy-1-14B-GGUF`](https://huggingface.co/MainStack/marvy-1-14B-GGUF).
30
 
31
  > Released under **Apache-2.0**. Built with Qwen — see `NOTICE`.
32
 
 
63
  base = "Qwen/Qwen2.5-14B-Instruct"
64
  tok = AutoTokenizer.from_pretrained(base)
65
  model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="auto", device_map="auto")
66
+ model = PeftModel.from_pretrained(model, "MainStack/marvy-1-14B-lora")
67
  ```
68
 
69
  > Note: the adapter was trained with MLX-LM. The MLX `adapter_config.json` /