Instructions to use MainStack/marvy-1-14B-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use MainStack/marvy-1-14B-lora with PEFT:
Task type is invalid.
- MLX
How to use MainStack/marvy-1-14B-lora with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # if on a CUDA device, also pip install mlx[cuda] # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("MainStack/marvy-1-14B-lora") prompt = "Once upon a time in" text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- MLX LM
How to use MainStack/marvy-1-14B-lora with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Generate some text mlx_lm.generate --model "MainStack/marvy-1-14B-lora" --prompt "Once upon a time"
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -17,16 +17,16 @@ tags:
|
|
| 17 |
- mlx
|
| 18 |
---
|
| 19 |
|
| 20 |
-
# marvy-14B-lora
|
| 21 |
|
| 22 |
-
**LoRA adapter for marvy-14B — the first open model for the full ServiceNow delivery lifecycle. Compose on top of Qwen2.5-14B-Instruct.**
|
| 23 |
|
| 24 |
This is the **adapter-only** release (~175 MB). Apply it on
|
| 25 |
[`Qwen/Qwen2.5-14B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
|
| 26 |
to specialize the base for end-to-end ServiceNow delivery work. For ready-to-run
|
| 27 |
weights see the merged model
|
| 28 |
-
[`MainStack/marvy-14B`](https://huggingface.co/MainStack/marvy-14B) or the
|
| 29 |
-
quantized [`MainStack/marvy-14B-GGUF`](https://huggingface.co/MainStack/marvy-14B-GGUF).
|
| 30 |
|
| 31 |
> Released under **Apache-2.0**. Built with Qwen — see `NOTICE`.
|
| 32 |
|
|
@@ -63,7 +63,7 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
|
|
| 63 |
base = "Qwen/Qwen2.5-14B-Instruct"
|
| 64 |
tok = AutoTokenizer.from_pretrained(base)
|
| 65 |
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="auto", device_map="auto")
|
| 66 |
-
model = PeftModel.from_pretrained(model, "MainStack/marvy-14B-lora")
|
| 67 |
```
|
| 68 |
|
| 69 |
> Note: the adapter was trained with MLX-LM. The MLX `adapter_config.json` /
|
|
|
|
| 17 |
- mlx
|
| 18 |
---
|
| 19 |
|
| 20 |
+
# marvy-1-14B-lora
|
| 21 |
|
| 22 |
+
**LoRA adapter for marvy-1-14B — the first open model for the full ServiceNow delivery lifecycle. Compose on top of Qwen2.5-14B-Instruct.**
|
| 23 |
|
| 24 |
This is the **adapter-only** release (~175 MB). Apply it on
|
| 25 |
[`Qwen/Qwen2.5-14B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
|
| 26 |
to specialize the base for end-to-end ServiceNow delivery work. For ready-to-run
|
| 27 |
weights see the merged model
|
| 28 |
+
[`MainStack/marvy-1-14B`](https://huggingface.co/MainStack/marvy-1-14B) or the
|
| 29 |
+
quantized [`MainStack/marvy-1-14B-GGUF`](https://huggingface.co/MainStack/marvy-1-14B-GGUF).
|
| 30 |
|
| 31 |
> Released under **Apache-2.0**. Built with Qwen — see `NOTICE`.
|
| 32 |
|
|
|
|
| 63 |
base = "Qwen/Qwen2.5-14B-Instruct"
|
| 64 |
tok = AutoTokenizer.from_pretrained(base)
|
| 65 |
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="auto", device_map="auto")
|
| 66 |
+
model = PeftModel.from_pretrained(model, "MainStack/marvy-1-14B-lora")
|
| 67 |
```
|
| 68 |
|
| 69 |
> Note: the adapter was trained with MLX-LM. The MLX `adapter_config.json` /
|