NexaAI/DeepSeek-R1-Distill-Llama-8B-NexaQuant

Davidqian123 committed on Feb 6, 2025

Commit 033f9f3 · verified · 1 Parent(s): 60185e8

Create README.md

Files changed (1)

README.md +62 -0

README.md ADDED

	@@ -0,0 +1,62 @@

+---
+base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
+library_name: transformers
+license: llama3.1
+tags:
+- deepseek
+- transformers
+- llama
+- llama-3
+- meta
+---
+# DeepSeek-R1-Distill-Llama-8B-NexaQuant
+## Introduction
+**DeepSeek-R1-Distill-Llama-8B-NexaQuant** is a ... (TODO)
+---
+## How to Use on Your Device
+Below, we outline multiple ways to run the model locally.
+#### Option 1: Using Nexa SDK
+**Step 1: Install Nexa SDK**
+Follow the installation instructions in Nexa SDK's [GitHub repository](https://github.com/NexaAI/nexa-sdk).
+**Step 2: Run the model with Nexa**
+Execute the following command in your terminal:
+```bash
+nexa run DeepSeek-R1-Distill-Llama-8B-NexaQuant:q4_0
+```
+#### Option 2: Using llama.cpp
+**Step 1: Build llama.cpp on Your Device**
+Follow the "Building the project" instructions in the llama.cpp [repository](https://github.com/ggerganov/llama.cpp) to build the project.
+**Step 2: Run the Model with llama.cpp**
+Once built, run `llama-cli` under `<build_dir>/bin/`:
+```bash
+./llama-cli \
+    --model your/local/path/to/DeepSeek-R1-Distill-Llama-8B-NexaQuant \
+    --prompt 'Provide step-by-step reasoning enclosed in <think> </think> tags, followed by the final answer enclosed in \boxed{} tags.'
+```
+#### Option 3: Using LM Studio
+**Step 1: Download and Install LM Studio**
+Get the latest version from the [official website](https://lmstudio.ai/).
+**Step 2: Load and Run the Model**
+1. In LM Studio's top panel, search for and select `NexaAIDev/DeepSeek-R1-Distill-Llama-8B-NexaQuant`.
+2. Click `Download` (if not already downloaded) and wait for the model to load.
+3. Once loaded, go to the chat window and start a conversation.
+---
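
Note on the Option 2 prompt: it asks the model to put its reasoning in `<think> </think>` tags and its final answer in `\boxed{}`. If a response follows that format, the answer can be pulled out with standard text tools. The snippet below is a minimal sketch and not part of the committed README. The function name `extract_boxed` and the sample text are hypothetical, and it assumes the answer contains no nested braces.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not from the README): print the contents of the
# last \boxed{...} found on stdin.
# Limitation: nested braces inside the box are not handled.
extract_boxed() {
    grep -o '\\boxed{[^}]*}' | tail -n 1 | sed -e 's/^\\boxed{//' -e 's/}$//'
}

# Made-up text standing in for real model output.
sample='<think>2 + 2 is 4.</think> The final answer is \boxed{4}.'
printf '%s\n' "$sample" | extract_boxed   # prints: 4
```

Taking the last match is a deliberate choice here: the reasoning inside `<think>` may mention intermediate boxed values, and the final answer is expected to come after it.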