Update README.md

README.md CHANGED

@@ -48,8 +48,6 @@ NexaQuant on Reasoning Benchmarks Compared to BF16 and LMStudio's Q4_K_M
 <img src="https://cdn-uploads.huggingface.co/production/uploads/66abfd6f65beb23afa427d8a/Cyh1zVvDHNBT598IkLHkd.png" width="80%" alt="Example" />
 </div>
 
-The general capacity has also greatly improved:
-
 **General Capacity:**
 
 | Benchmark | Full 16-bit | llama.cpp (4-bit) | NexaQuant (4-bit)|
@@ -114,9 +112,9 @@ Get the latest version from the [official website](https://lmstudio.ai/).
 
 ## What's next
 
-1.
+1. This model is built for complex problem-solving, which is why it sometimes goes through a long thinking process even for simple questions. We recognize this and are working on improving it in the next update.
 
-2.
+2. Run inference with the NexaQuant DeepSeek-R1 distilled model on NPU.
 
 ### Follow us
 
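The benchmark table in the diff compares full 16-bit weights against two 4-bit quantizations. As a rough illustration of what blockwise 4-bit quantization does, here is a minimal pure-Python sketch; the function names, the symmetric scaling, and the block size of 32 are illustrative choices only, not NexaQuant's scheme (which is proprietary) nor llama.cpp's Q4_K_M (which uses a more elaborate super-block layout):

```python
# Illustrative blockwise symmetric 4-bit quantization (NOT NexaQuant's or
# llama.cpp's actual scheme): each block of 32 weights shares one float
# scale, and each weight is stored as a signed 4-bit integer in [-8, 7].
import random


def quantize_q4(weights, block=32):
    q_blocks, scales = [], []
    for i in range(0, len(weights), block):
        blk = weights[i:i + block]
        # One scale per block, chosen so the largest magnitude maps to +/-7.
        scale = max(abs(x) for x in blk) / 7.0 or 1.0
        q_blocks.append([max(-8, min(7, round(x / scale))) for x in blk])
        scales.append(scale)
    return q_blocks, scales


def dequantize_q4(q_blocks, scales):
    out = []
    for blk, scale in zip(q_blocks, scales):
        out.extend(q * scale for q in blk)
    return out


random.seed(0)
w = [random.gauss(0.0, 1.0) for _ in range(256)]
q, s = quantize_q4(w)
w_hat = dequantize_q4(q, s)
err = max(abs(a - b) for a, b in zip(w, w_hat))
print(f"max reconstruction error: {err:.4f}")
```

The per-block scale is what keeps the error bounded: each weight is off by at most half a quantization step, i.e. at most `scale / 2` within its block. Real GGUF formats additionally pack two 4-bit values per byte and store the scales in reduced precision, which this sketch omits.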