INCModel1 commited on
Commit
e7db23b
·
verified ·
1 Parent(s): 5dcf821

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - unsloth/DeepSeek-R1-BF16
5
+ ---
6
+ ## Model Details
7
+
8
+ This model card is for mxfp8 quantization of [unsloth/DeepSeek-R1-BF16](https://huggingface.co/unsloth/DeepSeek-R1-BF16) based on [intel/auto-round](https://github.com/intel/auto-round).
9
+ Please follow the license of the original model.
10
+
11
+ ## How to Use
12
+
13
+ The step-by-step README of quantization and evaluation can be found in [Intel Neural Compressor Examples](https://github.com/intel/neural-compressor/blob/master/examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round/deepseek/README.md).
14
+
15
+ ## Evaluate Results
16
+
17
+
18
+ | Task | backend | BF16 | MXFP8 |
19
+ |:-----------:|:-------:|:----------:|:----------:|
20
+ | hellaswag | vllm | 0.6903 | 0.6956 |
21
+ | piqa | vllm | 0.8319 | 0.8324 |
22
+ | mmlu | vllm | 0.8489 | 0.8532 |
23
+ | gsm8k | vllm | 0.9568 | 0.9583 |
24
+ | **average** | vllm | **0.8320** | **0.8349** |
25
+
26
+
27
+ ## Ethical Considerations and Limitations
28
+
29
+ The model can produce factually incorrect output, and should not be relied on to produce factually accurate information.
30
+ Because of the limitations of the pretrained model and the finetuning datasets, it is possible that this model could generate lewd, biased or otherwise offensive outputs.
31
+
32
+ Therefore, before deploying any applications of the model, developers should perform safety testing.
33
+
34
+ ## Caveats and Recommendations
35
+
36
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.
37
+
38
+ Here are a couple of useful links to learn more about Intel's AI software:
39
+
40
+ - [Intel Neural Compressor](https://github.com/intel/neural-compressor)
41
+ - [AutoRound](https://github.com/intel/auto-round)
42
+
43
+ ## Disclaimer
44
+
45
+ The license on this model does not constitute legal advice.
46
+ We are not responsible for the actions of third parties who use this model.
47
+ Please consult an attorney before using this model for commercial purposes.