Briko commited on
Commit
ecee4da
·
verified ·
1 Parent(s): 34c0a29

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -0
README.md CHANGED
@@ -1,3 +1,35 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - en
5
+ - zh
6
+ tags:
7
+ - GGUF
8
+ - llama.cpp
9
+ - apex
10
+ - quantized
11
+ - Mixture of Experts
12
+ base_model:
13
+ - AIDC-AI/Marco-Nano-Instruct
14
+ - mradermacher/Marco-Nano-Instruct-GGUF
15
+ pipeline_tag: text-generation
16
  ---
17
+
18
+ # Marco-Nano-Instruct-APEX APEX Quantized (GGUF)
19
+
20
+ This repository contains APEX-quantized GGUF files for [AIDC-AI's Marco-Nano-Instruct](https://huggingface.co/AIDC-AI/Marco-Nano-Instruct).
21
+
22
+ The quantization was performed using the [mudler/apex-quant](https://github.com/mudler/apex-quant) project, focusing on maximizing quality-to-size ratio using importance matrix (imatrix) guided quantization.
23
+
24
+ ## 📥 Source & Credits
25
+
26
+ - **Base Model**: [AIDC-AI's Marco-Nano-Instruct](https://huggingface.co/AIDC-AI/Marco-Nano-Instruct).
27
+ - **F16 GGUF & Imatrix**: The F16 source model and the importance matrix file used for quantization were sourced from [mradermacher's GGUF repository](https://huggingface.co/mradermacher/Marco-Nano-Instruct-i1-GGUF).
28
+
29
+ > **Special thanks to [@mradermacher](https://huggingface.co/mradermacher) for providing the high-quality imatrix file!**
30
+
31
+ ## ⚠️ For technical validation only
32
+
33
+ - Severe accuracy loss due to quantization; outputs may contain hallucinations, gibberish, or fail basic tasks.
34
+ - Suitable **only** for researching quantization noise, debugging conversion scripts, or comparing compression artifacts.
35
+ - No post-training calibration, fine-tuning, or recovery techniques were applied.