Briko's picture
Update README.md
ecee4da verified
metadata
license: apache-2.0
language:
  - en
  - zh
tags:
  - GGUF
  - llama.cpp
  - apex
  - quantized
  - Mixture of Experts
base_model:
  - AIDC-AI/Marco-Nano-Instruct
  - mradermacher/Marco-Nano-Instruct-GGUF
pipeline_tag: text-generation

Marco-Nano-Instruct-APEX APEX Quantized (GGUF)

This repository contains APEX-quantized GGUF files for AIDC-AI's Marco-Nano-Instruct.

The quantization was performed using the mudler/apex-quant project, focusing on maximizing quality-to-size ratio using importance matrix (imatrix) guided quantization.

📥 Source & Credits

Special thanks to @mradermacher for providing the high-quality imatrix file!

⚠️ For technical validation only

  • Severe accuracy loss due to quantization; outputs may contain hallucinations, gibberish, or fail basic tasks.
  • Suitable only for researching quantization noise, debugging conversion scripts, or comparing compression artifacts.
  • No post-training calibration, fine-tuning, or recovery techniques were applied.