Add pipeline tag, library name and code link

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +24 -2
README.md CHANGED
@@ -1,8 +1,30 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
  - mistralai/Mistral-Nemo-Instruct-2407
 
5
  tags:
6
  - nvfp4
 
 
7
  ---
8
- Quantized NVFP4 version of [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) with the `down_proj` tensors calibrated in `FP8_DYNAMIC` and the rest using [Four Over Six](https://arxiv.org/abs/2512.02010) adaptive block scaling, created to compare against my [hybrid quant]([https://huggingface.co/DataSnake/Mistral-Nemo-Instruct-2407-NVFP4-FP8). Made with the same version of llm-compressor and compressed-tensors, using the same calibration data, to isolate the variables as much as possible.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  base_model:
3
  - mistralai/Mistral-Nemo-Instruct-2407
4
+ license: apache-2.0
5
  tags:
6
  - nvfp4
7
+ pipeline_tag: text-generation
8
+ library_name: transformers
9
  ---
10
+
11
+ Quantized NVFP4 version of [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) with the `down_proj` tensors calibrated in `FP8_DYNAMIC` and the rest using [Four Over Six](https://arxiv.org/abs/2512.02010) adaptive block scaling, created to compare against my [hybrid quant](https://huggingface.co/DataSnake/Mistral-Nemo-Instruct-2407-NVFP4-FP8). Made with the same version of llm-compressor and compressed-tensors, using the same calibration data, to isolate the variables as much as possible.
12
+
13
+ ## Resources
14
+
15
+ - **Paper:** [Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling](https://arxiv.org/abs/2512.02010)
16
+ - **GitHub Repository:** [mit-han-lab/fouroversix](https://github.com/mit-han-lab/fouroversix)
17
+
18
+ ## Citation
19
+
20
+ ```bibtex
21
+ @misc{cook2025sixaccuratenvfp4quantization,
22
+ title={Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling},
23
+ author={Jack Cook and Junxian Guo and Guangxuan Xiao and Yujun Lin and Song Han},
24
+ year={2025},
25
+ eprint={2512.02010},
26
+ archivePrefix={arXiv},
27
+ primaryClass={cs.CL},
28
+ url={https://arxiv.org/abs/2512.02010},
29
+ }
30
+ ```