nex-agi/agent-sft
Preview • Updated • 1.37k • 116
First attempt at quantization.
A lot of existing quantizations used the default dataset of ultrachat or CNN/DailyMail but these are generic datasets.
I thought of doing 512 samples at 4096 seq length but with the actual nex-n2 dataset which the original model was trained on which is the nex-agi/agent-sft
See more at https://github.com/Apro123/quantize-nex-efforts
All respective credits go to the Nex-AGI team, Nvidia for ModelOpt, VLLM for the llmcompressor tools.
Refer to the original model card for details on the underlying model
Base model
nex-agi/Nex-N2-mini