Llama-3.1-8B-ModelOpt-FP8-QAT / hf_quant_config.json

Commit History

Upload Llama-3.1-8B quantized with ModelOpt FP8-QAT
6c85a47
verified

genai2eliza committed on