Llama-3.1-8B-ModelOpt-FP8 / NVFP4 / hf_quant_config.json

Commit History

Upload Llama-3.1-8B quantized with ModelOpt FP8
33d5dec

genai2eliza committed on