Llama-3.1-8B-ModelOpt-NVFP4 / hf_quant_config.json

Commit History

Upload Llama-3.1-8B quantized with ModelOpt NVFP4
813cbb5

genai2eliza committed on