Upload FP8 quantized version of TareksGraveyard/Inception-LLaMa-70B
default_stage:
  default_modifiers:
    QuantizationModifier:
      targets: [Linear]
      ignore: [lm_head]
      scheme: FP8_DYNAMIC