sh0ck0r
/

Inception-LLaMa-70B-FP8-Dynamic

Text Generation

compressed-tensors

text-generation-inference

Model card Files Files and versions

Inception-LLaMa-70B-FP8-Dynamic / recipe.yaml

sh0ck0r's picture

Upload FP8 quantized version of TareksGraveyard/Inception-LLaMa-70B

04f7658 verified 4 months ago

history blame contribute delete

136 Bytes

	default_stage:
	default_modifiers:
	QuantizationModifier:
	targets: [Linear]
	ignore: [lm_head]
	scheme: FP8_DYNAMIC