Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
sh0ck0r
/
Inception-LLaMa-70B-FP8-Dynamic
like
0
Text Generation
Transformers
Safetensors
llama
fp8
vllm
compressed-tensors
quantized
llmcompressor
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Inception-LLaMa-70B-FP8-Dynamic
/
recipe.yaml
sh0ck0r
Upload FP8 quantized version of TareksGraveyard/Inception-LLaMa-70B
04f7658
verified
4 months ago
raw
Copy download link
history
blame
contribute
delete
Safe
136 Bytes
default_stage:
default_modifiers:
QuantizationModifier:
targets:
[
Linear
]
ignore:
[
lm_head
]
scheme:
FP8_DYNAMIC