Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
/
Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
like
0
Follow
NM Testing
92
Transformers
kv-cache
fp8
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
Commit History
Upload README.md with huggingface_hub
76b4488
verified
krishnateja95
commited on
Nov 25
Upload README.md with huggingface_hub
3b8571a
verified
krishnateja95
commited on
Nov 25
Upload README.md with huggingface_hub
7679af8
verified
krishnateja95
commited on
Nov 25
initial commit
b3203db
verified
krishnateja95
commited on
Nov 25