Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
/
Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Head
like
0
Follow
NM Testing
94
Transformers
kv-cache
fp8
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Head
Commit History
Upload README.md with huggingface_hub
8f9af52
verified
krishnateja95
commited on
Nov 25, 2025
Upload README.md with huggingface_hub
452e7b4
verified
krishnateja95
commited on
Nov 25, 2025
Upload README.md with huggingface_hub
db7ba98
verified
krishnateja95
commited on
Nov 25, 2025
initial commit
9e30e9b
verified
krishnateja95
commited on
Nov 25, 2025