Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
miike-ai
/
LeanLlama-8B
like
0
Safetensors
llama
custom_code
Model card
Files
Files and versions
xet
Community
main
LeanLlama-8B
16.1 GB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
miike-ai
Add 128K validation results and chunked prefill usage example
fa4671c
verified
about 2 months ago
.gitattributes
Safe
1.57 kB
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
README.md
Safe
3.37 kB
Add 128K validation results and chunked prefill usage example
about 2 months ago
chat_template.jinja
Safe
4.61 kB
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
compression_config.json
Safe
451 Bytes
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
config.json
Safe
1.7 kB
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
generation_config.json
Safe
183 Bytes
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
model.safetensors
Safe
16.1 GB
xet
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
modeling_lean_llama.py
Safe
11.9 kB
Fix compression to handle all new tokens (chunked prefill support)
about 2 months ago
tokenizer.json
Safe
17.2 MB
xet
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago
tokenizer_config.json
Safe
296 Bytes
Initial upload: LeanLlama-8B with KV cache compression
about 2 months ago