djuna/G2-GSHT-32K
Duplicated from djuna/G2-GSHT
Tags: Text Generation, Transformers, Safetensors, gemma2, mergekit, Merge, conversational, text-generation-inference
Branch: main
G2-GSHT-32K (20.3 GB)
1 contributor
History: 2 commits
Latest commit: djuna, "Increase context window using rope base freq" (6f3f9ea, verified), over 1 year ago
File                              Size       Storage  Last commit                                   Updated
.gitattributes                    1.57 kB    -        Duplicate from djuna/G2-GSHT                  over 1 year ago
README.md                         1.09 kB    -        Duplicate from djuna/G2-GSHT                  over 1 year ago
config.json                       905 Bytes  -        Increase context window using rope base freq  over 1 year ago
mergekit_config.yml               372 Bytes  -        Duplicate from djuna/G2-GSHT                  over 1 year ago
model-00001-of-00005.safetensors  4.96 GB    xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
model-00002-of-00005.safetensors  4.98 GB    xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
model-00003-of-00005.safetensors  4.93 GB    xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
model-00004-of-00005.safetensors  4.98 GB    xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
model-00005-of-00005.safetensors  470 MB     xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
model.safetensors.index.json      37.3 kB    -        Duplicate from djuna/G2-GSHT                  over 1 year ago
special_tokens_map.json           636 Bytes  -        Duplicate from djuna/G2-GSHT                  over 1 year ago
tokenizer.json                    17.5 MB    xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
tokenizer.model                   4.24 MB    xet      Duplicate from djuna/G2-GSHT                  over 1 year ago
tokenizer_config.json             41 kB      -        Duplicate from djuna/G2-GSHT                  over 1 year ago
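
The only file changed after duplication is config.json, with the commit message "Increase context window using rope base freq". The listing does not show the values that were edited, so the following is only a minimal sketch of the general idea behind raising the rotary position embedding (RoPE) base frequency. The head dimension and both base values are illustrative assumptions, not values read from this repository.

```python
import math

def rope_inv_freq(head_dim: int, base: float) -> list[float]:
    """Standard RoPE inverse frequencies: base ** (-2i / head_dim) for each dimension pair i."""
    return [base ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]

def longest_wavelength(head_dim: int, base: float) -> float:
    """Number of positions the slowest-rotating pair needs to complete one full turn."""
    return 2 * math.pi / rope_inv_freq(head_dim, base)[-1]

# Assumed values for illustration only; none of them come from this repo's config.json.
HEAD_DIM = 256            # hypothetical attention head dimension
DEFAULT_BASE = 10_000.0   # a common default RoPE base
RAISED_BASE = 40_000.0    # hypothetical larger base

for base in (DEFAULT_BASE, RAISED_BASE):
    wavelength = longest_wavelength(HEAD_DIM, base)
    print(f"base={base:>8.0f}  longest wavelength ~ {wavelength:,.0f} positions")
```

A larger base makes every dimension pair rotate more slowly, so positional phases take more positions to wrap around. That is the usual motivation for raising the base when extending a context window; whether output quality holds at the longer length has to be checked empirically for the specific model.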