Norapom/experimental_gqa_1_5b
Tags: English, megatron-lm, pretrained, gqa, megatron, experimental
License: other
Branch: main
2 contributors
History: 8 commits
Latest commit: 9892cde by AaronWang04, 3 days ago (message: ".")
Name                                Size      Scan  Last commit message                                         Updated
assets                              -         -     Upload cl100k_base tokenizer                                12 days ago
iter_0016000                        -         -     Upload iter_16000 checkpoint shards                         12 days ago
tokenizer                           -         -     Add Cl100kChatTokenizer (chat/think/tool reserved tokens)   12 days ago
.gitattributes                      3.58 kB   Safe  Upload iter_16000 checkpoint shards                         12 days ago
README.md                           1.27 kB   Safe  .                                                           3 days ago
latest_checkpointed_iteration.txt   5 Bytes   Safe  Set latest iter to 16000                                    12 days ago