ManasMittal2005/moe_transformer_topk_gqa_kv2
Safetensors
moe_transformer_scratch
main
moe_transformer_topk_gqa_kv2
2.54 GB
1 contributor
History: 5 commits
Latest commit: 159a646 (verified) by ManasMittal2005, 6 months ago: Add moe_transformer_topk_gqa_kv2 model checkpoint (Epoch 6)
Name                      Size          Scan  Last commit                                                        Updated
checkpoint-epoch-10       -             -     Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 10)    6 months ago
checkpoint-epoch-20       -             -     Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 10)    6 months ago
checkpoint-epoch-5        -             -     Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 10)    6 months ago
.gitattributes            1.52 kB       Safe  initial commit                                                     6 months ago
config.json               451 Bytes     Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 10)    6 months ago
evaluation_metrics.json   148 Bytes     Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 10)    6 months ago
merges.txt                456 kB        Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 1)     6 months ago
model.safetensors         796 MB (xet)  -     Add moe_transformer_topk_gqa_kv2 model checkpoint (Epoch 6)        6 months ago
results.json              1.96 MB       Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 10)    6 months ago
special_tokens_map.json   279 Bytes     Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 1)     6 months ago
tokenizer.json            3.56 MB       Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 1)     6 months ago
tokenizer_config.json     1.24 kB       -     Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 1)     6 months ago
vocab.json                798 kB        Safe  Add moe_transformer_topk_gqa_kv2 model and results (Epochs: 1)     6 months ago