GODELEV/Ant-10M
Text Generation
Safetensors
GODELEV/Archaea-5M-T
English
llama
causal-lm
gqa
rope
swiglu
License: apache-2.0
Ant-10M/training_meta.json
GODELEV: step 1264 | val_ppl=12.46 (commit 28299b6, verified, 24 days ago)
179 Bytes
{
  "step": 1264,
  "val_loss": 2.5222082792282103,
  "val_ppl": 12.45607280175252,
  "params_M": 9.902,
  "tokens_seen": 2979215382
  ,
  "pushed_at": "2026-06-11T16:46:41.317193"
}
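The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the repository: it assumes `val_loss` is a mean per-token cross-entropy in nats (so that perplexity is its exponential) and that `params_M` is the parameter count in millions. The helper names `perplexity_from_loss` and `tokens_per_param` are hypothetical and chosen only for illustration.

```python
import json
import math

# Contents of training_meta.json, copied from the file shown above.
RAW_META = """
{
  "step": 1264,
  "val_loss": 2.5222082792282103,
  "val_ppl": 12.45607280175252,
  "params_M": 9.902,
  "pushed_at": "2026-06-11T16:46:41.317193",
  "tokens_seen": 2979215382
}
"""


def perplexity_from_loss(loss: float) -> float:
    """Perplexity for a mean per-token cross-entropy loss.

    Assumption: the loss is measured in nats (natural logarithm).
    """
    return math.exp(loss)


def tokens_per_param(meta: dict) -> float:
    """Training tokens seen per model parameter.

    Assumption: params_M is the parameter count in millions.
    """
    return meta["tokens_seen"] / (meta["params_M"] * 1e6)


meta = json.loads(RAW_META)
recomputed_ppl = perplexity_from_loss(meta["val_loss"])

# Compare the recorded perplexity with the one derived from val_loss.
print(f"recorded val_ppl: {meta['val_ppl']:.4f}")
print(f"exp(val_loss):    {recomputed_ppl:.4f}")
print(f"tokens per param: {tokens_per_param(meta):.1f}")
```

If the two perplexity values agree, the recorded `val_ppl` is consistent with being the exponential of `val_loss` under the stated assumption; the commit message rounds the same value to two decimals.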