Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
iamshnoo
/
combined_without_metadata_3b_step8k
like
0
Text Generation
Transformers
Safetensors
llama
metadata-localization
global
3b
without-metadata
pretraining
intermediate-checkpoint
text-generation-inference
arxiv:
2601.15236
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
combined_without_metadata_3b_step8k
/
assets
87.2 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
iamshnoo
Update model card and embedded training curves
04cbcc5
verified
18 days ago
tokens_per_sec.png
32.6 kB
Update model card and embedded training curves
18 days ago
train_loss.png
24.9 kB
Update model card and embedded training curves
18 days ago
val_perplexity.png
29.8 kB
Update model card and embedded training curves
18 days ago