Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Viharikvs
/
CMBA-768M-FineWeb
like
0
Text Generation
PyTorch
HuggingFaceFW/fineweb-edu
English
causal-lm
mamba
hrm
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
CMBA-768M-FineWeb
Commit History
Model card updated after epoch 2
b329ee8
verified
Viharikvs
commited on
Oct 5
End of Epoch 2: Val Loss 8.1216, Perplexity 3366.37
a5c4c37
verified
Viharikvs
commited on
Oct 5
Checkpoint at step 1500 (Epoch 2)
34dc57a
verified
Viharikvs
commited on
Oct 5
Model card updated after epoch 1
abd1f87
verified
Viharikvs
commited on
Oct 5
End of Epoch 1: Val Loss 8.0398, Perplexity 3102.13
91d2777
verified
Viharikvs
commited on
Oct 5
Checkpoint at step 1000 (Epoch 1)
028385d
verified
Viharikvs
commited on
Oct 5
Model card updated after epoch 0
31dbfd1
verified
Viharikvs
commited on
Oct 5
End of Epoch 0: Val Loss 8.8860, Perplexity 7230.03
cc22e49
verified
Viharikvs
commited on
Oct 5
Checkpoint at step 500 (Epoch 0)
a7c1969
verified
Viharikvs
commited on
Oct 5
initial commit
87f2cc7
verified
Viharikvs
commited on
Oct 5