Commit History

Add PyTorch weights (.pt) converted from JLD2 checkpoint
a7ea2a7
verified

LisaMegaWatts commited on

Fix model card: match actual HF checkpoint (d=512, 8L, 8Q/2KV, ~23M params, ctx=256, FFN=1344)
afa692e
verified

LisaMegaWatts commited on

Fix model card: actual trained model is d=256, 4 layers, 4Q/2KV, ~4M params (was incorrectly listed as 10M)
287076b
verified

LisaMegaWatts commited on

Fix model card: context_length=256 (not 512), dropout=0.1 (not 0.0) per checkpoint
5907abe
verified

LisaMegaWatts commited on

Add model card with architecture details, provenance, and training metrics
9c956d0
verified

LisaMegaWatts commited on

Delete julia-slm/5m-chinchilla/config.toml with huggingface_hub
91a8ddf
verified

LisaMegaWatts commited on

Delete julia-slm/5m-chinchilla/step_12000.jld2 with huggingface_hub
36bde8c
verified

LisaMegaWatts commited on

Delete julia-slm/5m-chinchilla/final.jld2 with huggingface_hub
ac4afd0
verified

LisaMegaWatts commited on

Upload julia-slm/5m-chinchilla/step_12000.jld2 with huggingface_hub
c0014cd
verified

LisaMegaWatts commited on

Upload julia-slm/5m-chinchilla/final.jld2 with huggingface_hub
5667a2a
verified

LisaMegaWatts commited on

Upload julia-slm/5m-chinchilla/config.toml with huggingface_hub
01972e5
verified

LisaMegaWatts commited on

Fix tokenizer: trim to 2000 vocab to match trained model
db0e784
verified

LisaMegaWatts commited on

Upload checkpoint_interrupted.jld2 (261.1 MB)
b61a07c
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
3b4a52a
verified

LisaMegaWatts commited on

Upload checkpoint_interrupted.jld2 (261.1 MB)
319a7a5
verified

LisaMegaWatts commited on

Upload final_model.jld2 (261.1 MB)
2440491
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
20cb6be
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
e825dce
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
b8fddc2
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
1e78279
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
51d7933
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
12aa2f0
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
2e78061
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
1667120
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.2 MB)
8c2cfb2
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
cc7b028
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
d755d74
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
7a9da3e
verified

LisaMegaWatts commited on

Upload final_model.jld2 (261.1 MB)
5805138
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
b742ef3
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
37a2041
verified

LisaMegaWatts commited on

Upload final_model.jld2 (261.1 MB)
f8a550d
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
50d8cf1
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
78c0464
verified

LisaMegaWatts commited on

Upload final_model.jld2 (261.1 MB)
4b86ceb
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
fcc671a
verified

LisaMegaWatts commited on

Upload final_model.jld2 (261.1 MB)
d4e408c
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
107ab06
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
b931fa4
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
3ca123f
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
1ec1fff
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
1397976
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
586f0bb
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
b994abe
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
cd6666c
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
d31166a
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
bf24214
verified

LisaMegaWatts commited on

Upload best_model.jld2 (261.1 MB)
0e8955c
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
b1f2137
verified

LisaMegaWatts commited on

Upload checkpoint_latest.jld2 (261.1 MB)
a615aaa
verified

LisaMegaWatts commited on