Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LisaMegaWatts
/
JuliaFluxGPT-distilled
like
0
Text Generation
PyTorch
English
llama-style
rope
swiglu
gqa
rmsnorm
bpe
philosophy
openai-compatible
symbiogenesis
distillation
cross-species
Eval Results (legacy)
License:
mit
Model card
Files
Files and versions
xet
Community
main
JuliaFluxGPT-distilled
91.3 MB
1 contributor
History:
6 commits
LisaMegaWatts
Add model card
e67b473
verified
1 day ago
.gitattributes
Safe
1.52 kB
initial commit
1 day ago
README.md
3.76 kB
Add model card
1 day ago
juliaflux_distilled_warm_best.pt
91.3 MB
xet
Add distilled checkpoint (warm start, val_loss=3.687)
1 day ago
juliaflux_model.py
10.1 kB
Add model definition
1 day ago
merges.txt
Safe
13.8 kB
Add tokenizer merges
1 day ago
vocab.json
Safe
33.6 kB
Add tokenizer vocab
1 day ago