LisaMegaWatts/JuliaGPTDistill
Text Generation
LisaMegaWatts/philosophy-corpus
English
flux
julia
flux-jl
distillation
knowledge-distillation
llama-style
gqa
rope
rmsnorm
swiglu
bpe
philosophy
Eval Results (legacy)
License: mit
main
JuliaGPTDistill/final_model.jld2
Commit History
Upload final_model.jld2 (39.7 MB)
7a4e879
verified
LisaMegaWatts
committed
6 days ago