LisaMegaWatts
/
JuliaGPTDistill

Text Generation
English
flux
julia
flux-jl
distillation
knowledge-distillation
llama-style
gqa
rope
rmsnorm
swiglu
bpe
philosophy
Eval Results (legacy)
JuliaGPTDistill
125 MB
  • 1 contributor
History: 26 commits
LisaMegaWatts
Upload final_model.jld2 (39.7 MB)
7a4e879 verified about 2 months ago
  • .gitattributes
    1.68 kB
    Upload final_model.jld2 (39.7 MB) about 2 months ago
  • README.md
    24 Bytes
    initial commit about 2 months ago
  • best_model.jld2
    41.7 MB
    xet
    Upload best_model.jld2 (39.7 MB) about 2 months ago
  • checkpoint_latest.jld2
    41.7 MB
    xet
    Upload checkpoint_latest.jld2 (39.7 MB) about 2 months ago
  • final_model.jld2
    41.7 MB
    xet
    Upload final_model.jld2 (39.7 MB) about 2 months ago