LisaMegaWatts/JuliaFluxGPT

Text Generation
English
flux
julia
flux-jl
llama-style
gqa
grouped-query-attention
rope
rmsnorm
swiglu
bpe
philosophy
JuliaFluxGPT, 1.1 GB
  • 1 contributor
History: 256 commits
Latest commit by LisaMegaWatts: Fix tokenizer: trim to 2000 vocab to match trained model
db0e784, verified, 2 months ago
  • .gitattributes
    1.8 kB
    Last commit: Upload checkpoint_interrupted.jld2 (108.3 MB), 2 months ago
  • README.md
    6.66 kB
    Last commit: Update model card with architecture details, training config, and usage instructions, 2 months ago
  • best_model.jld2
    274 MB (xet)
    Last commit: Upload best_model.jld2 (261.1 MB), 2 months ago
  • checkpoint_interrupted.jld2
    274 MB (xet)
    Last commit: Upload checkpoint_interrupted.jld2 (261.1 MB), 2 months ago
  • checkpoint_latest.jld2
    274 MB (xet)
    Last commit: Upload checkpoint_latest.jld2 (261.2 MB), 2 months ago
  • final_model.jld2
    274 MB (xet)
    Last commit: Upload final_model.jld2 (261.1 MB), 2 months ago
  • tokenizer.json
    59.5 kB
    Last commit: Fix tokenizer: trim to 2000 vocab to match trained model, 2 months ago