Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
LisaMegaWatts
/
JuliaFluxGPT
like
0
Text Generation
LisaMegaWatts/philosophy-corpus
English
flux
julia
flux-jl
llama-style
gqa
grouped-query-attention
rope
rmsnorm
swiglu
bpe
philosophy
License:
mit
Model card
Files
Files and versions
xet
Community
db0e784
JuliaFluxGPT
1.1 GB
Ctrl+K
Ctrl+K
1 contributor
History:
256 commits
LisaMegaWatts
Fix tokenizer: trim to 2000 vocab to match trained model
db0e784
verified
2 months ago
.gitattributes
Safe
1.8 kB
Upload checkpoint_interrupted.jld2 (108.3 MB)
2 months ago
README.md
Safe
6.66 kB
Update model card with architecture details, training config, and usage instructions
2 months ago
best_model.jld2
Safe
274 MB
xet
Upload best_model.jld2 (261.1 MB)
2 months ago
checkpoint_interrupted.jld2
Safe
274 MB
xet
Upload checkpoint_interrupted.jld2 (261.1 MB)
2 months ago
checkpoint_latest.jld2
Safe
274 MB
xet
Upload checkpoint_latest.jld2 (261.2 MB)
2 months ago
final_model.jld2
Safe
274 MB
xet
Upload final_model.jld2 (261.1 MB)
2 months ago
tokenizer.json
Safe
59.5 kB
Fix tokenizer: trim to 2000 vocab to match trained model
2 months ago