Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
LisaMegaWatts
/
JuliaFluxGPT-distilled
like
0
Text Generation
PyTorch
English
llama-style
rope
swiglu
gqa
rmsnorm
bpe
philosophy
openai-compatible
symbiogenesis
distillation
cross-species
Eval Results (legacy)
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
JuliaFluxGPT-distilled
Ctrl+K
Ctrl+K
1 contributor
History:
6 commits
LisaMegaWatts
Add model card
e67b473
verified
3 months ago
.gitattributes
Safe
1.52 kB
initial commit
3 months ago
README.md
3.76 kB
Add model card
3 months ago
juliaflux_distilled_warm_best.pt
91.3 MB
xet
Add distilled checkpoint (warm start, val_loss=3.687)
3 months ago
juliaflux_model.py
10.1 kB
Add model definition
3 months ago
merges.txt
Safe
13.8 kB
Add tokenizer merges
3 months ago
vocab.json
Safe
33.6 kB
Add tokenizer vocab
3 months ago