JuliaGPTDistill / README.md

Commit History

Add proper model card: 256d/4L/4H/2KV, vocab=2000, distilled from JuliaFluxGPT
b2f06c9
verified

LisaMegaWatts commited on