Commit History

Cache Monarch matrices + causal mask for faster inference
76b7110
verified

LisaMegaWatts committed on

Fix completion_tokens: count tokens not decoded characters
f0aedd4
verified

LisaMegaWatts committed on

Upload README.md with huggingface_hub
03128a8
verified

LisaMegaWatts committed on

Upload Project.toml with huggingface_hub
7e8c519
verified

LisaMegaWatts committed on

Upload Dockerfile with huggingface_hub
67a64ea
verified

LisaMegaWatts committed on

Upload server.jl with huggingface_hub
91c86b7
verified

LisaMegaWatts committed on

Upload checkpoint.jl with huggingface_hub
3724bdb
verified

LisaMegaWatts committed on

Upload model.jl with huggingface_hub
0575c49
verified

LisaMegaWatts committed on