Commit History

Cache Monarch matrix + causal mask for faster inference
676f15f
Running
verified

LisaMegaWatts commited on

Fix completion_tokens: count tokens not decoded characters
d167fb8
verified

LisaMegaWatts commited on

Upload server.jl with huggingface_hub
62c96ea
verified

LisaMegaWatts commited on

Upload checkpoint.jl with huggingface_hub
c126311
verified

LisaMegaWatts commited on

Upload model.jl with huggingface_hub
98a6731
verified

LisaMegaWatts commited on

Upload Project.toml with huggingface_hub
7dcd958
verified

LisaMegaWatts commited on

Upload Dockerfile with huggingface_hub
8e47ba1
verified

LisaMegaWatts commited on

Upload README.md with huggingface_hub
0854dbe
verified

LisaMegaWatts commited on