Cache Monarch matrix + causal mask for faster inference 676f15f Running verified LisaMegaWatts commited on 30 days ago
Fix completion_tokens: count tokens not decoded characters d167fb8 verified LisaMegaWatts commited on Feb 26