Efficient Drop-In Replacement for the Classification Head in Language Model Inference. https://github.com/embedl/flash-head
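
For context, the classification head of a language model is the final projection that maps the last hidden state to one logit per vocabulary token. The sketch below shows only that dense baseline, the component a drop-in replacement would stand in for. It is not FlashHead's method or API; the names `dense_head_logits` and `greedy_token` and the toy weights are hypothetical and chosen purely for illustration.

```python
from typing import List


def dense_head_logits(hidden: List[float], weight: List[List[float]]) -> List[float]:
    """Dense classification head: logits[v] = dot(weight[v], hidden).

    `weight` has one row per vocabulary entry, so the cost per generated
    token grows with vocab_size * hidden_size.
    """
    return [sum(w * h for w, h in zip(row, hidden)) for row in weight]


def greedy_token(logits: List[float]) -> int:
    """Return the index of the largest logit (greedy decoding)."""
    return max(range(len(logits)), key=logits.__getitem__)


# Toy example (hypothetical values): hidden size 2, vocabulary size 3.
hidden = [1.0, 2.0]
weight = [
    [0.5, 0.0],   # token 0
    [0.0, 1.0],   # token 1
    [-1.0, 0.5],  # token 2
]

logits = dense_head_logits(hidden, weight)
next_token = greedy_token(logits)
```

Because the dense head touches every row of the weight matrix for each generated token, its cost scales with vocabulary size, which is why this layer is a natural target for a more efficient replacement.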