gpt-oss was possible thanks to new engineering efforts in 🤗 transformers. We just dropped a blog covering them:
- Kernels from the Hub
- MXFP4 Quantization
- Tensor & Expert Parallelism
- Dynamic Sliding Window & Cache
- Continuous Batching & Paged Attention

Grab a coffee & dive in! ☕️
https://huggingface.co/blog/faster-transformers