view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8, 2025 • 31
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 279
cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit Text Generation • Updated about 4 hours ago • 184k • 22
MathArena Outputs Collection Outputs of models on the MathArena Benchmark. • 16 items • Updated Dec 8, 2025 • 1
view article Article Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation Sep 16, 2025 • 17
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 179