TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate Paper • 2504.19874 • Published Apr 28, 2025 • 30
Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning Paper • 2502.19655 • Published Feb 27, 2025 • 1