view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 15 days ago • 854
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 2 days ago • 268
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization Paper • 2602.03537 • Published Feb 3 • 4
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 173