deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated about 2 hours ago • 631k • 3.59k
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 22 days ago • 36
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 5 days ago • 891k • 295
Post 10527 • 1440GB of VRAM is incredibly satisfying • 17 replies