nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 29 minutes ago • 56k • 106
AfriNLLB: Efficient Translation Models for African Languages Paper • 2602.09373 • Published Feb 10 • 1
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published 18 days ago • 4
Quantized VibeThinker-1.5B Collection Verified models. Tested with vLLM. • 5 items • Updated Dec 6, 2025 • 1
AfriNLP/AfriNLLB-12enc-8dec-iterative-548m-ft Translation • 0.5B • Updated 8 days ago • 897 • 1
AfriNLLB: Efficient Translation Models for African Languages Paper • 2602.09373 • Published Feb 10 • 1 • 1