Soumye Singhal

soumye

AI & ML interests

LLM Post-training

Recent Activity

liked a model 21 days ago

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

liked a model 21 days ago

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

liked a model 4 months ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16

Organizations

liked 2 models 21 days ago

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Text Generation • 335B • Updated about 19 hours ago • 383k • 214

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Text Generation • 561B • Updated 15 days ago • 129k • 246

liked 5 models 4 months ago

liked a model 5 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 362k • 2.57k

authored 3 papers 6 months ago

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published Nov 6, 2025 • 32

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 44

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 43

upvoted 2 papers 6 months ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 43

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 44

upvoted an article 6 months ago

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia • Dec 15, 2025 • 112

liked 2 models 6 months ago

nvidia/Qwen3-Nemotron-235B-A22B-GenRM

Text Generation • 235B • Updated Dec 15, 2025 • 86 • 31

unsloth/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated Dec 31, 2025 • 20.3k • 317

upvoted 2 collections 6 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 14 days ago • 167

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 14 days ago • 170

liked 2 models 6 months ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated Mar 15 • 340k • 351

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.13M • 774
