👋 Open to Work

Dung Vo PRO

tuandunghcmut

AI & ML interests

None yet

Recent Activity

updated a dataset about 16 hours ago

tuandunghcmut/nvidia_instruction_following_if_split_v3_non_thinking

published a dataset about 16 hours ago

tuandunghcmut/nvidia_instruction_following_if_split_v3_non_thinking

updated a dataset about 16 hours ago

tuandunghcmut/nvidia_instruction_following_if_split_v3

upvoted 2 collections about 1 month ago

Moonlight-A3B

Moonshot's Compute-efficient MoE LLM, first Scaling Up of Muon Optimizer • 3 items • Updated Jan 27 • 14

Nemotron Chat & Instruction Following

Datasets for building helpful, multi-turn, instruction-following conversational models across single and multi-turn settings. • 19 items • Updated 24 days ago • 5

upvoted a collection 3 months ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 24 days ago • 50

upvoted a paper 3 months ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 120

upvoted 2 collections 4 months ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 23 items • Updated 24 days ago • 333

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 24 days ago • 169

upvoted a paper 4 months ago

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

Paper • 2601.14652 • Published Jan 21 • 4

upvoted a collection 4 months ago

Layout Generation Dataset

2 items • Updated Jul 27, 2024 • 1

upvoted 2 papers 4 months ago

ToolComp: A Multi-Tool Reasoning & Process Supervision Benchmark

Paper • 2501.01290 • Published Jan 2, 2025 • 1

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 249

upvoted a collection 4 months ago

Nemotron-Terminal

We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 24 days ago • 35

upvoted 3 papers 4 months ago

Endless Terminals: Scaling RL Environments for Terminal Agents

Paper • 2601.16443 • Published Jan 23 • 19

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 211

Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

Paper • 2602.18292 • Published Feb 20 • 13

upvoted an article 5 months ago

SmolLM-Smashed: Tiny Giants, Optimized for Speed

Article • PrunaAI • Jan 13 • 15

upvoted a paper 5 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 180

upvoted an article 5 months ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Article • LinkedIn • Jan 27 • 80

upvoted a collection 5 months ago

Enterprise Agents and Benchmarks

Enterprise agent ecosystem featuring AssetOpsBench (industrial) and ITBench (SRE, FinOps, CISO), CUGA to accelerate AI Automation • 21 items • Updated 12 days ago • 18

upvoted 2 papers 5 months ago

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published Jan 20 • 57

Deep Research: A Systematic Survey

Paper • 2512.02038 • Published Nov 24, 2025 • 73