AIMS: Intent-Aware Safety Classification Collection Human-annotated intent dataset and intent-aware safety classifiers (SFT, DPO, distillation, GRPO) for robust LLM guardrails. • 5 items • Updated about 13 hours ago • 3
Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models Paper • 2505.20087 • Published May 26, 2025 • 3
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models nvidia • Dec 15, 2025 • 112
view article Article Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications nvidia • Dec 2, 2025 • 26
view article Article Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications nvidia • Dec 2, 2025 • 26
NemoGuard Collection Essential datasets and models for content safety, topic-following, and security guardrails • 13 items • Updated 12 days ago • 23
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper • 2405.01481 • Published May 2, 2024 • 30