Explainability Subcard

Field	Description
Intended Domain	Multimodal Content Safety
Model Type	Safety Classifier
Intended Users	AI/ML Engineers, LLM Developers, Safety Assurance Teams
Output	Text
Describe how the model works:	Type: Fine-tuned Transformer (decoder-only) used as a classifier. Backbone: Google Gemma-3-4B-it. Parameters: 4B (billion).
Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of:	Not Applicable
Technical Limitations:	• The model accepts a single text input and, optionally, one image. It does not accept more than one image.
Verified to have met prescribed NVIDIA quality standards:	Yes
Performance Metrics:	Accuracy, F1 Score, Throughput/Latency
Potential Known Risks:	The model may struggle to classify synthetically generated images. The model may also produce false positives or false negatives for particular unsafe categories.
Terms of Use:	Use of the model is governed by the OpenMDW License Agreement, version 1.1 (OpenMDW-1.1), Gemma Terms of Use and Gemma Prohibited Use Policy.
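
The input constraint listed under Technical Limitations (one text input, at most one image) can be checked before a request reaches the model. The following is a minimal sketch only: ClassifierRequest and validate_request are hypothetical names introduced for illustration and are not part of any published API for this model. Rejecting empty text is also an assumption, not something the card states.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ClassifierRequest:
    """Hypothetical request container; not part of any published API."""
    text: str
    image_paths: List[str] = field(default_factory=list)


def validate_request(req: ClassifierRequest) -> None:
    """Raise ValueError if the request violates the documented input limits."""
    # Assumption: an empty or whitespace-only string is treated as missing text.
    if not isinstance(req.text, str) or not req.text.strip():
        raise ValueError("exactly one non-empty text input is required")
    # Documented limit: the image is optional, and more than one is not accepted.
    if len(req.image_paths) > 1:
        raise ValueError("at most one image is accepted per request")
```

Calling validate_request on a request with text only, or with text and one image, returns without error; a request carrying two or more images raises ValueError before any inference is attempted.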