# Explainability Subcard | Field | Description | | ----------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | Intended Domain | Multimodal Content Safety | | Model Type | Safety Classifier | | Intended Users | AI/ML Engineers, LLM Developers, Safety Assurance Teams | | Output | Text | | Describe how the model works: | Type: Finetuned Transformer (Decoder-only) working as a classifier. Backbone: Google Gemma-3-4B-it Parameters: 4B (Billion) | | Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of: | Not Applicable | | Technical Limitations: | • The model only accepts a single text input along with an optional image. The model does not accept more than one image. | | Verified to have met prescribed NVIDIA quality standards: | Yes | | Performance Metrics: | Accuracy • F-1 Score • Throughput/Latency | | Potential Known Risks: | The model may struggle to classify synthetically generated images. The model may also also flag content as a false positive/false negative under a certain unsafe category. | | Terms of Use: | Use of the model is governed by the [OpenMDW License Agreement, version 1.1](https://raw.githubusercontent.com/OpenMDW/OpenMDW/refs/heads/main/1.1/LICENSE.OpenMDW-1.1) (OpenMDW-1.1), [Gemma Terms of Use](https://ai.google.dev/gemma/terms) and [Gemma Prohibited Use Policy](https://ai.google.dev/gemma/prohibited_use_policy). |