# Explainability Subcard


| Field                                                                                                 | Description                                                                                                                                                                                                                                                                                             |
| ----------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| Intended Domain                                                                                       | Multimodal Content Safety                                                                                                                                                                                                                                                                               |
| Model Type                                                                                            | Safety Classifier                                                                                                                                                                                                                                                                                       |
| Intended Users                                                                                        | AI/ML Engineers, LLM Developers, Safety Assurance Teams                                                                                                                                                                                                                                                 |
| Output                                                                                                | Text                                                                                                                                                                                                                                                                                                    |
| Describe how the model works:                                                                         | Type: Finetuned Transformer (Decoder-only) working as a classifier. Backbone: Google Gemma-3-4B-it Parameters: 4B (Billion)                                                                                                                                                                             |
| Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of: | Not Applicable                                                                                                                                                                                                                                                                                          |
| Technical Limitations:                                                                                | • The model only accepts a single text input along with an optional image. The model does not accept more than one image.                                                                                                                                                                                   |
| Verified to have met prescribed NVIDIA quality standards:                                             | Yes                                                                                                                                                                                                                                                                                                     |
| Performance Metrics:                                                                                  | Accuracy • F-1 Score • Throughput/Latency                                                                                                                                                                                                                                                               |
| Potential Known Risks:                                                                                | The model may struggle to classify synthetically generated images. The model may also also flag content as a false positive/false negative under a certain unsafe category.                                                                                                                             |
| Terms of Use:                                                                                         | Use of the model is governed by the [OpenMDW License Agreement, version 1.1](https://raw.githubusercontent.com/OpenMDW/OpenMDW/refs/heads/main/1.1/LICENSE.OpenMDW-1.1) (OpenMDW-1.1), [Gemma Terms of Use](https://ai.google.dev/gemma/terms) and [Gemma Prohibited Use Policy](https://ai.google.dev/gemma/prohibited_use_policy). |