| Intended Domain |
Multimodal Content Safety |
| Model Type |
Safety Classifier |
| Intended Users |
AI/ML Engineers, LLM Developers, Safety Assurance Teams |
| Output |
Text |
| Describe how the model works: |
Type: Finetuned Transformer (Decoder-only) working as a classifier. Backbone: Google Gemma-3-4B-it Parameters: 4B (Billion) |
| Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of: |
Not Applicable |
| Technical Limitations: |
• The model only accepts a single text input along with an optional image. The model does not accept more than one image. |
| Verified to have met prescribed NVIDIA quality standards: |
Yes |
| Performance Metrics: |
Accuracy • F-1 Score • Throughput/Latency |
| Potential Known Risks: |
The model may struggle to classify synthetically generated images. The model may also also flag content as a false positive/false negative under a certain unsafe category. |
| Terms of Use: |
Use of the model is governed by the OpenMDW License Agreement, version 1.1 (OpenMDW-1.1), Gemma Terms of Use and Gemma Prohibited Use Policy. |