| ================================================================================ | |
| DETOXIFY-MEDIUM MODEL BENCHMARK RESULTS | |
| ================================================================================ | |
| π EXECUTIVE SUMMARY | |
| -------------------------------------------------- | |
| Benchmark Date: 2025-09-18 13:17:58 | |
| Model: Detoxify-Medium | |
| Dataset: ParaDetox (https://github.com/s-nlp/paradetox) | |
| Total Samples: 1011 | |
| π― OVERALL PERFORMANCE METRICS | |
| -------------------------------------------------- | |
| Toxicity Reduction: 0.178 | |
| Semantic Preservation: 0.561 | |
| Fluency: 0.929 | |
| Average Latency: 160.2ms | |
| Original Toxicity: 0.196 | |
| Final Toxicity: 0.018 | |
| π DATASET BREAKDOWN | |
| -------------------------------------------------- | |
| πΉ PARADETOX TOXIC NEUTRAL | |
| Samples: 1000 | |
| Toxicity Reduction: 0.044 | |
| Semantic Preservation: 0.645 | |
| Fluency: 0.922 | |
| Latency: 156.2ms | |
| Original Toxicity: 0.051 | |
| Final Toxicity: 0.007 | |
| πΉ PARADETOX HIGH TOXICITY | |
| Samples: 11 | |
| Toxicity Reduction: 0.313 | |
| Semantic Preservation: 0.477 | |
| Fluency: 0.936 | |
| Latency: 164.3ms | |
| Original Toxicity: 0.342 | |
| Final Toxicity: 0.029 | |
| ================================================================================ |