Rename README.md to Added usage & limitations

README.md → Added usage & limitations +24 -2

@@ -11,7 +11,7 @@ A tiny comment toxicity classifier model at only 2M parameters. With only ~10MB
 A paper on this model is being released soon.
-### Benchmarks
 The Tiny-Toxic-Detector achieves an impressive 90.26% on the Toxigen benchmark and 87.34% on the Jigsaw-Toxic-Comment-Classification-Challenge. Here we compare our results against other toxic classification models:
@@ -26,7 +26,7 @@ The Tiny-Toxic-Detector achieves an impressive 90.26% on the Toxigen benchmark a
 | **Tiny-toxic-detector**           | **2M**            | **90.26**   | 87.34      |
-### Usage
 This model uses custom architecture and requires some extra custom code to work. Below you can find the architecture and a fully-usable example.
 <details>
@@ -203,3 +203,25 @@ with torch.no_grad():
   logits = outputs["logits"].squeeze()
   prediction = "Toxic" if logits > 0.5 else "Not Toxic"
 ```

 A paper on this model is being released soon.
+## Benchmarks
 The Tiny-Toxic-Detector achieves an impressive 90.26% on the Toxigen benchmark and 87.34% on the Jigsaw-Toxic-Comment-Classification-Challenge. Here we compare our results against other toxic classification models:
 | **Tiny-toxic-detector**           | **2M**            | **90.26**   | 87.34      |
+## Usage
 This model uses custom architecture and requires some extra custom code to work. Below you can find the architecture and a fully-usable example.
 <details>
   logits = outputs["logits"].squeeze()
   prediction = "Toxic" if logits > 0.5 else "Not Toxic"
 ```
+## Usage and Limitations
+Toxicity classification models always have certain limitations you should be aware of, and this model is no different.
+### Intended Usage
+The Tiny-toxic-detector is designed to classify comments for toxicity. It is particularly useful in scenarios where minimal resource usage and rapid inference are essential. Key features include:
+* Low Resource Consumption: The model needs only about 10MB of RAM and 8MB of VRAM, which makes it well suited to environments with limited hardware resources.
+* Fast Inference: On CPU-based systems, the Tiny-toxic-detector runs significantly faster than larger models. Because GPU inference carries fixed overhead, small models with relatively few input tokens, including this one, are often faster on CPU than on GPU.
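The CPU-versus-GPU point above can be checked on your own hardware. The following is a minimal timing sketch, not part of the model's code: `median_latency_ms` and its `predict` argument are hypothetical names, and `predict` stands for whatever callable wraps your inference call.

```python
import time
from statistics import median


def median_latency_ms(predict, texts, warmup=3, repeats=20):
    """Return the median wall-clock time of predict(texts), in milliseconds.

    `predict` is a hypothetical callable wrapping the inference call.
    When timing on a GPU, `predict` should wait for the device to finish
    (for example by calling torch.cuda.synchronize()) before returning,
    because CUDA kernels are launched asynchronously.
    """
    # Warm-up runs are discarded so one-off setup costs do not skew the result.
    for _ in range(warmup):
        predict(texts)

    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        predict(texts)
        timings.append((time.perf_counter() - start) * 1000.0)
    return median(timings)


# Stand-in predictor so the sketch runs without the model installed.
def dummy_predict(texts):
    return [len(t) for t in texts]


latency = median_latency_ms(dummy_predict, ["an example comment"])
```

Running the same helper once with a CPU-backed `predict` and once with a GPU-backed one, on comments of typical length, gives a like-for-like comparison. The median is used rather than the mean so that occasional slow runs have less influence.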
+### Limitations
+* Training Data
+  * The Tiny-toxic-detector has been trained exclusively on English-language data, limiting its ability to classify toxicity in other languages.
+* Maximum Context Length
+  * The model can handle up to 512 input tokens. Comments exceeding this length are out of scope for this model.
+  * Extending the context length is possible, but the model has not been trained or validated for longer inputs. Early tests with a 4096-token context resulted in a performance drop of over 10% on the Toxigen benchmark.
+* Language Ambiguity
+  * Like any other model, the Tiny-toxic-detector may struggle with ambiguous or nuanced language. Although benchmarks such as Toxigen evaluate performance on ambiguous language, the model may still misclassify comments whose toxicity is not clear-cut.
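
One way to respect the 512-token limit described above is to check the length of the tokenized input before running the model. The sketch below is an assumption about how a caller might do this, not the author's code: `MAX_TOKENS` and `fit_to_context` are hypothetical names, and the function works on a plain list of token IDs from whichever tokenizer you use.

```python
MAX_TOKENS = 512  # context limit stated in the Limitations section


def fit_to_context(token_ids, max_tokens=MAX_TOKENS):
    """Truncate token_ids to max_tokens and report whether anything was cut.

    Returns a (kept_ids, was_truncated) tuple. Truncation is only one
    possible policy; a caller may prefer to reject over-long comments.
    """
    was_truncated = len(token_ids) > max_tokens
    return list(token_ids[:max_tokens]), was_truncated


# Example with made-up token IDs standing in for real tokenizer output.
kept, truncated = fit_to_context(list(range(600)))
```

Returning the flag alongside the truncated IDs lets the caller log or route over-long comments separately, since a prediction made on a truncated comment does not cover the text that was cut.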