Update README.md
README.md
CHANGED
@@ -5,7 +5,7 @@ license: apache-2.0
 # Refusal Classifier
 
 <div align="left">
-<img src="figures/words.png" width="
+<img src="figures/words.png" width="100%" alt="Words"/>
 </div>
 
 *Tired of seeing these? You've come to the right place.*
@@ -36,9 +36,9 @@ Majority vote from multiple refusal classifiers and LLM-as-a-judge were employed
 
 ### Evaluation
 <div align="left">
-<img src="figures/plot.png" width="
+<img src="figures/plot.png" width="100%" alt="Plot"/>
 </div>
-Inference throughput vs F1 score on the test set (2,900 non-refusals and 2,900 refusals) for several refusal open-source classifiers. Throughput benchmarked with sequence length 512, batch size 16 on 1x RTX Pro 6000.
+Inference throughput vs F1 score on the test set (2,900 non-refusals and 2,900 refusals) for several refusal open-source classifiers. Throughput benchmarked with sequence length 512, batch size 16 on 1x NVIDIA RTX Pro 6000.
 
 `alpha_model` is a earlier checkpoint that I wasn't completely satisfied with, but it was leveraged for the final round of data curation.
 