natong19
/

refusal_classifier

Model card Files Files and versions

natong19 commited on 12 days ago

Commit

6669ff0

·

verified ·

1 Parent(s): 159487c

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ license: apache-2.0
 # Refusal Classifier
 <div align="left">
-<img src="figures/words.png" width="60%" alt="Words"/>
 </div>
 *Tired of seeing these? You've come to the right place.*
@@ -36,9 +36,9 @@ Majority vote from multiple refusal classifiers and LLM-as-a-judge were employed
 ### Evaluation
 <div align="left">
-<img src="figures/plot.png" width="60%" alt="Plot"/>
 </div>
-Inference throughput vs F1 score on the test set (2,900 non-refusals and 2,900 refusals) for several refusal open-source classifiers. Throughput benchmarked with sequence length 512, batch size 16 on 1x RTX Pro 6000.
 `alpha_model` is a earlier checkpoint that I wasn't completely satisfied with, but it was leveraged for the final round of data curation.

 # Refusal Classifier
 <div align="left">
+<img src="figures/words.png" width="100%" alt="Words"/>
 </div>
 *Tired of seeing these? You've come to the right place.*
 ### Evaluation
 <div align="left">
+<img src="figures/plot.png" width="100%" alt="Plot"/>
 </div>
+Inference throughput vs F1 score on the test set (2,900 non-refusals and 2,900 refusals) for several refusal open-source classifiers. Throughput benchmarked with sequence length 512, batch size 16 on 1x NVIDIA RTX Pro 6000.
 `alpha_model` is a earlier checkpoint that I wasn't completely satisfied with, but it was leveraged for the final round of data curation.