Threshold for AST MODEL
#1
by yyw2683 - opened
Hi, I'm trying to replicate the results, can you please share the threshold using for classification at your convenience?
Apologies for this delayed and slow reply.
We didn’t use a separately tuned threshold. The AST fine-tuned classifier outputs 2-class logits, and we take the argmax over the logits to obtain the predicted label (default decision rule). For the binary case, this is equivalent to using a 0.5 threshold on the softmax probability for the “Synthetic” class. I hope this helps.