GLUE fine-tuning task
To run the experiments, use the task-specific scripts listed below; a combined example follows the list.
Run ./mnli.sh to fine-tune the MNLI base or large model.
Run ./cola.sh to fine-tune the CoLA large model.
Run ./sst2.sh to fine-tune the SST-2 large model.
Run ./stsb.sh to fine-tune the STS-B large model.
Run ./rte.sh to fine-tune the RTE large model.
Run ./qqp.sh to fine-tune the QQP large model.
Run ./qnli.sh to fine-tune the QNLI large model.
Run ./mrpc.sh to fine-tune the MRPC large model.
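For example, a minimal sketch that fine-tunes the large model on every task in turn (assuming the scripts are run from this folder with their default settings):

```bash
# Minimal sketch: fine-tune on each GLUE task by invoking its script in turn.
# Assumes the scripts live in the current folder and need no extra arguments.
for task in mnli cola sst2 stsb rte qqp qnli mrpc; do
  bash ./${task}.sh
done
```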
Export the model to ONNX format and quantization
To export the model to ONNX format during evaluation, use the argument --export_ort_model True.
To export a quantized model, use --fp16 False --export_ort_model True.
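For example, assuming the task scripts forward extra command-line arguments to the underlying training command (an assumption; if they do not, add the flags to the command inside the script instead), the export could be requested like this:

```bash
# Hedged example: request ONNX export while evaluating SST-2.
# Whether ./sst2.sh forwards these flags is an assumption about the scripts.
./sst2.sh --fp16 True --export_ort_model True    # exports an fp16 ONNX model
./sst2.sh --fp16 False --export_ort_model True   # exports fp32 and quantized ONNX models
```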
The exported model will be saved under the output folder. Its name ends with
<prefix>__onnx_fp16.bin if --fp16 is True; otherwise the outputs are <prefix>__onnx_fp32.bin and <prefix>__onnx_qt.bin.
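As a quick sanity check, you can list the exported binaries; the folder name below is only a placeholder, use the output directory your run actually writes to:

```bash
# List the exported ONNX binaries; "output" is a placeholder directory name.
ls output/*__onnx_*.bin
```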
Please check the ONNX documentation for more details.