task: text-classification
Fixed parameters:
- model_name_or_path:
Bhumika/roberta-base-finetuned-sst2 - dataset:
- path:
glue - eval_split:
validation - data_keys:
{'primary': 'sentence'} - ref_keys:
['label'] - name:
sst2
- path:
- quantization_approach:
dynamic - node_exclusion:
[] - per_channel:
False - framework:
onnxruntime - framework_args:
- opset:
15 - optimization_level:
1
- opset:
- aware_training:
False
Benchmarked parameters:
- operators_to_quantize:
['Add', 'MatMul'],['Add']
Evaluation
Below, time metrics for
- Batch size: 8
- Input length: 128
operators_to_quantize latency_mean (original, ms) latency_mean (optimized, ms) throughput (original, /s) throughput (optimized, /s) accuracy (original) accuracy (optimized) ['Add', 'MatMul']| 619.76 161.66 | 1.80 6.20 | 1.000 1.000 ['Add']| 611.74 478.48 | 1.80 2.20 | 1.000 1.000