Add real Test Dataset inference benchmarks (ONNX Runtime)
Browse files
real_test_inference_metrics_T4_onnx.csv
ADDED
|
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
|
|
|
| 1 |
+
model,engine,provider,gpu_used,test_samples,total_valid_tokens,total_time_seconds,latency_ms_per_sample,throughput_samples_per_sec,throughput_tokens_per_sec
|
| 2 |
+
google-bert-bert-base-multilingual-cased-onnx,onnxruntime,CUDAExecutionProvider,Tesla T4,4677,207365,133.99,28.65,34.91,1547.66
|