PuxAI commited on
Commit
cc58d74
·
verified ·
1 Parent(s): d5d32c9

Add real Test Dataset inference benchmarks (ONNX Runtime)

Browse files
real_test_inference_metrics_T4_onnx.csv ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ model,engine,provider,gpu_used,test_samples,total_valid_tokens,total_time_seconds,latency_ms_per_sample,throughput_samples_per_sec,throughput_tokens_per_sec
2
+ google-bert-bert-base-multilingual-cased-onnx,onnxruntime,CUDAExecutionProvider,Tesla T4,4677,207365,133.99,28.65,34.91,1547.66