rag-model-evaluation / model_comparison_results.csv
Mã Lương Khánh
Initial commit: RAG model evaluation system
8248295
raw
history blame contribute delete
259 Bytes
Model,Win_Rate(%),Elo_Score,Avg_Score,Total_Score
RAG,89.60000000000001,6273564.000000055,4.48,224
GPT,42.8,2658521.562229258,2.14,107
ailuat,43.6,2657619.9600229254,2.18,109
law&press,39.2,2113909.6166749434,1.96,98
lexcentra,36.8,1906093.5376822255,1.84,92