| 2025-04-01 01:12:07,678 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:12:07,681 - Evaluation - INFO - task: conv |
| 2025-04-01 01:12:07,682 - Evaluation - INFO - model_id: 8, |
| 2025-04-01 01:12:07,682 - Evaluation - INFO - average: 113.3, |
| 2025-04-01 01:12:07,683 - Evaluation - INFO - question: 30, |
| 2025-04-01 01:12:07,684 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:12:48,255 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:12:48,256 - Evaluation - INFO - task: detail |
| 2025-04-01 01:12:48,256 - Evaluation - INFO - model_id: 8, |
| 2025-04-01 01:12:48,257 - Evaluation - INFO - average: 111.4, |
| 2025-04-01 01:12:48,258 - Evaluation - INFO - question: 30, |
| 2025-04-01 01:12:48,258 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:13:25,181 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:13:25,182 - Evaluation - INFO - task: complex |
| 2025-04-01 01:13:25,182 - Evaluation - INFO - model_id: 8, |
| 2025-04-01 01:13:25,183 - Evaluation - INFO - average: 103.7, |
| 2025-04-01 01:13:25,184 - Evaluation - INFO - question: 30, |
| 2025-04-01 01:13:25,184 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:13:25,185 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 01:13:25,186 - Evaluation - INFO - model_id: 8, |
| 2025-04-01 01:13:25,187 - Evaluation - INFO - total_average: 109.46666666666665, |
| 2025-04-01 01:13:25,187 - Evaluation - INFO - total_question: 90, |
| 2025-04-01 01:13:25,188 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
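The summary figures above can be cross-checked from the per-task blocks. The following is a minimal sketch, not the evaluation script itself: the names `LINE_RE`, `parse_tasks`, and `overall_average` are hypothetical, and the question-weighted mean is an assumption about how `total_average` is derived. Because every task here has 30 questions, the weighted and unweighted means coincide, so the log alone cannot tell the two apart.

```python
import re

# Per-task lines copied from the log above (table pipes omitted).
LOG = """\
2025-04-01 01:12:07,681 - Evaluation - INFO - task: conv
2025-04-01 01:12:07,682 - Evaluation - INFO - average: 113.3,
2025-04-01 01:12:07,683 - Evaluation - INFO - question: 30,
2025-04-01 01:12:48,256 - Evaluation - INFO - task: detail
2025-04-01 01:12:48,257 - Evaluation - INFO - average: 111.4,
2025-04-01 01:12:48,258 - Evaluation - INFO - question: 30,
2025-04-01 01:13:25,182 - Evaluation - INFO - task: complex
2025-04-01 01:13:25,183 - Evaluation - INFO - average: 103.7,
2025-04-01 01:13:25,184 - Evaluation - INFO - question: 30,
"""

# Matches only the per-task keys; "total_average" and "total_question"
# do not match because "INFO - " must be followed directly by the key.
LINE_RE = re.compile(r"INFO - (task|average|question): ([^,|]+)")


def parse_tasks(log_text):
    """Collect one dict per task block: task name, average, question count."""
    tasks = []
    current = None
    for line in log_text.splitlines():
        match = LINE_RE.search(line)
        if not match:
            continue
        key, value = match.group(1), match.group(2).strip()
        if key == "task":
            current = {"task": value}
            tasks.append(current)
        elif current is not None:
            current[key] = float(value) if key == "average" else int(value)
    return tasks


def overall_average(tasks):
    """Question-weighted mean of the per-task averages (an assumption)."""
    total_questions = sum(t["question"] for t in tasks)
    weighted_sum = sum(t["average"] * t["question"] for t in tasks)
    return weighted_sum / total_questions


tasks = parse_tasks(LOG)
print(sum(t["question"] for t in tasks))  # total question count
print(overall_average(tasks))             # compare with total_average in the log
```

The result should agree with the logged `total_average` up to floating-point rounding; the last digit may differ depending on the order of operations.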