| 2025-04-01 20:09:01,480 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:09:01,482 - Evaluation - INFO - task: conv |
| 2025-04-01 20:09:01,483 - Evaluation - INFO - model_id: 1, |
| 2025-04-01 20:09:01,484 - Evaluation - INFO - average: 4.5, |
| 2025-04-01 20:09:01,485 - Evaluation - INFO - question: 30, |
| 2025-04-01 20:09:01,485 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:19:21,294 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:19:21,295 - Evaluation - INFO - task: detail |
| 2025-04-01 20:19:21,295 - Evaluation - INFO - model_id: 1, |
| 2025-04-01 20:19:21,296 - Evaluation - INFO - average: 3.7, |
| 2025-04-01 20:19:21,297 - Evaluation - INFO - question: 30, |
| 2025-04-01 20:19:21,297 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:21:08,262 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:21:08,262 - Evaluation - INFO - task: complex |
| 2025-04-01 20:21:08,263 - Evaluation - INFO - model_id: 1, |
| 2025-04-01 20:21:08,264 - Evaluation - INFO - average: 5.6, |
| 2025-04-01 20:21:08,264 - Evaluation - INFO - question: 30, |
| 2025-04-01 20:21:08,265 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:21:08,266 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:21:08,267 - Evaluation - INFO - model_id: 1, |
| 2025-04-01 20:21:08,267 - Evaluation - INFO - total_average: 4.6, |
| 2025-04-01 20:21:08,268 - Evaluation - INFO - total_question: 90, |
| 2025-04-01 20:21:08,269 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
|
|