| 2025-04-01 20:24:51,773 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:24:51,776 - Evaluation - INFO - task: conv |
| 2025-04-01 20:24:51,777 - Evaluation - INFO - model_id: 2, |
| 2025-04-01 20:24:51,777 - Evaluation - INFO - average: 26.3, |
| 2025-04-01 20:24:51,778 - Evaluation - INFO - question: 30, |
| 2025-04-01 20:24:51,779 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:35:09,449 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:35:09,450 - Evaluation - INFO - task: detail |
| 2025-04-01 20:35:09,451 - Evaluation - INFO - model_id: 2, |
| 2025-04-01 20:35:09,451 - Evaluation - INFO - average: 18.5, |
| 2025-04-01 20:35:09,452 - Evaluation - INFO - question: 30, |
| 2025-04-01 20:35:09,453 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:35:34,601 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:35:34,602 - Evaluation - INFO - task: complex |
| 2025-04-01 20:35:34,603 - Evaluation - INFO - model_id: 2, |
| 2025-04-01 20:35:34,603 - Evaluation - INFO - average: 43.5, |
| 2025-04-01 20:35:34,604 - Evaluation - INFO - question: 30, |
| 2025-04-01 20:35:34,605 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:35:34,606 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
| 2025-04-01 20:35:34,607 - Evaluation - INFO - model_id: 2, |
| 2025-04-01 20:35:34,607 - Evaluation - INFO - total_average: 29.433333333333334, |
| 2025-04-01 20:35:34,608 - Evaluation - INFO - total_question: 90, |
| 2025-04-01 20:35:34,609 - Evaluation - INFO - +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ |
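
The `total_average` and `total_question` values above are consistent with combining the three per-task results. The log does not say how the evaluator aggregates them, so the following is only a sketch under that assumption: it takes a question-weighted mean of the per-task averages, which equals the plain mean here because every task has 30 questions. The `results` dict and the `summarize` helper are hypothetical names introduced for illustration; they are not part of the evaluation code.

```python
import math

# Per-task values copied from the log above: task -> (average, question count).
results = {
    "conv": (26.3, 30),
    "detail": (18.5, 30),
    "complex": (43.5, 30),
}


def summarize(results):
    """Return (question-weighted mean of the averages, total question count).

    Assumption: the evaluator weights each task by its number of questions.
    With equal counts per task this matches the unweighted mean.
    """
    total_questions = sum(n for _, n in results.values())
    weighted_sum = sum(avg * n for avg, n in results.values())
    return weighted_sum / total_questions, total_questions


total_average, total_question = summarize(results)

# Compare against the logged totals, allowing for floating-point rounding.
assert math.isclose(total_average, 29.433333333333334, rel_tol=1e-9)
assert total_question == 90
print(f"total_average: {total_average:.4f}, total_question: {total_question}")
```

If the tasks had different question counts, the weighted and unweighted means would diverge, and the log alone would not tell which one the evaluator reports.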