Automated MNLP evaluation report (2026-05-17)

#4
by zechen-nlp - opened
Files changed (1) hide show
  1. EVAL_REPORT.md +6 -6
EVAL_REPORT.md CHANGED
@@ -2,7 +2,7 @@
2
 
3
  - **Model repo:** [`cs-552-2026-catma/multilingual_model`](https://huggingface.co/cs-552-2026-catma/multilingual_model)
4
  - **Owner(s):** group **catma**
5
- - **Generated at:** 2026-05-16T04:57:46+00:00 (UTC)
6
  - **Pipeline:** [mnlp-project-ci](https://github.com/eric11eca/mnlp-project-ci)
7
 
8
  _This PR is opened automatically by the course CI. It is **non-blocking** — you do not need to merge it. The next nightly run will refresh this file._
@@ -31,7 +31,7 @@ _Prompts are intentionally omitted to avoid revealing benchmark contents. For mu
31
 
32
  ```text
33
  <think>
34
- विकल्पों का विश्लेषण करते हुए, सही उत्तर D है।
35
  </think>
36
 
37
  \boxed{D}
@@ -39,15 +39,15 @@ _Prompts are intentionally omitted to avoid revealing benchmark contents. For mu
39
 
40
  **Incorrect** (1 shown)
41
 
42
- - **reference**: `B`
43
  - **overall** (0/1 completions correct)
44
- - **extracted** (✗): `A`
45
  - **completion**:
46
 
47
  ```text
48
  <think>
49
- Analizando las opciones, la respuesta correcta es A.
50
  </think>
51
 
52
- \boxed{A}
53
  ```
 
2
 
3
  - **Model repo:** [`cs-552-2026-catma/multilingual_model`](https://huggingface.co/cs-552-2026-catma/multilingual_model)
4
  - **Owner(s):** group **catma**
5
+ - **Generated at:** 2026-05-17T04:52:19+00:00 (UTC)
6
  - **Pipeline:** [mnlp-project-ci](https://github.com/eric11eca/mnlp-project-ci)
7
 
8
  _This PR is opened automatically by the course CI. It is **non-blocking** — you do not need to merge it. The next nightly run will refresh this file._
 
31
 
32
  ```text
33
  <think>
34
+ 分析各个选项,正确答案是D
35
  </think>
36
 
37
  \boxed{D}
 
39
 
40
  **Incorrect** (1 shown)
41
 
42
+ - **reference**: `D`
43
  - **overall** (0/1 completions correct)
44
+ - **extracted** (✗): `C`
45
  - **completion**:
46
 
47
  ```text
48
  <think>
49
+ Проанализировав варианты, правильный ответ C.
50
  </think>
51
 
52
+ \boxed{C}
53
  ```