Commit ea8e1bd by zechen-nlp · verified · 1 parent: 2f105b8

Update Automated MNLP evaluation report (2026-05-17)

Files changed (1)
  1. EVAL_REPORT.md +7 -7
EVAL_REPORT.md CHANGED
@@ -2,7 +2,7 @@

  - **Model repo:** [`cs-552-2026-thinkinsidethebox/general_knowledge_model`](https://huggingface.co/cs-552-2026-thinkinsidethebox/general_knowledge_model)
  - **Owner(s):** group **thinkinsidethebox**
- - **Generated at:** 2026-05-16T04:57:46+00:00 (UTC)
+ - **Generated at:** 2026-05-17T04:52:19+00:00 (UTC)
  - **Pipeline:** [mnlp-project-ci](https://github.com/eric11eca/mnlp-project-ci)

  _This PR is opened automatically by the course CI. It is **non-blocking** — you do not need to merge it. The next nightly run will refresh this file._
@@ -24,22 +24,22 @@ _Prompts are intentionally omitted to avoid revealing benchmark contents. For mu

  **Correct** (1 shown)

- - **reference**: `C`
+ - **reference**: `J`
  - **overall** (1/1 completions correct)
- - **extracted** (✓): `C`
+ - **extracted** (✓): `J`
  - **completion**:

  ```text
- \boxed{C}
+ \boxed{J}
  ```

  **Incorrect** (1 shown)

- - **reference**: `G`
+ - **reference**: `A`
  - **overall** (0/1 completions correct)
- - **extracted** (✗): `A`
+ - **extracted** (✗): `J`
  - **completion**:

  ```text
- \boxed{A}
+ \boxed{J}
  ```
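
The **reference**, **extracted**, and **overall** fields in the report suggest that each completion's `\boxed{...}` letter is pulled out and compared with the reference answer. The pipeline's actual extraction code is not shown here, so the following is only a minimal sketch of how such a check might work; the names `BOXED`, `extract_choice`, and `score` are hypothetical, and taking the last boxed letter is an assumption.

```python
import re

# Hypothetical pattern: a single capital letter inside \boxed{...}.
# The real CI may accept other answer formats; this is an assumption.
BOXED = re.compile(r"\\boxed\{([A-Z])\}")


def extract_choice(completion):
    """Return the letter from the last boxed answer in a completion, or None."""
    matches = BOXED.findall(completion)
    return matches[-1] if matches else None


def score(reference, completions):
    """Return (number correct, total) for one prompt's completions."""
    correct = sum(extract_choice(c) == reference for c in completions)
    return correct, len(completions)


# The two examples shown in the updated report.
print(score("J", [r"\boxed{J}"]))  # (1, 1)
print(score("A", [r"\boxed{J}"]))  # (0, 1)
```

Under these assumptions, the first call corresponds to the "1/1 completions correct" entry and the second to the "0/1 completions correct" entry.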