M ShreeRaj commited on
Change grader function to 'models.grader'
Browse filesUpdated grader function references from 'models.deterministic_grader' to 'models.grader' for all tasks.
- openenv.yaml +4 -4
openenv.yaml
CHANGED
|
@@ -38,26 +38,26 @@ schema:
|
|
| 38 |
graders:
|
| 39 |
deterministic_grader:
|
| 40 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
| 41 |
-
fn: "models.
|
| 42 |
tasks:
|
| 43 |
- id: easy
|
| 44 |
difficulty: easy
|
| 45 |
description: "2 easy tasks with no deadlines. Agent must complete both tasks without burning out."
|
| 46 |
max_steps: 50
|
| 47 |
grader:
|
| 48 |
-
fn: "models.
|
| 49 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
| 50 |
- id: medium
|
| 51 |
difficulty: medium
|
| 52 |
description: "5 medium tasks with deadlines. Agent must balance speed and energy to meet deadlines."
|
| 53 |
max_steps: 50
|
| 54 |
grader:
|
| 55 |
-
fn: "models.
|
| 56 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
| 57 |
- id: hard
|
| 58 |
difficulty: hard
|
| 59 |
description: "8 hard tasks with tight deadlines and hidden fatigue mechanics. Agent must manage stress and interruptions."
|
| 60 |
max_steps: 50
|
| 61 |
grader:
|
| 62 |
-
fn: "models.
|
| 63 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
|
|
|
| 38 |
graders:
|
| 39 |
deterministic_grader:
|
| 40 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
| 41 |
+
fn: "models.grader"
|
| 42 |
tasks:
|
| 43 |
- id: easy
|
| 44 |
difficulty: easy
|
| 45 |
description: "2 easy tasks with no deadlines. Agent must complete both tasks without burning out."
|
| 46 |
max_steps: 50
|
| 47 |
grader:
|
| 48 |
+
fn: "models.grader"
|
| 49 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
| 50 |
- id: medium
|
| 51 |
difficulty: medium
|
| 52 |
description: "5 medium tasks with deadlines. Agent must balance speed and energy to meet deadlines."
|
| 53 |
max_steps: 50
|
| 54 |
grader:
|
| 55 |
+
fn: "models.grader"
|
| 56 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|
| 57 |
- id: hard
|
| 58 |
difficulty: hard
|
| 59 |
description: "8 hard tasks with tight deadlines and hidden fatigue mechanics. Agent must manage stress and interruptions."
|
| 60 |
max_steps: 50
|
| 61 |
grader:
|
| 62 |
+
fn: "models.grader"
|
| 63 |
description: "Evaluates agent performance based on task completion, deadline adherence, and energy efficiency"
|