dvilasuero committed on
Commit 94c4a6b · verified · 1 Parent(s): 1c04900

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +30 -8
README.md CHANGED
@@ -1,5 +1,5 @@
  ---
- title: Test Local
  emoji: 📊
  colorFrom: blue
  colorTo: purple
@@ -8,16 +8,38 @@ sdk_version: "latest"
  pinned: false
  ---
 
- # Test Local
 
- Live log viewer for eval results stored in [dvilasuero/test-local](https://huggingface.co/dvilasuero/test-local).
 
- This Space runs `inspect view` to display real-time evaluation logs from the dataset.
 
- ## View Logs
 
- Logs are automatically displayed from: `hf://datasets/dvilasuero/test-local/logs`
 
- ## Dataset
 
- Results are stored in: [dvilasuero/test-local](https://huggingface.co/dvilasuero/test-local)

  ---
+ title: Arc Easy
  emoji: 📊
  colorFrom: blue
  colorTo: purple

  pinned: false
  ---
 
+ # arc_easy
 
+ This eval was run using [evaljobs](https://github.com/dvsrepo/evaljobs).
 
+ ## Command
 
+ ```bash
+ evaljobs inspect_evals/arc_easy \
+ --model hf-inference-providers/openai/gpt-oss-20b:fastest,hf-inference-providers/openai/gpt-oss-120b:fastest \
+ --name test-local \
+ --limit 1
+ ```
 
+ ## Run with other models
 
+ To run this eval with a different model, use:
 
+ ```bash
+ evaljobs inspect_evals/arc_easy \
+ --model <your-model> \
+ --name <your-name> \
+ --flavor cpu-basic
+ ```
+
+ ## Inspect eval command
+
+ The eval was executed with:
+
+ ```bash
+ inspect eval-set inspect_evals/arc_easy \
+ --model hf-inference-providers/openai/gpt-oss-20b:fastest,hf-inference-providers/openai/gpt-oss-120b:fastest \
+ --limit 1 \
+ --log-shared \
+ --log-buffer 100
+ ```