dvilasuero commited on
Commit
b728137
·
verified ·
1 Parent(s): e3b6d7a

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +28 -8
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- title: Bfcl Olmo
3
  emoji: 📊
4
  colorFrom: blue
5
  colorTo: purple
@@ -8,16 +8,36 @@ sdk_version: "latest"
8
  pinned: false
9
  ---
10
 
11
- # Bfcl Olmo
12
 
13
- Live log viewer for eval results stored in [dvilasuero/bfcl-olmo](https://huggingface.co/dvilasuero/bfcl-olmo).
14
 
15
- This Space runs `inspect view` to display real-time evaluation logs from the dataset.
16
 
17
- ## View Logs
 
 
 
 
18
 
19
- Logs are automatically displayed from: `hf://datasets/dvilasuero/bfcl-olmo/logs`
20
 
21
- ## Dataset
22
 
23
- Results are stored in: [dvilasuero/bfcl-olmo](https://huggingface.co/dvilasuero/bfcl-olmo)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Bfcl
3
  emoji: 📊
4
  colorFrom: blue
5
  colorTo: purple
 
8
  pinned: false
9
  ---
10
 
11
+ # bfcl
12
 
13
+ This eval was run using [evaljobs](https://github.com/dvsrepo/evaljobs).
14
 
15
+ ## Command
16
 
17
+ ```bash
18
+ evaljobs inspect_evals/bfcl \
19
+ --model hf-inference-providers/allenai/Olmo-3-7B-Instruct \
20
+ --name bfcl-olmo
21
+ ```
22
 
23
+ ## Run with other models
24
 
25
+ To run this eval with a different model, use:
26
 
27
+ ```bash
28
+ evaljobs inspect_evals/bfcl \
29
+ --model <your-model> \
30
+ --name <your-name> \
31
+ --flavor cpu-basic
32
+ ```
33
+
34
+ ## Inspect eval command
35
+
36
+ The eval was executed with:
37
+
38
+ ```bash
39
+ inspect eval inspect_evals/bfcl \
40
+ --model hf-inference-providers/allenai/Olmo-3-7B-Instruct \
41
+ --log-shared \
42
+ --log-buffer 100
43
+ ```