dvilasuero commited on
Commit
852e47b
·
verified ·
1 Parent(s): 1af51ae

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +30 -8
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- title: Simpleqa Verified Sample
3
  emoji: 📊
4
  colorFrom: blue
5
  colorTo: purple
@@ -8,16 +8,38 @@ sdk_version: "latest"
8
  pinned: false
9
  ---
10
 
11
- # Simpleqa Verified Sample
12
 
13
- Live log viewer for eval results stored in [dvilasuero/simpleqa_verified-sample](https://huggingface.co/dvilasuero/simpleqa_verified-sample).
14
 
15
- This Space runs `inspect view` to display real-time evaluation logs from the dataset.
16
 
17
- ## View Logs
 
 
 
 
 
18
 
19
- Logs are automatically displayed from: `hf://datasets/dvilasuero/simpleqa_verified-sample/logs`
20
 
21
- ## Dataset
22
 
23
- Results are stored in: [dvilasuero/simpleqa_verified-sample](https://huggingface.co/dvilasuero/simpleqa_verified-sample)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Simpleqa Verified Custom
3
  emoji: 📊
4
  colorFrom: blue
5
  colorTo: purple
 
8
  pinned: false
9
  ---
10
 
11
+ # simpleqa_verified_custom
12
 
13
+ This eval was run using [evaljobs](https://github.com/dvsrepo/evaljobs).
14
 
15
+ ## Command
16
 
17
+ ```bash
18
+ evaljobs examples/simpleqa_verified_custom.py \
19
+ --model hf-inference-providers/openai/gpt-oss-20b:cheapest \
20
+ --name simpleqa_verified-sample \
21
+ --limit 10
22
+ ```
23
 
24
+ ## Run with other models
25
 
26
+ To run this eval with a different model, use:
27
 
28
+ ```bash
29
+ evaljobs https://huggingface.co/spaces/dvilasuero/simpleqa_verified-sample \
30
+ --model <your-model> \
31
+ --name <your-name> \
32
+ --flavor cpu-basic
33
+ ```
34
+
35
+ ## Inspect eval command
36
+
37
+ The eval was executed with:
38
+
39
+ ```bash
40
+ inspect eval eval.py \
41
+ --model hf-inference-providers/openai/gpt-oss-20b:cheapest \
42
+ --limit 10 \
43
+ --log-shared \
44
+ --log-buffer 100
45
+ ```