metadata
title: Arc Easy
emoji: 📊
colorFrom: blue
colorTo: purple
sdk: docker
sdk_version: latest
pinned: false

arc_easy

This eval was run using evaljobs, limited to a single sample (--limit 1).

Command

evaljobs inspect_evals/arc_easy \
  --model hf-inference-providers/openai/gpt-oss-20b:cheapest \
  --name arc-easy-evals-test \
  --limit 1

Run with other models

To run this eval with a different model, use:

evaljobs inspect_evals/arc_easy \
  --model <your-model> \
  --name <your-name> \
  --flavor cpu-basic
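
To compare several models, the template above can be wrapped in a small loop. This is a minimal sketch, not part of evaljobs itself: it uses only the flags shown in this README (`--model`, `--name`, `--flavor`). The second model ID is a hypothetical placeholder, and the `run_name` helper and `arc-easy-` naming scheme are illustrative assumptions, on the guess that run names should avoid characters such as `/` and `:`. By default it only prints the commands; set `DRY_RUN=0` to actually launch them.

```shell
#!/usr/bin/env bash
# Sketch: print (or run) one evaljobs command per model.
set -euo pipefail

# DRY_RUN=1 (default) prints the commands instead of executing them.
DRY_RUN="${DRY_RUN:-1}"

MODELS=(
  "hf-inference-providers/openai/gpt-oss-20b:cheapest"  # model used in this README
  "example-provider/example-org/example-model"          # hypothetical placeholder
)

# Hypothetical helper: lowercase the input and collapse every run of
# characters outside a-z0-9 into a single "-", trimming "-" at both ends.
run_name() {
  printf '%s' "$1" \
    | tr '[:upper:]' '[:lower:]' \
    | tr -c 'a-z0-9' '-' \
    | sed -e 's/--*/-/g' -e 's/^-//' -e 's/-$//'
}

for model in "${MODELS[@]}"; do
  # Derive the run name from the last path component of the model ID.
  name="arc-easy-$(run_name "${model##*/}")"
  cmd=(evaljobs inspect_evals/arc_easy --model "$model" --name "$name" --flavor cpu-basic)
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "${cmd[*]}"
  else
    "${cmd[@]}"
  fi
done
```

Building each command as an array keeps model IDs with unusual characters intact when the command is executed rather than printed.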

Inspect eval command

The eval was executed with:

inspect eval inspect_evals/arc_easy \
  --model hf-inference-providers/openai/gpt-oss-20b:cheapest \
  --limit 1 \
  --log-shared \
  --log-buffer 100