Commit History

Webhook receiver mode: Space receives POSTs, spawns GPU Jobs
9a21135

ouhenio committed on

Reduce max_length to 2560 (observed max is 2049, saves KV cache)
6338890

ouhenio committed on

Set max_length=4096 for olmes (5-shot MMLU exceeds 2048)
3e3ffdc

ouhenio committed on

Fix patches.sh: use find_spec instead of import (CUDA unavailable at build)
1ebc4ac

ouhenio committed on

Force-reinstall lm-eval 0.4.3, add patch verification
f575e4e

ouhenio committed on

Remove separate lm-eval install: olmes pins 0.4.3, patches.sh handles vLLM compat
90b371c

ouhenio committed on

Add lm-eval[math] extras (fixes minerva_math import)
1e19863

ouhenio committed on

Pin lm-eval to 0.4.11 (TokensPrompt fix for vLLM 0.17)
2340010

ouhenio committed on

Remove deep_ep: fixes libnvshmem ImportError on A100
b5812b7

ouhenio committed on

Add python symlink (olmes needs python not python3)
9cbb2a1

ouhenio committed on

Bundle eval.yaml, fallback when GitHub repo is private
2c70552

ouhenio committed on

Reset ENTRYPOINT: vllm base crashes on CPU Space
f12cfa4

ouhenio committed on

Use CMD instead of ENTRYPOINT: keep Space alive for Jobs
d390195

ouhenio committed on

Fix entrypoint: use python3
b9ce0eb

ouhenio committed on

Fix olmes install: clone then pip install
0b148dd

ouhenio committed on

Fix Dockerfile: install olmes from GitHub, use base image torch/vLLM
514462b

ouhenio committed on

Initial eval-runner pipeline
e2f615e

ouhenio committed on

initial commit
6232566
verified

ouhenio committed on
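
Note on 1ebc4ac (find_spec instead of import): the commit message suggests that importing the package at build time failed because CUDA was unavailable. A minimal sketch of the general technique, not the actual contents of patches.sh: `importlib.util.find_spec` locates a top-level module without executing it, so nothing inside the package can fail at import time. The helper name `is_installed` and the module names below are hypothetical and used for illustration only.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located, without importing it.

    Assumption: module_name has no dots. For a dotted name, find_spec
    imports the parent package first, which would defeat the purpose.
    """
    return importlib.util.find_spec(module_name) is not None


# Standard-library module, always locatable.
print(is_installed("json"))
# Hypothetical name that should not exist on any system.
print(is_installed("hypothetical_missing_pkg_xyz"))
```

A build script could call such a check before applying a patch, and defer any real import to runtime on the GPU machine.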