	---
	title: eval-card-registry
	emoji: 🗂️
	colorFrom: blue
	colorTo: green
	sdk: docker
	app_port: 7860
	pinned: false
	---

	# eval-card-registry

	Query-only disambiguation API for AI evaluation entity names. Resolves raw benchmark / model / metric / harness strings (e.g. `"MATH Level 5"`) to stable canonical IDs (e.g. `math-level-5`).

	This Space runs in read-only mode: it only serves lookups against pre-built entity data. Write operations (entity creation, alias edits) happen in a separate pipeline.

	## Base URL

	```
	https://evaleval-entity-registry.hf.space/api/v1
	```

	## Resolve

	```bash
	curl -X POST https://evaleval-entity-registry.hf.space/api/v1/resolve \
	-H 'Content-Type: application/json' \
	-d '{"raw_value": "MATH Level 5", "entity_type": "benchmark"}'
	```

	Response:

	```json
	{
	"canonical_id": "math-level-5",
	"strategy": "exact",
	"confidence": 1.0,
	"created_new": false,
	"review_status": "reviewed"
	}
	```

	If nothing matches, `canonical_id` is `null` and `strategy` is `"no_match"`. In read-only mode, no draft entity is created.
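
The match and no-match cases described above can be told apart on the client with a small wrapper. This is a minimal sketch, not part of the registry itself: `ResolveResult` and `parse_resolve_response` are hypothetical names, and treating a missing `strategy` field as `"no_match"` is an assumption, since only the fields shown in the sample response are documented here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ResolveResult:
    """Client-side view of one /resolve response (hypothetical helper)."""

    canonical_id: Optional[str]
    strategy: str
    confidence: Optional[float]
    created_new: bool
    review_status: Optional[str]

    @property
    def matched(self) -> bool:
        # Documented behavior: on a miss, canonical_id is null and
        # strategy is "no_match".
        return self.canonical_id is not None and self.strategy != "no_match"


def parse_resolve_response(body: dict) -> ResolveResult:
    """Build a ResolveResult from the decoded JSON body of a response."""
    return ResolveResult(
        canonical_id=body.get("canonical_id"),
        # Assumption: a body without "strategy" is treated as a miss.
        strategy=body.get("strategy", "no_match"),
        confidence=body.get("confidence"),
        # Read-only mode never creates drafts, so default to False.
        created_new=bool(body.get("created_new", False)),
        review_status=body.get("review_status"),
    )
```

Callers can then branch on `result.matched` instead of checking `canonical_id` and `strategy` separately.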

	`entity_type` is one of: `benchmark`, `model`, `metric`, `harness`. Optional `source_config` scopes the lookup to a specific source.
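
A request body can be validated before it is sent. The sketch below uses only the Python standard library. `build_resolve_payload` and `build_resolve_request` are hypothetical helper names. The type of `source_config` is not specified in this README, so the value is passed through unchanged.

```python
import json
import urllib.request
from typing import Any, Optional

BASE_URL = "https://evaleval-entity-registry.hf.space/api/v1"
ENTITY_TYPES = ("benchmark", "model", "metric", "harness")


def build_resolve_payload(
    raw_value: str,
    entity_type: str,
    source_config: Optional[Any] = None,
) -> dict:
    """Return the JSON body for POST /resolve, rejecting unknown entity types."""
    if entity_type not in ENTITY_TYPES:
        raise ValueError(
            f"entity_type must be one of {ENTITY_TYPES}, got {entity_type!r}"
        )
    payload = {"raw_value": raw_value, "entity_type": entity_type}
    if source_config is not None:
        # Assumption: source_config is JSON-serializable; its exact shape
        # is not documented here.
        payload["source_config"] = source_config
    return payload


def build_resolve_request(payload: dict) -> urllib.request.Request:
    """Wrap a payload in a POST request; nothing is sent until urlopen()."""
    return urllib.request.Request(
        f"{BASE_URL}/resolve",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned request to `urllib.request.urlopen` sends it. Keeping construction separate from sending makes the validation testable without network access.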

	Batch resolve:

	```bash
	curl -X POST https://evaleval-entity-registry.hf.space/api/v1/resolve/batch \
	-H 'Content-Type: application/json' \
	-d '[
	{"raw_value": "MATH Level 5", "entity_type": "benchmark"},
	{"raw_value": "meta-llama/Llama-3.1-8B", "entity_type": "model"}
	]'
	```
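
When many raw strings need resolving, the batch body can be assembled programmatically. The following is a sketch under stated assumptions: `build_batch_payload`, `batch_body`, and `chunked` are hypothetical helpers, and the chunk size of 100 is an arbitrary illustrative default, since this README does not state a batch size limit or the shape of the batch response.

```python
import json
from typing import Iterable, Iterator, List, Tuple


def build_batch_payload(items: Iterable[Tuple[str, str]]) -> List[dict]:
    """Turn (raw_value, entity_type) pairs into batch entries.

    Exact duplicates are dropped while preserving first-seen order.
    """
    seen = set()
    payload = []
    for raw_value, entity_type in items:
        key = (raw_value, entity_type)
        if key in seen:
            continue
        seen.add(key)
        payload.append({"raw_value": raw_value, "entity_type": entity_type})
    return payload


def batch_body(payload: List[dict]) -> bytes:
    """Encode the payload as the JSON array expected by /resolve/batch."""
    return json.dumps(payload).encode("utf-8")


def chunked(payload: List[dict], size: int = 100) -> Iterator[List[dict]]:
    """Yield consecutive slices of at most `size` entries (size is an assumption)."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(payload), size):
        yield payload[start:start + size]
```

Each chunk can be encoded with `batch_body` and posted to `/resolve/batch` with the same `Content-Type: application/json` header as the curl example.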

	## Browse entities

	```
	GET /api/v1/benchmarks?search=math
	GET /api/v1/benchmarks/{id}
	GET /api/v1/models
	GET /api/v1/metrics
	GET /api/v1/harnesses
	GET /api/v1/aliases?status=uncertain&entity_type=benchmark
	```
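
The browse URLs listed above can be built with proper query-string escaping. This sketch covers only the routes and parameters shown in the list. The function names are hypothetical, and whether other collections accept `search` or an `{id}` path is not stated here, so they are left out.

```python
from typing import Optional
from urllib.parse import quote, urlencode

BASE_URL = "https://evaleval-entity-registry.hf.space/api/v1"


def benchmarks_url(search: Optional[str] = None) -> str:
    """URL for GET /benchmarks, with an optional `search` filter."""
    url = f"{BASE_URL}/benchmarks"
    if search is not None:
        url += "?" + urlencode({"search": search})
    return url


def benchmark_url(benchmark_id: str) -> str:
    """URL for GET /benchmarks/{id}; the id is percent-encoded as one path segment."""
    return f"{BASE_URL}/benchmarks/{quote(benchmark_id, safe='')}"


def aliases_url(
    status: Optional[str] = None,
    entity_type: Optional[str] = None,
) -> str:
    """URL for GET /aliases, including only the filters that were given."""
    params = {}
    if status is not None:
        params["status"] = status
    if entity_type is not None:
        params["entity_type"] = entity_type
    url = f"{BASE_URL}/aliases"
    return url + "?" + urlencode(params) if params else url
```

The resulting strings can be passed to any HTTP client, for example `urllib.request.urlopen`.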

	## Health

	```
	GET /api/v1/health
	GET /api/v1/stats
	```

	## Write endpoints

	Disabled in this Space. `POST`/`PATCH` on entities and aliases return `405 Method Not Allowed`. Mutations happen in the data pipeline (separate from this Space).

	## Interactive docs

	OpenAPI docs at `/docs`.

	## Data sources

	- Entity data: HF Dataset repo `evaleval/entity-registry-data` (read at startup)
	- Resolve logs: HF Storage Bucket `evaleval/entity-registry-storage` (written asynchronously for resolver improvement)