Spaces:

commoncrawl
/

commonlid

Running

commonlid / README.md

Update README.md

9fe0361 verified 9 days ago

1.11 kB

	---
	title: CommonLID Leaderboard
	emoji: 🌍
	colorFrom: indigo
	colorTo: pink
	sdk: gradio
	sdk_version: 5.9.1
	python_version: 3.12
	app_file: app.py
	pinned: false
	license: apache-2.0
	---

	# CommonLID Leaderboard

	Live results table for the [CommonLID](https://huggingface.co/datasets/commoncrawl/CommonLID)
	benchmark and its `commonlid_nano` slice. Models are ranked by **macro
	F1**; click any row for per-language detail.

	## How the data flows

	1. Each run writes `<dataset>/<model>/{summary.json, predictions.jsonl}` to
	`data/results/` via `commonlid run …`.
	2. The contents of `data/results/` are uploaded to
	[`commoncrawl/commonlid-results`](https://huggingface.co/datasets/commoncrawl/commonlid-results)
	with `huggingface-cli upload`.
	3. This Space loads the dataset on boot and renders one tab per benchmark.

	## Local preview

	```bash
	uv sync --extra leaderboard
	uv run commonlid leaderboard serve --local-dir ./data/results
	```

	## Pinning a specific results revision

	Set `COMMONLID_RESULTS_REVISION` (or `--revision` on the CLI) to lock the
	Space to a specific commit on the results dataset.