Spaces:

KevinMerchant13
/

oss-vs-frontier-assistant

Running

Phase 7: initial deploy (cpu-basic)

35c0d38 verified 3 days ago

200 Bytes

	"""Evaluation framework package.

	Loads benchmark datasets, runs both assistants over them, judges the outputs,
	and renders a report comparing OSS vs. frontier on hallucination, bias, and
	safety.
	"""