ContinuousBench/Baselines
Viewer • Updated • 460k • 398
None defined yet.
ContinuousBench measures progress in differentially private synthetic data.
ContinuousBench has two tracks:
Both datasets:
Generate a DP synthetic version of News or Geminon, then test it: https://github.com/plau666/ContinuousBenchEval.
Our evaluation trains a model on your DP synthetic version, and then asks the paired QA to see if your DP synthetic data was capable of teaching a model the knowledge present in the original corpus.