Zhu Jiajun (jz28583) Claude Opus 4.7 (1M context) commited on
Commit
9d28e20
·
1 Parent(s): 54a4248

Update Tasks table: real row counts, Kaggle backend, ibm-aml F1 metric

Browse files

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Files changed (1) hide show
  1. README.md +11 -6
README.md CHANGED
@@ -51,12 +51,17 @@ You provide the `ground_truth/<task>.csv` files yourself (one row per test entit
51
 
52
  ## Tasks
53
 
54
- | Task | Type | Metric | Test rows |
55
- | --- | --- | --- | --- |
56
- | `ieee-fraud-detection` | binary classification | AUC-ROC | ~118k |
57
- | `arxiv-citation` | binary classification | AUC-ROC | 19,394 |
58
- | `figraph` | binary anomaly detection | AUC-ROC | 3,596 |
59
- | `ibm-aml` | binary anomaly detection (preview) | AUC-PR | TBD |
 
 
 
 
 
60
 
61
  ## How it works
62
 
 
51
 
52
  ## Tasks
53
 
54
+ | Task | Type | Metric | Test rows | Backend |
55
+ | --- | --- | --- | --- | --- |
56
+ | `ieee-fraud-detection` | binary classification | AUC-ROC | 506,691 | Kaggle |
57
+ | `arxiv-citation` | binary classification | AUC-ROC | 193,696 | local GT |
58
+ | `figraph` | binary anomaly detection | AUC-ROC | 3,596 | local GT |
59
+ | `ibm-aml` | binary anomaly detection | F1 (minority) | 863,900 | local GT |
60
+
61
+ `ieee-fraud-detection` forwards your CSV to the Kaggle competition and
62
+ returns the official private-leaderboard score. The hosted server uses its
63
+ own Kaggle credentials for that — no setup needed on your side just to
64
+ submit. (You only need `KAGGLE_USERNAME`/`KAGGLE_KEY` if you self-host.)
65
 
66
  ## How it works
67