zhenwu0831 committed on
Commit 61d6302 · Parent(s): 5fed9c7
v16
app.py CHANGED
@@ -520,7 +520,7 @@ def process_submission(file):

         # concise status
         lines = [
-            f"✅ Submission
+            f"✅ Submission {andrewid} (attempt #{used + 1}/{MAX_ATTEMPTS}).",
             f"Total score: {total_score:.4f} ({total_score * 100:.2f}%)",
             f"EM: {avg_em:.4f} ({avg_em * 100:.2f}%)",
             f"F1: {avg_f1:.4f} ({avg_f1 * 100:.2f}%)",
@@ -532,10 +532,10 @@ def process_submission(file):
         else:
             lines.append(f"LLM judge: {avg_judge:.3f} (1-5)")

-        return "\n".join(lines)
+        return "\n".join(lines)

     except Exception as e:
-        return f"❌ Error: {e}"
+        return f"❌ Error: {e}"


 # =========================
@@ -559,8 +559,6 @@ We compute multiple metrics:

 **Attempts:** up to **{MAX_ATTEMPTS}**. Attempts always increase. Your leaderboard score updates only if your **total score improves**.

-**Persistence:** leaderboard is stored in HF dataset **{LEADERBOARD_REPO_ID}** as **{LEADERBOARD_FILENAME}**.
-
 **Submission format (JSON):**
 ```json
 {{
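The help text in this diff says that attempts always increase while the leaderboard score changes only when the total score improves. The rest of `app.py` is not shown, so the following is only a minimal sketch of that rule, not the Space's actual code. The `Entry` record, the `record_attempt` and `status_header` helpers, and the value of `MAX_ATTEMPTS` are hypothetical; only the header string format is taken from the added line in `process_submission`.

```python
from dataclasses import dataclass
from typing import Optional

MAX_ATTEMPTS = 10  # hypothetical value; the real limit is not visible in the diff


@dataclass
class Entry:
    """Hypothetical per-student leaderboard record (not taken from app.py)."""
    attempts: int = 0
    best_score: Optional[float] = None


def record_attempt(entry: Entry, total_score: float) -> bool:
    """Count the attempt; keep the score only if it beats the stored best.

    Returns True when the leaderboard score was updated.
    """
    if entry.attempts >= MAX_ATTEMPTS:
        raise ValueError("no attempts left")
    entry.attempts += 1  # attempts always increase, even without improvement
    if entry.best_score is None or total_score > entry.best_score:
        entry.best_score = total_score
        return True
    return False


def status_header(andrewid: str, used: int) -> str:
    """Build the header line in the format added by this commit.

    `used` is the number of attempts made before the current one.
    """
    return f"✅ Submission {andrewid} (attempt #{used + 1}/{MAX_ATTEMPTS})."
```

Under these assumptions, a second submission with a lower score still consumes an attempt but leaves `best_score` unchanged.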