zhenwu0831 commited on
Commit
61d6302
·
1 Parent(s): 5fed9c7
Files changed (1) hide show
  1. app.py +3 -5
app.py CHANGED
@@ -520,7 +520,7 @@ def process_submission(file):
520
 
521
  # concise status
522
  lines = [
523
- f"✅ Submission recorded for {andrewid} (attempt #{used + 1}/{MAX_ATTEMPTS}).",
524
  f"Total score: {total_score:.4f} ({total_score * 100:.2f}%)",
525
  f"EM: {avg_em:.4f} ({avg_em * 100:.2f}%)",
526
  f"F1: {avg_f1:.4f} ({avg_f1 * 100:.2f}%)",
@@ -532,10 +532,10 @@ def process_submission(file):
532
  else:
533
  lines.append(f"LLM judge: {avg_judge:.3f} (1-5)")
534
 
535
- return "\n".join(lines), get_leaderboard_display()
536
 
537
  except Exception as e:
538
- return f"❌ Error: {e}", get_leaderboard_display()
539
 
540
 
541
  # =========================
@@ -559,8 +559,6 @@ We compute multiple metrics:
559
 
560
  **Attempts:** up to **{MAX_ATTEMPTS}**. Attempts always increase. Your leaderboard score updates only if your **total score improves**.
561
 
562
- **Persistence:** leaderboard is stored in HF dataset **{LEADERBOARD_REPO_ID}** as **{LEADERBOARD_FILENAME}**.
563
-
564
  **Submission format (JSON):**
565
  ```json
566
  {{
 
520
 
521
  # concise status
522
  lines = [
523
+ f"✅ Submission {andrewid} (attempt #{used + 1}/{MAX_ATTEMPTS}).",
524
  f"Total score: {total_score:.4f} ({total_score * 100:.2f}%)",
525
  f"EM: {avg_em:.4f} ({avg_em * 100:.2f}%)",
526
  f"F1: {avg_f1:.4f} ({avg_f1 * 100:.2f}%)",
 
532
  else:
533
  lines.append(f"LLM judge: {avg_judge:.3f} (1-5)")
534
 
535
+ return "\n".join(lines)
536
 
537
  except Exception as e:
538
+ return f"❌ Error: {e}"
539
 
540
 
541
  # =========================
 
559
 
560
  **Attempts:** up to **{MAX_ATTEMPTS}**. Attempts always increase. Your leaderboard score updates only if your **total score improves**.
561
 
 
 
562
  **Submission format (JSON):**
563
  ```json
564
  {{