Spaces:
Sleeping
Sleeping
MedGRPO Team
commited on
Commit
·
6d8dbb2
1
Parent(s):
c8f4cad
Fix CVS_acc to use raw accuracy instead of component_balanced_accuracy
Browse files- Change extraction to match 'accuracy:' but exclude 'component_balanced_accuracy'
- Now extracts 0.9136 (raw accuracy) instead of 0.8956 (component_balanced)
- Matches expected table value of 0.914
app.py
CHANGED
|
@@ -725,8 +725,8 @@ def parse_evaluation_output(output: str) -> Dict[str, float]:
|
|
| 725 |
except:
|
| 726 |
pass
|
| 727 |
|
| 728 |
-
# CVS Assessment: Extract accuracy
|
| 729 |
-
elif current_task == "cvs_assessment" and "accuracy" in line
|
| 730 |
try:
|
| 731 |
value = float(line.split(":")[-1].strip())
|
| 732 |
metrics["cvs_acc"] = value
|
|
|
|
| 725 |
except:
|
| 726 |
pass
|
| 727 |
|
| 728 |
+
# CVS Assessment: Extract accuracy (not component_balanced_accuracy)
|
| 729 |
+
elif current_task == "cvs_assessment" and "accuracy:" in line and "component_balanced" not in line:
|
| 730 |
try:
|
| 731 |
value = float(line.split(":")[-1].strip())
|
| 732 |
metrics["cvs_acc"] = value
|