# LM-Harmony
*Which model would you rather have: the weaker student who crammed for the test, or the stronger student who walked in underprepared? Existing leaderboards mostly reward the former.*
**LM-Harmony** is a multi-task leaderboard for **model potential**. Instead of judging deployment-ready performance out of the box, we use a **train-before-test** paradigm: every model is fine-tuned on the same benchmark-specific training set before evaluation.
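The train-before-test loop can be sketched as follows. This is a minimal illustration, not LM-Harmony's actual harness: the helper names `fine_tune` and `evaluate` are hypothetical stand-ins for whatever training and scoring code a benchmark provides.

```python
def train_before_test(models, tasks, fine_tune, evaluate):
    """Rank models by mean score *after* fine-tuning.

    Every model is fine-tuned on the same benchmark-specific training
    split, then scored on the held-out test split. `fine_tune` and
    `evaluate` are hypothetical callables supplied by the benchmark.
    """
    scores = {}
    for name, model in models.items():
        task_scores = []
        for task in tasks:
            # Same training set for every model, so rankings reflect
            # potential after adaptation, not out-of-the-box polish.
            tuned = fine_tune(model, task["train"])
            task_scores.append(evaluate(tuned, task["test"]))
        scores[name] = sum(task_scores) / len(task_scores)
    # Best post-fine-tuning average first.
    return sorted(scores, key=scores.get, reverse=True)
```

The key design choice is that ranking happens only on post-fine-tuning scores, so a model that starts weaker but adapts better can outrank one that merely ships with the test distribution memorized.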
Across 24 diverse tasks, LM-Harmony yields far more stable and consistent rankings than standard direct-evaluation leaderboards. If you care about which model will perform better after you fine-tune it on your own data, the ranking you see here is much more likely to generalize to your workload.