Spaces:
Running
Running
Update miniapp_leaderboard.py
Browse files- miniapp_leaderboard.py +15 -15
miniapp_leaderboard.py
CHANGED
|
@@ -161,13 +161,13 @@ def submit(model_name, model_family, zip_file, profile: gr.OAuthProfile):
|
|
| 161 |
|
| 162 |
with gr.Blocks(title=f"{APP_NAME} leaderboard") as demo:
|
| 163 |
gr.Markdown(f"# {APP_NAME} Leaderboard")
|
| 164 |
-
gr.Markdown("""
|
| 165 |
-
### Data
|
| 166 |
-
|
| 167 |
-
MiniAppBench is the first comprehensive benchmark designed to evaluate principle-driven, interactive application generation. Unlike prior benchmarks that emphasize static UI layouts or isolated algorithmic code snippets, MiniAppBench targets **MiniApps**—HTML-based applications that require both faithful visual rendering and non-trivial interaction logic.
|
| 168 |
-
|
| 169 |
-
The dataset is split into two subsets: **validation (100 instances)** and **test (400 instances)**, and can be accessed at **[MiniAppBench dataset](https://huggingface.co/datasets/MiniAppBench/Dataset)**. The **validation** set includes publicly available **evaluation references** to support reproducible experiments, while the **test** set keeps the references hidden to enable unbiased evaluation.
|
| 170 |
-
""")
|
| 171 |
|
| 172 |
leaderboard = gr.Dataframe(
|
| 173 |
value=pd.DataFrame(columns=COLUMNS), # 启动不访问Hub
|
|
@@ -185,14 +185,14 @@ The dataset is split into two subsets: **validation (100 instances)** and **test
|
|
| 185 |
|
| 186 |
gr.Markdown(
|
| 187 |
"""
|
| 188 |
-
**Submission requirements**
|
| 189 |
-
- Please **sign in with Hugging Face** before submitting.
|
| 190 |
-
- **One submission per user per day** (UTC).
|
| 191 |
-
- Upload a **.zip** file only.
|
| 192 |
-
- The `.zip` must contain the HTML outputs for the **test set queries**.
|
| 193 |
-
|
| 194 |
-
- After you submit, we will update the result in 3 days.
|
| 195 |
-
""",
|
| 196 |
)
|
| 197 |
|
| 198 |
model_name = gr.Textbox(label="Model name", placeholder="e.g. MyModel v1")
|
|
|
|
| 161 |
|
| 162 |
with gr.Blocks(title=f"{APP_NAME} leaderboard") as demo:
|
| 163 |
gr.Markdown(f"# {APP_NAME} Leaderboard")
|
| 164 |
+
gr.Markdown("""
|
| 165 |
+
### Data
|
| 166 |
+
|
| 167 |
+
MiniAppBench is the first comprehensive benchmark designed to evaluate principle-driven, interactive application generation. Unlike prior benchmarks that emphasize static UI layouts or isolated algorithmic code snippets, MiniAppBench targets **MiniApps**—HTML-based applications that require both faithful visual rendering and non-trivial interaction logic.
|
| 168 |
+
|
| 169 |
+
The dataset is split into two subsets: **validation (100 instances)** and **test (400 instances)**, and can be accessed at **[MiniAppBench dataset](https://huggingface.co/datasets/MiniAppBench/Dataset)**. The **validation** set includes publicly available **evaluation references** to support reproducible experiments, while the **test** set keeps the references hidden to enable unbiased evaluation.
|
| 170 |
+
""")
|
| 171 |
|
| 172 |
leaderboard = gr.Dataframe(
|
| 173 |
value=pd.DataFrame(columns=COLUMNS), # 启动不访问Hub
|
|
|
|
| 185 |
|
| 186 |
gr.Markdown(
|
| 187 |
"""
|
| 188 |
+
**Submission requirements**
|
| 189 |
+
- Please **sign in with Hugging Face** before submitting.
|
| 190 |
+
- **One submission per user per day** (UTC).
|
| 191 |
+
- Upload a **.zip** file only.
|
| 192 |
+
- The `.zip` must contain the HTML outputs for the **test set queries**.
|
| 193 |
+
- Each file should be named using the query index: `<index>.html` (e.g., `1.html`, `2.html`, ...).
|
| 194 |
+
- After you submit, we will update the result in 3 days.
|
| 195 |
+
""",
|
| 196 |
)
|
| 197 |
|
| 198 |
model_name = gr.Textbox(label="Model name", placeholder="e.g. MyModel v1")
|