Upload folder using huggingface_hub
README.md
CHANGED
@@ -55,6 +55,13 @@ Tasks are categorized by the **depth of reasoning** required.
 | Medium | 15 | 6 | 2 | Relational: cross-file reasoning |
 | Hard | 25 | 10 | 3 | Sequential: cascading failures |

+**Notes**:
+
+- Faults are typed (e.g., package_present, dockerfile_base, env_var_present, config_value, ci_stage_order, port_value).
+
+- Only the first failing stage is exposed per run; later faults are revealed after fixes.
+
+- Validation is structural, not string-based.

 See [docs/advanced_readme.md](advanced_readme.md) for the full variant breakdown, pipeline shapes, and reasoning about why hard is genuinely hard.
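The three notes added in this change (typed faults, one exposed failure per run, structural validation) can be illustrated with a minimal Python sketch. Only the fault type names `env_var_present` and `port_value` come from the README; the `Fault` class, `parse_env`, `first_failing_stage`, the `.env` layout, and the stage names `build` and `deploy` are hypothetical and are not taken from the repository's implementation.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines into a dict, skipping blanks and comments."""
    parsed: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        parsed[key.strip()] = value.strip()
    return parsed


@dataclass(frozen=True)
class Fault:
    """A typed fault tied to one pipeline stage (hypothetical shape)."""
    fault_type: str
    stage: str
    is_fixed: Callable[[Dict[str, str]], bool]


def env_var_present(name: str) -> Callable[[Dict[str, str]], bool]:
    # Structural check: the key must exist in the parsed file,
    # regardless of spacing, ordering, or surrounding comments.
    return lambda files: name in parse_env(files.get(".env", ""))


def port_value(expected: int) -> Callable[[Dict[str, str]], bool]:
    def check(files: Dict[str, str]) -> bool:
        raw = parse_env(files.get(".env", "")).get("PORT", "")
        return raw.isdigit() and int(raw) == expected
    return check


def first_failing_stage(faults: List[Fault], files: Dict[str, str]) -> Optional[str]:
    """Return only the first stage whose fault is still unfixed, else None."""
    for fault in faults:
        if not fault.is_fixed(files):
            return fault.stage
    return None


# Faults listed in pipeline order (assumed ordering).
FAULTS = [
    Fault("env_var_present", "build", env_var_present("API_KEY")),
    Fault("port_value", "deploy", port_value(8080)),
]

files = {".env": "PORT = 9090\n"}
print(first_failing_stage(FAULTS, files))  # build

files[".env"] = "# secrets\nAPI_KEY = abc\nPORT = 9090\n"
print(first_failing_stage(FAULTS, files))  # deploy

files[".env"] = "API_KEY=abc\nPORT =  8080\n"
print(first_failing_stage(FAULTS, files))  # None
```

In this sketch the `deploy` fault stays hidden until the `build` fault is fixed, which mirrors the cascading behavior the notes describe. Because each check runs on parsed values, differences in whitespace or comments do not affect the result.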