# Buggy Training Dataset
Each problem lives in its own `problem_x/` directory, organized as:
- `problem_x/buggy.py`: intentionally buggy implementation
- `problem_x/test.py`: correctness tests that fail against the buggy code and should pass once it is fixed
- optional extra modules (`helpers.py`, `cache.py`, etc.) to support multi-file bug fixing
Current problems: `problem_1` to `problem_18`.
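
As a rough illustration of the layout above, the following sketch walks a dataset root and checks that each problem directory contains `buggy.py` and `test.py`, reporting any extra modules alongside them. The function name `list_problems` and the `root` argument are hypothetical and not part of the dataset; the sketch also assumes every problem directory is named `problem_<number>`.

```python
from pathlib import Path

REQUIRED = ("buggy.py", "test.py")


def list_problems(root):
    """Map each problem directory name to its sorted list of extra modules.

    Hypothetical helper: assumes directories are named problem_<number>.
    """
    dirs = [
        d for d in Path(root).glob("problem_*")
        if d.is_dir() and d.name.split("_", 1)[1].isdigit()
    ]
    # Sort numerically so problem_2 comes before problem_10.
    dirs.sort(key=lambda d: int(d.name.split("_", 1)[1]))

    problems = {}
    for d in dirs:
        missing = [name for name in REQUIRED if not (d / name).is_file()]
        if missing:
            raise FileNotFoundError(f"{d.name} is missing {missing}")
        # Anything besides the two required files counts as an extra module.
        problems[d.name] = sorted(
            p.name for p in d.glob("*.py") if p.name not in REQUIRED
        )
    return problems
```

Calling `list_problems(".")` from the dataset root would then return a dictionary keyed by problem name, with an empty list for single-file problems.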
Bug patterns included:
- off-by-one errors
- boundary condition mistakes
- incorrect sorting direction
- exception handling mistakes
- state/recency bugs in cache logic
- recursive base-case bugs
- parsing and whitespace normalization issues
- order-preservation regressions
- matrix transformation direction errors
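
To make the first pattern concrete, here is a small hypothetical off-by-one bug with a check that fails before the fix and passes after it. This example is illustrative only: it is not taken from any `problem_x` directory, and the function names are invented for the sketch.

```python
def sum_first_n_buggy(values, n):
    """Intended to sum the first n items, but stops one item short."""
    total = 0
    for i in range(n - 1):  # bug: the loop bound should be n
        total += values[i]
    return total


def sum_first_n_fixed(values, n):
    """Sum the first n items of values."""
    total = 0
    for i in range(n):
        total += values[i]
    return total


def check(fn):
    """Return True when fn passes the correctness check."""
    return fn([1, 2, 3, 4], 3) == 6


print(check(sum_first_n_buggy))  # False: the buggy version returns 3
print(check(sum_first_n_fixed))  # True
```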