Spaces:

visualisable-ai
/

api

Paused

gary-boon Claude commited on Sep 16, 2025

Commit

9dbec03

1 Parent(s): c0d95bf

Fix syntax error in swe_bench_service.py

- Remove commented-out mock data function with nested triple quotes
- Triple quotes inside comments were causing Python syntax errors
- Clean removal ensures no mock data and fixes deployment

🤖 Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>

Files changed (1) hide show

backend/swe_bench_service.py +2 -67

backend/swe_bench_service.py CHANGED Viewed

@@ -95,73 +95,8 @@ class SWEBenchService:
         self.metrics_cache: Dict[str, Any] = {}
     # Removed _load_mock_tasks - real data only for research
-    """
-    def _load_mock_tasks(self):
-        # Load mock tasks when dataset isn't available
-        repos = [
-            "astropy/astropy", "django/django", "matplotlib/matplotlib",
-            "pandas-dev/pandas", "pytest-dev/pytest", "scikit-learn/scikit-learn"
-        ]
-        statements = [
-            """Modeling's `separability_matrix` does not compute separability correctly for nested CompoundModels
-Consider the following model:
-```python
-from astropy.modeling import models as m
-from astropy.modeling.separable import separable_matrix
-cm = m.Linear1D(10) & m.Linear1D(5)
-```
-It's separability matrix as you might expect is a diagonal:
-```python
->>> separability_matrix(cm)
-array([[ True, False],
-       [False,  True]])
-```""",
-            """Please support header rows in RestructuredText output
-### Description
-It would be great if the RestructuredText output could have header rows for tables, similar to what MySQL does for pipe formatting.
-### Expected behavior
-According to the documentation for MyST parsers, the docutils RST table expects the first row to be treated as a header row.
-### Actual behavior
-The RST output treats the first row as a regular data row and doesn't mark it as a header.""",
-            """Issue when parsing empty lists/arrays in configuration
-When attempting to parse empty lists or arrays from configuration files, the parser incorrectly raises a ValueError instead of returning an empty list.
-```python
->>> config.parse_list("[]")
-ValueError: invalid literal for int() with base 10: '[]'
-```
-Expected behavior: Should return an empty list []"""
-        ]
-        for i in range(300):  # Create 300 mock tasks for better testing
-            repo = repos[i % len(repos)]
-            repo_name = repo.split('/')[1]
-            issue_number = 11000 + i
-            task = SWEBenchTask(
-                instance_id=f"{repo_name}__{repo_name}-{issue_number}",
-                repo=repo,
-                problem_statement=statements[i % len(statements)],
-                base_commit=f"commit_{i:04d}",
-                patch="# Mock patch\n+ line added\n- line removed",
-                FAIL_TO_PASS=["test_1", "test_2"] if i % 2 == 0 else ["test_a"],
-                PASS_TO_PASS=["test_pass_1", "test_pass_2"]
-            )
-            self.tasks[task.instance_id] = task
-        logger.info(f"Loaded {len(self.tasks)} mock SWE-bench tasks")
-    """
     async def load_dataset(self, dataset_name: str = "princeton-nlp/SWE-bench_Lite"):
         """Load SWE-bench dataset from Hugging Face"""

         self.metrics_cache: Dict[str, Any] = {}
     # Removed _load_mock_tasks - real data only for research
+    # Mock data generation has been completely removed to ensure
+    # only real SWE-bench tasks are used for PhD research integrity
     async def load_dataset(self, dataset_name: str = "princeton-nlp/SWE-bench_Lite"):
         """Load SWE-bench dataset from Hugging Face"""