Commit
Β·
0f049b6
1
Parent(s):
9d02fee
docs: fix spec accuracy (line numbers and test counts)
Browse filesSPEC_01: Line reference 112 β 110 (actual location of .with_standard_manager)
SPEC_02: Test count 142 β 140 (verified with pytest --collect-only)
Both specs now match codebase exactly per audit verification.
docs/specs/SPEC_01_DEMO_TERMINATION.md
CHANGED
|
@@ -16,7 +16,7 @@ Advanced (Magentic) mode runs indefinitely from user perspective. The demo was m
|
|
| 16 |
### Question 1: Does max_round_count actually work?
|
| 17 |
|
| 18 |
```python
|
| 19 |
-
# Current code (src/orchestrator_magentic.py:
|
| 20 |
.with_standard_manager(
|
| 21 |
chat_client=manager_client,
|
| 22 |
max_round_count=self._max_rounds, # Default: 10
|
|
|
|
| 16 |
### Question 1: Does max_round_count actually work?
|
| 17 |
|
| 18 |
```python
|
| 19 |
+
# Current code (src/orchestrator_magentic.py:110)
|
| 20 |
.with_standard_manager(
|
| 21 |
chat_client=manager_client,
|
| 22 |
max_round_count=self._max_rounds, # Default: 10
|
docs/specs/SPEC_02_E2E_TESTING.md
CHANGED
|
@@ -4,7 +4,7 @@
|
|
| 4 |
|
| 5 |
## Problem Statement
|
| 6 |
|
| 7 |
-
We have
|
| 8 |
|
| 9 |
We don't know if:
|
| 10 |
1. Simple mode produces a valid report
|
|
@@ -122,7 +122,7 @@ async def test_real_pubmed_search():
|
|
| 122 |
|
| 123 |
```
|
| 124 |
tests/
|
| 125 |
-
βββ unit/ # Existing
|
| 126 |
βββ integration/ # Real API tests (existing)
|
| 127 |
βββ e2e/ # NEW: Full pipeline tests
|
| 128 |
βββ conftest.py # E2E fixtures
|
|
|
|
| 4 |
|
| 5 |
## Problem Statement
|
| 6 |
|
| 7 |
+
We have 140 unit tests that verify individual components work, but **no test that proves the full pipeline produces useful research output**.
|
| 8 |
|
| 9 |
We don't know if:
|
| 10 |
1. Simple mode produces a valid report
|
|
|
|
| 122 |
|
| 123 |
```
|
| 124 |
tests/
|
| 125 |
+
βββ unit/ # Existing 140 tests
|
| 126 |
βββ integration/ # Real API tests (existing)
|
| 127 |
βββ e2e/ # NEW: Full pipeline tests
|
| 128 |
βββ conftest.py # E2E fixtures
|