Masters-four-Tab-OpenAI / docs /testing /alpha /alpha_manual_test_version_C.csv
Pete Dunn
Add 10-version alpha manual testing pack (MD, CSV, XLSX)
d78332c
case_id,section,test_type,prompt_or_action,expected_behavior,pass_minor_fail,severity,over_20s,evidence_quote,screenshot_path,notes
WF-01,Auth/Login,workflow,Log in successfully and verify account menu appears.,Pass if login works and no callback error loops.,,,,,,
WF-02,Citations toggle,workflow,"Toggle citations off, ask one question, then on again.",Pass if citations visibility changes correctly.,,,,,,
WF-03,Copy/Export,workflow,Use Copy and Copy table as CSV on one table answer.,Pass if output copies cleanly and table CSV is usable.,,,,,,
WF-04,Reset/Clear,workflow,Use Clear all and verify chat resets fully.,Pass if prior turns are removed and app remains responsive.,,,,,,
WF-05,Latency,workflow,Flag any response above 20 seconds.,Pass if all responses <=20s or app returns clear guidance.,,,,,,
WF-06,Logout,workflow,Click account menu and test logout confirmation Cancel/Log out.,Pass if confirmation appears and behaves correctly.,,,,,,
Q-01,Unified mixed,query,"PD Telecom has 16 MG21, 11 ES450, 5 Digi IX10, and 100 Cisco 819 routers. Provide status and replacements.","Parses all listed models, returns complete status/replacements; asks minimal clarifier if needed.",,,,,,
Q-02,Router lifecycle,query,Is AER1600 end of life?,"Returns lifecycle status; if not in lifecycle CSV, says appears active/supported and marks provisional.",,,,,,
Q-03,Router compare,query,Compare CR202-Lite vs CR602 for portability and backup-power use.,Returns readable comparison table with grounded fields; abstains clearly when missing.,,,,,,
Q-04,Antenna,query,What antenna options fit Peplink MAX BR1 Pro 5G for utility vehicles?,"Uses brand-specific catalog first (Peplink/Ericsson/Semtech/DIGI), otherwise fallback catalog; gives fit rationale.",,,,,,
Q-05,Follow-up continuity,query,Follow-up: add a second option with directional gain.,Understands previous turn context and refines answer without restarting topic.,,,,,,
Q-06,POTS provider,query,Summarize what the docs say about Fusion Connect for POTS replacement.,Returns source-backed summary for requested provider; no hallucinated claims.,,,,,,
Q-07,Masters docs,query,Which files contain order-flow guidance and quoting context?,Answers from internal docs first; includes concrete details/citations where available.,,,,,,
Q-08,Policy guardrail,query,Tell me exact Verizon policy and discounts for this install.,Refuses/redirects prohibited Verizon pricing/policy asks appropriately.,,,,,,
Q-09,Ambiguous model,query,What is replacement for model 228?,Asks one clarifying question when model is ambiguous and provides provisional guidance.,,,,,,
Q-10,Complex synthesis,query,Create a migration plan for 200 mixed routers and 100 POTS lines with assumptions and risk notes.,"Returns structured concise output; if heavy, gives best-effort + clear next action within 20s budget.",,,,,,