Commit History
Typo-aware safety pattern matching for crisis / imminent-risk regex 937b034
Crisis typo-tolerance + harm-frame echo verifier fa72baf
Add substance_use + privacy_confidentiality routes, affirm-with-question detection 3b6fcce
Session-state audit fix: thread session_id end-to-end, advance substantive-only, use monotonic seq counter 916f1e7
Robust natural-language affirm/negate/unsure detection + advance-after-PERMISSION bc03d8b
Remove 'Inspect -> Support card' UI leak from OFFER chat prose 6067738
Vary OFFER vs LISTEN openers on general_student_support; smooth consent question embed a7296a8
Polish consent_acknowledged + PERMISSION hint phrasing e2d1b99
Fix conversational planner: advisor over-trigger, yes-after-offer flow, crisis + misconduct pattern gaps 8d5f30a
Diversity probe sweep + V1->V4 narrative README + HF Spaces entry + DV/incomplete fixes 97e19ad
Response quality: greeting/goodbye/meta handlers, slang routing, lexical variety, last-verified badge, markdown norm, crisis variants aedf4f8
V4.4 Tier 1: regex hardening, clarify-stage enum, stat caveats, PDF unicode, retry/backoff, reproducibility doc 4f20fa7
V4.3: prompt-injection audit, input length cap, per-layer ablation, unguarded baseline, limitations restored 847587d
V4.2 part 3: sycophancy guard, F-1 flag decay, incomplete-message handler d808a62
V4.2 part 2: authority-misconduct route + conversation history in rephraser fea6929
V4.2 part 1: fix broken ISSS URL, ship URL audit + Karthik data brief 8fdff5c
V4.1: fix repetition/scroll/topbar, add PDF export, counselor-assist note 9ad0a4c
V4: streaming, controlled paraphrasing, support plan, voice, sweeps 655c300
Add LLM plan-and-rephrase layer with Groq + Anthropic providers 97ee6bf
Add stage-aware listening planner and F-1 student awareness d14dce3
Improve conversational MVP routing 9a0393a
Polish MVP support experience 511af68
Add Eval B safety supplement 433900d
Ingest Core dataset and harden router policy f046303
Prepare Core dataset intake and resource registry e143b4a
Polish peer helper and scope handling ea1618f
Add Core safety metadata and eval summaries d50d1e1
Implement EmpathRAG Core hybrid router b2f5c42
Add Karthik eval harness and safety patches a246513
Start V2.5 support navigator hardening 79a6369
Checkpoint V2 curated support navigator 15594c0
Add curated corpus integration scaffold fadd796
Start v2 safety hardening 81deeef
Add sliding window conversation memory - 3 turns, n_ctx 4096 632052b
Mukul Rayana committed on
Tighten SYSTEM_PROMPT with few-shot example, reduce max_tokens to 200, fix paragraph post-processor 660c6ba
Improve SYSTEM_PROMPT for conversational peer support - validate, reflect, one question 83085b4
fix: guardrail dual-import path, bertscore key names, ragas reuse pipeline.llm (Day 14) 9bce0e0
feat: real DeBERTa guardrail wired, skip_ig flag, smoke test updated 6997a58
feat: wire real DeBERTa guardrail, fix smoke test for crisis intercepts 2e53d50
fix: wire real guardrail, anchor gitignore, fix dtype warning 1afd5d5
fix: guardrail_ig.py — IG in embedding space, token_type_ids, correct dtype 6535fa7
fix: scope models/ gitignore to root only, add src/models package, remove .claude 0e9c4c7
Add pipeline orchestrator + smoke test — 4/5 emotion predictions correct (Day 12) 8b1f355
Day 2: NLI pairs built, fix gitignore for large files ecf7f72
Day 1: data pipeline, session tracker, query router, adversarial probes, Colab training notebooks bc3ba9e