Commit History

Phase 13 Stage 1 v2: fix termination signal - reward penalties for non-termination, reduced max_tokens, stop_strings
5a432bb

yashash04 commited on

Phase 13 Stage 1: pivot to winner's pattern - TRL 0.29.0 + fast_inference=False + DAPO loss
cf4ce7e

yashash04 commited on

Phase 13 Stage 1: fix Unicode encoding artifacts in notebook comments
b0a62f6

yashash04 commited on

Phase 13 Stage 1: drop vLLM, use Kaggle native torch 2.10, upgrade trl to 0.18+ (Kaggle dep compatibility)
7247cb4

yashash04 commited on

Phase 13 Stage 1: apply all 7 notebook audit fixes (account labels + health check + hub namespace + explicit commenting)
dec6785

yashash04 commited on

Phase 13 Stage 1 prep: Kaggle runbook + Run 1 config pre-populated (notebook audit flags pending approval)
453c504

yashash04 commited on

docs: correct gpt-oss baseline overclaim β€” tiered win thresholds, cumulative is diagnostic not trophy
f746c6d

yashash04 commited on

docs: correct overclaim in gpt-oss baseline β€” honest win thresholds
12b83b4

yashash04 commited on

Phase 13 prep: ollama provider + gpt-oss:120b baseline eval (3 seeds Γ— 6 scenarios)
a95078f

yashash04 commited on

Phase 13 prep: 5-seed baseline evals + Kaggle Stage 1 kickoff checklist
964e93c

yashash04 commited on

docs: log Phase 11 M-tier scenarios (cumul=1.988 > E-tier 1.284, discriminability three-tier)
15fbd89

yashash04 commited on

Phase 11 fixup: deploy smoke /tasks count 3 -> 6
0fd93c4

yashash04 commited on

Phase 11: medium scenarios M1/M2/M3 + AdaptationRubric multi-drift verification
7828dcd

yashash04 commited on

docs: clean BUILD_LOG phases list + point to TRAINING_LOG; cross-ref Section 1 partial data
1f23161

yashash04 commited on

docs: log Phase 10 HF Space deploy (production step_shaping=0.1000 verified)
c6f9301

yashash04 commited on

Merge branch 'main' of https://huggingface.co/spaces/yashash045/schemashift
fc82aca

yashash04 commited on

initial commit
960e9b5
verified

yashash045 commited on

Phase 10: HF Space deploy β€” README frontmatter + DEPLOY.md + deploy smoke tests
6486c72

yashash04 commited on

docs: log Phase 9 results (discriminability gap 0.348 shaped, env validated)
20ccd3a

yashash04 commited on

Phase 9: baseline eval harness (heuristic + LLM agents) + tests
4d2f869

yashash04 commited on

docs: backfill Phase 8 BUILD_LOG entry + update README status
a3bcf96

yashash04 commited on

docs: add TRAINING_LOG.md template + training_logs dir for Phase 13 discipline
2c70741

yashash04 commited on

Phase 8: Env client + GRPO training skeleton (Kaggle notebook)
2464e9e

yashash04 commited on

Phase 7: FastAPI server + Dockerfile + openenv.yaml
cb33205

yashash04 commited on

Phase 6: SchemaShiftEnvironment scheduler β€” full episode lifecycle
c618a61

yashash04 commited on

Phase 5: Composable rubric grader + dense step shaping
46b8c56

yashash04 commited on

Phase 4: CRMAPI with 3 drifts + scenarios E1/E2/E3
bc8f184

yashash04 commited on

Phase 3: CalendarAPI with 2 drifts + DriftInjector
27dd958

yashash04 commited on

Phase 2: BaseTool + MailAPI with 3 drift handlers
8410ff1

yashash04 commited on

Phase 1: Pydantic models + validation tests
8cdb02b

yashash04 commited on

first commit
d4ab0f1

yashash04 commited on