Inject-Arena / notebooks

Commit History

feat: HF Space replay backend — trace store, /api endpoints, Docker, Cell 9
8c536e6

Jaswanth1210 Claude Sonnet 4.6 commited on

feat: 5 training plots (reward±std, KL/loss, completion stats) + Drive backup in Cell 8
ff63792

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: plots gitignore, trainer_state fallback for reward curve, Cell 8 --trainer-state arg
2fe5366

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: Cell 8 — pull before plots, GH_TOKEN push, copy Drive logs
19d8929

Jaswanth1210 commited on

Phase 7/8: full README, video script, make_plots.py, Cell 8 implementation
5244e53

Jaswanth1210 Claude Opus 4.7 commited on

feat: fill in Cell 7 for full training run (Phase 6)
f94c60c

Jaswanth1210 Claude Sonnet 4.6 commited on

Phase 5: training pipeline — client, GRPO trainer, eval, baselines (23 handcrafted attacks)
550a83e

Jaswanth1210 Claude Sonnet 4.6 commited on

Phase 4: InjectArenaEnv + FastAPI server + Dockerfile + env tests (81 passing)
b54a031

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: load SecAlign before PG2 in Cell 3 to avoid vLLM CUDA init conflict
3d180ef

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: add %cd /content/injectarena to cells 3+4 (lost after runtime restart)
6207128

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: FirewallWrapper falls back to pg2 instance when llamafirewall scanner fails
d089589

Jaswanth1210 Claude Sonnet 4.6 commited on

Phase 3: defense wrappers + Colab smoke/benchmark cells
a9424d2

Jaswanth1210 Claude Sonnet 4.6 commited on

Notebook: hard-code REPO_URL to Inject-Arena GitHub
1e979cd

Jaswanth1210 Claude Opus 4.7 commited on

Phase 0: bootstrap
15bf5e6

Jaswanth1210 Claude Opus 4.7 commited on