Commit History

Upload frontend/src/index.css with huggingface_hub
343e8dc
verified

Jaswanth-K commited on

Upload frontend/src/App.jsx with huggingface_hub
b5b3db2
verified

Jaswanth-K commited on

feat: instant demo mode, inline results after attack, 3-option launch modal
3191547

Jaswanth1210 Claude Opus 4.7 commited on

feat: training results dashboard with stats cards, comparison table, and plots
474639f

Jaswanth1210 Claude Opus 4.7 commited on

feat: launch mode modal with demo vs live options and time estimate
d882fcd

Jaswanth1210 Claude Opus 4.7 commited on

feat: battlefield visualization β€” attacker/firewalls/agent with live reward graph
ff4fe4a

Jaswanth1210 Claude Opus 4.7 commited on

feat: fill in real results, fix openenv.yaml Space URL
bc3c044

Jaswanth1210 Claude Opus 4.7 commited on

feat: real continuous GRPO reward curve + 5-plot dashboard
723a54a

Jaswanth1210 Claude Sonnet 4.6 commited on

feat: launch-mode modal (recorded vs live) + reframed success card
32513fb

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: shorten HF short_description to <= 60 chars
c884702

Jaswanth1210 commited on

fix: HF Space URLs point to Jaswanth-K/Inject-Arena (the actual Space)
46d858d

Jaswanth1210 Claude Sonnet 4.6 commited on

feat: wire frontend to real API, ship plots, multi-stage Docker build
5cceafb

Jaswanth1210 Claude Sonnet 4.6 commited on

Phase 7: replay traces for HF Space demo
e60c2d2

colab commited on

Phase 7: replay traces for HF Space demo
a4470c7

colab commited on

feat: HF Space replay backend β€” trace store, /api endpoints, Docker, Cell 9
8c536e6

Jaswanth1210 Claude Sonnet 4.6 commited on

feat: 5 training plots (rewardΒ±std, KL/loss, completion stats) + Drive backup in Cell 8
ff63792

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: plots gitignore, trainer_state fallback for reward curve, Cell 8 --trainer-state arg
2fe5366

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: Cell 8 β€” pull before plots, GH_TOKEN push, copy Drive logs
19d8929

Jaswanth1210 commited on

Phase 7/8: full README, video script, make_plots.py, Cell 8 implementation
5244e53

Jaswanth1210 Claude Opus 4.7 commited on

feat: fill in Cell 7 for full training run (Phase 6)
f94c60c

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: use huggingface_hub.login() before LoRA download to fix 401
013046f

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: pre-download SecAlign LoRA before vLLM init to avoid 401 in subprocess
b5c4619

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: dtype kwarg (torch_dtype deprecated), vLLM max_model_len=4096
aaa7c61

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: max_completion_length 512β†’128, firewall circuit-breaker
b7d3a14

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: drop BnB 4-bit, load attacker in plain bf16
6aebb94

Jaswanth1210 Claude Opus 4.7 commited on

fix: dtype kwarg + cast lm_head/embeds to bfloat16 to fix GRPO generate
33bf00a

Jaswanth1210 commited on

fix: add torch_dtype=bfloat16 to prevent Float/BFloat16 mismatch in GRPO
b42adcc

Jaswanth1210 commited on

fix: GRPO batch_size must be divisible by num_generations (1β†’4)
0d411fb

Jaswanth1210 commited on

fix: skip Unsloth in GRPO trainer (grpo_accumulated_loss signature mismatch)
17a9ff7

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: stub GuidedDecodingParams for vLLM 0.19+ / TRL compatibility
deab900

Jaswanth1210 Claude Sonnet 4.6 commited on

Phase 5: training pipeline β€” client, GRPO trainer, eval, baselines (23 handcrafted attacks)
550a83e

Jaswanth1210 Claude Sonnet 4.6 commited on

Phase 4: InjectArenaEnv + FastAPI server + Dockerfile + env tests (81 passing)
b54a031

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: load SecAlign before PG2 in Cell 3 to avoid vLLM CUDA init conflict
3d180ef

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: add %cd /content/injectarena to cells 3+4 (lost after runtime restart)
6207128

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: FirewallWrapper falls back to pg2 instance when llamafirewall scanner fails
d089589

Jaswanth1210 Claude Sonnet 4.6 commited on

fix: LlamaFirewall async handling + conditional AgentAlignment (TOGETHER_API_KEY)
90afc08

Jaswanth1210 commited on

fix: SecAlign fallback uses transformers+4bit instead of vLLM (T4 CUDA init conflict)
730165b

Jaswanth1210 commited on

fix: PG2 label check β€” model returns LABEL_0/LABEL_1, not MALICIOUS/BENIGN
4baabbe

Jaswanth1210 commited on

Phase 3: defense wrappers + Colab smoke/benchmark cells
a9424d2

Jaswanth1210 Claude Sonnet 4.6 commited on

Fix: widen requires-python to include 3.12
0c220a9

Jaswanth1210 Claude Sonnet 4.6 commited on

Notebook: hard-code REPO_URL to Inject-Arena GitHub
1e979cd

Jaswanth1210 Claude Opus 4.7 commited on

Phase 2: verifiers, embedding cache, reward function
c59510c

Jaswanth1210 Claude Opus 4.7 commited on

Phase 1: schemas, safety filter, scenario bank
383f8a5

Jaswanth1210 Claude Opus 4.7 commited on

Phase 0: bootstrap
15bf5e6

Jaswanth1210 Claude Opus 4.7 commited on