feat: wire frontend to real API, ship plots, multi-stage Docker build 5cceafb Jaswanth1210 Claude Sonnet 4.6 commited on 21 days ago
feat: HF Space replay backend β trace store, /api endpoints, Docker, Cell 9 8c536e6 Jaswanth1210 Claude Sonnet 4.6 commited on 21 days ago
fix: use huggingface_hub.login() before LoRA download to fix 401 013046f Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
fix: pre-download SecAlign LoRA before vLLM init to avoid 401 in subprocess b5c4619 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
fix: dtype kwarg (torch_dtype deprecated), vLLM max_model_len=4096 aaa7c61 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
fix: max_completion_length 512β128, firewall circuit-breaker b7d3a14 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
fix: skip Unsloth in GRPO trainer (grpo_accumulated_loss signature mismatch) 17a9ff7 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
Phase 4: InjectArenaEnv + FastAPI server + Dockerfile + env tests (81 passing) b54a031 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
fix: FirewallWrapper falls back to pg2 instance when llamafirewall scanner fails d089589 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
fix: LlamaFirewall async handling + conditional AgentAlignment (TOGETHER_API_KEY) 90afc08 Jaswanth1210 commited on 22 days ago
fix: SecAlign fallback uses transformers+4bit instead of vLLM (T4 CUDA init conflict) 730165b Jaswanth1210 commited on 22 days ago
fix: PG2 label check β model returns LABEL_0/LABEL_1, not MALICIOUS/BENIGN 4baabbe Jaswanth1210 commited on 22 days ago
Phase 3: defense wrappers + Colab smoke/benchmark cells a9424d2 Jaswanth1210 Claude Sonnet 4.6 commited on 22 days ago
Phase 2: verifiers, embedding cache, reward function c59510c Jaswanth1210 Claude Opus 4.7 commited on 22 days ago
Phase 1: schemas, safety filter, scenario bank 383f8a5 Jaswanth1210 Claude Opus 4.7 commited on 22 days ago