Commit History

docs: drop 'stated honestly' phrasing from card heading
dada8aa
verified

dannyliv commited on

docs: clearer benefit-focused model card, fix library_name metadata to transformers (merged model at repo root)
b1f2c9b
verified

dannyliv commited on

Move V3.2 adapter to adapter/ subfolder so root loads the merged model directly
7d83f9d
verified

dannyliv commited on

Ship V3.2: GCG-hardened weights, merged model + ONNX rebuild, honest FPR disclosure
fd38276
verified

dannyliv commited on

docs(card): add 2026-05-16 project-status note; drop stale ONNX latency figure
045a422
verified

dannyliv commited on

eval(lg3): full comparison table vs LlamaGuard-3-8B
4e06e85
verified

dannyliv commited on

audit phase 3: drop ShieldGemma references; lock LG3-vs-DeBERTa headline framing
c5e7497
verified

dannyliv commited on

audit-fixes: GCG eval (DA #8) + LlamaGuard-3-8B comparison (DA #1)
fae87aa
verified

dannyliv commited on

audit-fixes 2: disclose benign-input FPR (Dolly-15k n=500); ModernBERT 7.4% FPR @ t=0.5 critical for users
d245d35
verified

dannyliv commited on

audit-fixes: canonical t=0.5 headline, drop '#1' star, disclose comparison scope, remove unbenchmarked 18ms latency
78333bd
verified

dannyliv commited on

Style: remove em-dashes (CLAUDE.md Part I, also keeps YAML model-index parseable)
5177f5a
verified

dannyliv commited on

Add per-label classification table (17 heads) + red-team loop status note
01ca5a9
verified

dannyliv commited on

Add Problem-statement + model-selection guide + hardware requirements
4f76a4b
verified

dannyliv commited on

Add Methodology section: sample counts, GOAT techniques, autoresearch loop
c59bfc7
verified

dannyliv commited on

Strip training-cost mentions
5e502fe
verified

dannyliv commited on

Add 'Attack types covered & how it was trained' section with linked sources; remove excluded-model asides
1281341
verified

dannyliv commited on

Add benchmark links + explanations; remove excluded-model asides
ffe4c3a
verified

dannyliv commited on

Restructure model card with full HF metadata (model-index, datasets, metrics, pipeline_tag, intended-use, limitations, citation)
41c0279
verified

dannyliv commited on

Upload onnx/config.json with huggingface_hub
09d3719
verified

dannyliv commited on

ONNX export (opset 18 via optimum)
41b3674
verified

dannyliv commited on

Initial model card: best JBB F1 (0.727) of 9 tested classifiers
7d83e54
verified

dannyliv commited on

Model save
92502bf
verified

dannyliv commited on

Training in progress, step 3351
7feec2b
verified

dannyliv commited on

Training in progress, step 3000
1b81646
verified

dannyliv commited on

Training in progress, step 2000
0129de5
verified

dannyliv commited on

Training in progress, step 1000
1b45d41
verified

dannyliv commited on

Model save
98446fc
verified

dannyliv commited on

Training in progress, step 3351
681e1b2
verified

dannyliv commited on

Training in progress, step 3000
e8752bd
verified

dannyliv commited on

Training in progress, step 2000
2b92c30
verified

dannyliv commited on

Training in progress, step 1000
0eafd11
verified

dannyliv commited on

initial commit
b7b2e07
verified

dannyliv commited on