Implement chat template wrapping for prompt tokenization and add related CLI options db65c83 juiceb0xc0de committed on 8 days ago
refactor: update logging cadence for per-pair AtP metrics and improve GPU memory handling 71b213a juiceb0xc0de committed on 9 days ago
wandb: grouped per-stage telemetry across the full pipeline 7b91880 juiceb0xc0de Claude committed on 9 days ago
atlas: keep scratch on the 800GB volume, off the container disk 833e57c juiceb0xc0de Claude committed on 9 days ago
census recovery: crash-safe metadata journal + F2 chunk-naming support 3574445 juiceb0xc0de Claude committed on 9 days ago
io: tighten temp-path headroom to 1.05x and pick path once 4768d28 juiceb0xc0de Claude committed on 9 days ago
io: size-aware temp path selection (fix /dev/shm ENOSPC crash) bf09bb2 juiceb0xc0de Claude committed on 9 days ago
census: GPU reductions + cross-batch overlap + tmpfs/chunked finalize bf96682 juiceb0xc0de Claude committed on 9 days ago
Fix wandb init: min_hrt_time rejected by newer wandb Settings af7d6b1 juiceb0xc0de Claude committed on 9 days ago
Add interactive HF/W&B auth prompt + W&B OOM telemetry to census extractor 861f9f2 juiceb0xc0de Claude committed on 9 days ago
feat(sub_zero): W&B instrumentation with crash-safe per-step memory logging 87a3b18 juiceb0xc0de Claude committed on 10 days ago
perf(sub_zero): kill AtP per-pair thrash (autograd.grad + GPU reduction) e5ce61e juiceb0xc0de Claude committed on 10 days ago
feat(sub_zero): default --all-layers for full 100% layer coverage 074aaf8 juiceb0xc0de Claude committed on 10 days ago
fix(app): skip OV-circuit + compliance stages when already done 3855f70 juiceb0xc0de Claude committed on 10 days ago
fix(sub_zero): resume Stage 3 AtP per-pair + scope grad to kill OOM c00987b juiceb0xc0de Claude committed on 10 days ago
feat: add --pause-between-stages flag for long atlas runs bdab633 juiceb0xc0de committed on 10 days ago
fix: skip existing analysis before loading .npz; avoid redundant RAM use 524fed5 juiceb0xc0de committed on 10 days ago
fix: cap parallel analysis workers at 4 to avoid RAM OOM 777e9a0 juiceb0xc0de committed on 10 days ago
fix: move _analyze_one to module level for pickling; clear CUDA cache during AtP 963c022 juiceb0xc0de committed on 10 days ago
fix: Implement parallel processing for layer analysis to improve performance 02921ba juiceb0xc0de committed on 10 days ago
fix: Enhance hook registration to handle empty inputs and support kwargs 708d1c2 juiceb0xc0de committed on 10 days ago
Add prompts for illegal activities and malicious cyber actions to red_team_stems.jsonl 55f1362 juiceb0xc0de committed on 10 days ago
fix: Update finalize_census.py to set default workers to 1 for improved performance in parallel finalization 28b3a32 juiceb0xc0de committed on 10 days ago
fix: Update finalize_census.py to use argparse for improved command-line argument handling and parallelization options b05b02f juiceb0xc0de committed on 10 days ago
fix: Enhance per-token analysis by creating per-component scratch directories for separation scores f6ae04e juiceb0xc0de committed on 10 days ago
fix: Update file names and argument defaults for Sub-Zero corpora in various scripts d2163b7 juiceb0xc0de committed on 10 days ago
fix: Enhance compliance behavior handling and improve comparison output in analysis tools 275e282 juiceb0xc0de committed on 10 days ago
fix: Improve handling of ragged object arrays and optimize preallocation in write_npz_array_stream 763530b juiceb0xc0de committed on 10 days ago
fix: Import numpy in local_census.py for array operations d61de4f juiceb0xc0de committed on 10 days ago
fix: Update CPU Space run instructions for Dolphin3.0-Llama3.2-3B to use placeholders for model and dataset identifiers 4eff620 juiceb0xc0de committed on 10 days ago
Add a variety of supportive and engaging responses to corporate.jsonl for enhanced user interaction 7dbb050 juiceb0xc0de committed on 10 days ago
feat: Add detailed CPU run instructions for Dolphin3.0-Llama3.2-3B, including smoke test and component staging 36a1e40 juiceb0xc0de committed on 11 days ago
fix: Update .gitignore to include interpretations directory and 1.ipynb; modify make_compliance_corpora.py for clarity in corpus categorization 798480d juiceb0xc0de committed on 12 days ago
fix: Improve layer count retrieval in _layer_count function to handle nested configurations and add error handling 31513b8 juiceb0xc0de committed on 12 days ago
fix: Update layer count function to handle nested model configurations for accurate parameter estimation 9c9d951 juiceb0xc0de committed on 12 days ago
feat: Enhance database schema and processing for logit lens and residual tracking 1b0916a juiceb0xc0de committed on 12 days ago
Rename compliance-behaviour corpora to 'authentic.jsonl' and 'corporate.jsonl' for clarity; update references in app.py and README.md dde5140 juiceb0xc0de committed on 12 days ago
Add timing functionality to extraction process with configurable intervals c86f8d7 juiceb0xc0de committed on 13 days ago
Add EXAONE layer/attention/MLP name aliases; attn/trust flags 99d89e0 juiceb0xc0de committed on 13 days ago
Refactor model loading to support dynamic trust_remote_code and attention implementation options 536f664 juiceb0xc0de committed on 13 days ago
Fix indentation in space_keepalive.py for consistent formatting 401a183 juiceb0xc0de committed on 14 days ago