Commit History

rls t5 large
9f858c3

natmin322 committed on

rls t5 large
2c4cffd

natmin322 committed on

v9-fix: oracle training routing, calibrated inference, update docs
5a14212

natmin322 committed on

v7-fix: enable C5 for task 2+ (add --run_single False, --n_batches_c5 100)
060b67d

natmin322 committed on

v7: C5 Data-Informed Subspace Init + restructure contributions to 2 core claims
aeb2d78

natmin322 committed on

fix: reduce CPU RAM to prevent OOM SIGKILL - gc.collect, del temps, eval_accumulation_steps, cache cleanup
c03ffe2

natmin322 committed on

Revert "fix: reduce OOM in V2 script - fp16+gradient_ckpt for T4, EVAL_BSZ 128->32"
46044f0

natmin322 committed on

fix: reduce OOM in V2 script - fp16+gradient_ckpt for T4, EVAL_BSZ 128->32
33d5e3a

natmin322 committed on

fix: add trust_remote_code=True to second load_dataset call in run_llama.py
e9acf77

natmin322 committed on

dataset: allow custom dataset code by setting trust_remote_code=True in run_llama.py
55f7d25

natmin322 committed on

C4: Spectrally-Conditioned LoRA Training — preconditioned gradient + spectral entropy regularization
2d42b51

natmin322 committed on

SpecRoute V3: adaptive bias, symmetric inference, threshold 0.995, batch size optimization
9ea634d

natmin322 committed on

fix: eval_strategy epoch (was steps=10, causing 437 evals/task → Kaggle session kill)
dd291ef

natmin322 committed on

fix llama
1c30686

natmin322 committed on

fix llama
99e2af7

natmin322 committed on