rollback: revert to last working Dockerfile and train.py e30d685 unverified Jayant-Kernel committed on Apr 26
fix: proper GRPO with trl 0.12.2 no-deps + force hub downgrade 0efac4a unverified Jayant-Kernel committed on Apr 26
fix: force reinstall huggingface_hub 0.24.7 after deceit_env 54fc539 unverified Jayant-Kernel committed on Apr 26
fix: pin huggingface_hub 0.24.7, install trl with --no-deps a0058bb unverified Jayant-Kernel committed on Apr 26
fix: trl 0.12.2 has GRPOTrainer, pin all deps before trl install 430098b unverified Jayant-Kernel committed on Apr 26
fix: install transformers 4.46.0 BEFORE trl so trl doesn't upgrade it 9264b56 unverified Jayant-Kernel committed on Apr 26
fix: bust docker cache force reinstall trl 0.11.4 e9971fb unverified Jayant-Kernel committed on Apr 26
fix: trl 0.11.4 + transformers 4.46.0 + processing_class e8f541c unverified Jayant-Kernel committed on Apr 26
fix: trl 0.9.4 + transformers 4.41.2 compatible versions e48f580 unverified Jayant-Kernel committed on Apr 26
fix: tokenizer not processing_class, torch cu121 for GPU 56567fd unverified Jayant-Kernel committed on Apr 26
fix: find deceit_env package location and copy data correctly 11baf5d Jayant-Kernel committed on Apr 26
fix: revert to torch 2.1.0 cu121 with trl 0.7.4 - versions that worked before 10648d1 Jayant-Kernel committed on Apr 26
fix: trl 0.12.0 has GRPOTrainer, compatible with torch 2.4.0 84d05af Jayant-Kernel committed on Apr 26
fix: copy data to multiple locations, fallback path for level2 d75e720 Jayant-Kernel committed on Apr 25
fix: replace unsloth with standard transformers+peft, no version conflicts 09c2a70 unverified Jayant-Kernel committed on Apr 25
fix: pin torch 2.1.0 and compatible versions to avoid torchao conflict dc2aaf0 unverified Jayant-Kernel committed on Apr 25
fix: install deps in Dockerfile build, not runtime 3470129 unverified Jayant-Kernel committed on Apr 25