rollback: revert to last working Dockerfile and train.py e30d685 unverified Jayant-Kernel commited on about 1 month ago
fix: proper GRPO with trl 0.12.2 no-deps + force hub downgrade 0efac4a unverified Jayant-Kernel commited on about 1 month ago
fix: custom training loop without TRL dependency 5232a98 unverified Jayant-Kernel commited on about 1 month ago
fix: trl 0.12.2 has GRPOTrainer, pin all deps before trl install 430098b unverified Jayant-Kernel commited on about 1 month ago
fix: try multiple import paths for GRPOConfig 2cdce1f unverified Jayant-Kernel commited on about 1 month ago
fix: trl 0.11.4 + transformers 4.46.0 + processing_class e8f541c unverified Jayant-Kernel commited on about 1 month ago
fix: trl 0.9.4 + transformers 4.41.2 compatible versions e48f580 unverified Jayant-Kernel commited on about 1 month ago
fix: remove tokenizer arg from GRPOTrainer f3d865a unverified Jayant-Kernel commited on about 1 month ago
fix: tokenizer not processing_class, torch cu121 for GPU 56567fd unverified Jayant-Kernel commited on about 1 month ago
fix: CPU fallback when no GPU detected 4c4c68a unverified Jayant-Kernel commited on about 1 month ago
improve: abstention penalty, better prompt, mixed curriculum, more steps 253d1ff Jayant-Kernel commited on Apr 26
fix: copy data to multiple locations, fallback path for level2 d75e720 Jayant-Kernel commited on Apr 25
fix: replace unsloth with standard transformers+peft, no version conflicts 09c2a70 unverified Jayant-Kernel commited on Apr 25
fix: install deps in Dockerfile build, not runtime 3470129 unverified Jayant-Kernel commited on Apr 25
Add health server on port 7860 for HF Spaces keep-alive 32b9179 unverified Jayant-Kernel commited on Apr 25
fix: download datasets from GitHub at runtime instead of relying on package data 0592f6a unverified Jayant-Kernel commited on Apr 25
fix: install unsloth_zoo and nest-asyncio properly 2a3f319 unverified Jayant-Kernel commited on Apr 25