Clean up unused hackathon markdown files and update setup script link 84ccd7d ARKAISW commited on Apr 26
Add final blog post detailing from numeric to semantic reasoning and personal origin 577da98 ARKAISW commited on Apr 26
README: Update training plots with final winning results (93% compliance, 0->4.5 reward) 2705495 ARKAISW commited on Apr 26
README: Final results update — 93% risk compliance, 88% governance, semantic reasoning v2.0 91407ea ARKAISW commited on Apr 26
Semantic observation prompts — rich text replaces raw floats (judge feedback #1) 213c699 ARKAISW commited on Apr 26
Final Hackathon Polish: Left-side dashboard layout, SVG connection scaling, and 0.4s simulation speedup 5686d79 ARKAISW commited on Apr 26
Hard Forced FP16 precision patch with surgical head casting and stability limits c922be6 ARKAISW commited on Apr 26
Apply aggressive GRPO stability patch: LR=1e-5, len=64, norm=0.5, BFloat16 c7fb92b ARKAISW commited on Apr 26
Disable trainer-level fp16 scaling to resolve GradScaler unscale crash 450aae5 ARKAISW commited on Apr 26
Global Float16 precision patch: align BitsAndBytes, model loading, and GRPOTrainer args 0401fe0 ARKAISW commited on Apr 26
Surgical precision fix: explicitly cast lm_head and embed_tokens to compute_dtype d6d5c2e ARKAISW commited on Apr 26
Fix precision mismatch during generate() by casting model to compute dtype 471d5b7 ARKAISW commited on Apr 26
Patch missing TRANSFORMERS_CACHE variable for llm_blender compatibility 2e54203 ARKAISW commited on Apr 26
Refactor train_hf.py to use pure PEFT/Transformers to avoid Unsloth precision bugs 6cb169b ARKAISW commited on Apr 26
Add standalone HF Jobs GRPO training script (500 steps, 8 gens, sample output logging) 5c3b197 ARKAISW commited on Apr 25
Remove broken auto-generated plots in favor of live Kaggle training evidence d9a6265 ARKAISW commited on Apr 25
Update README with requirements alignment, Colab/Kaggle links, and live Kaggle training evidence a45e838 ARKAISW commited on Apr 25
Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat 48b2f2f ARKAISW commited on Apr 25
Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle 96fac64 ARKAISW commited on Apr 25