Commit History

Remove blog_temp.md from repo
5172f48
Running

ARKAISW commited on

Remove GitHub mention from blog per HF submission requirements
3d75bac

ARKAISW commited on

Clean up unused hackathon markdown files and update setup script link
84ccd7d

ARKAISW commited on

Update blog with temp blog content
9a6d252

ARKAISW commited on

Refine blog.md to improve credibility and tone based on feedback
c66becb

ARKAISW commited on

Add final blog post detailing from numeric to semantic reasoning and personal origin
577da98

ARKAISW commited on

README: Update final model repository link
29bde0e

ARKAISW commited on

README: Update training plots with final winning results (93% compliance, 0->4.5 reward)
2705495

ARKAISW commited on

README: Final results update — 93% risk compliance, 88% governance, semantic reasoning v2.0
91407ea

ARKAISW commited on

README: Fix ASCII diagram alignment with standard characters
7101ea1

ARKAISW commited on

README: Highlight v2.0 Semantic Reasoning (Judge Step 1 Complete)
8ddae74

ARKAISW commited on

Semantic observation prompts — rich text replaces raw floats (judge feedback #1)
213c699

ARKAISW commited on

Add compulsory Google Colab link to README
ee53741

ARKAISW commited on

Update submission blog link to compulsory HF URL
ca3c3ed

ARKAISW commited on

Final tweaks to server and requirements
d5dc8ac

ARKAISW commited on

Update requirements-space.txt
7164a8e

ARKAISW commited on

Final manual tweaks to server imports
5b8d71e

ARKAISW commited on

Real ML Demo + Stability Fixes
8489eaa

ARKAISW commited on

Final Hackathon Polish: Left-side dashboard layout, SVG connection scaling, and 0.4s simulation speedup
5686d79

ARKAISW commited on

Hard Forced FP16 precision patch with surgical head casting and stability limits
c922be6

ARKAISW commited on

Apply aggressive GRPO stability patch: LR=1e-5, len=64, norm=0.5, BFloat16
c7fb92b

ARKAISW commited on

Update README deliverables with High-Performance training link
67590a7

ARKAISW commited on

Disable trainer-level fp16 scaling to resolve GradScaler unscale crash
450aae5

ARKAISW commited on

Global Float16 precision patch: align BitsAndBytes, model loading, and GRPOTrainer args
0401fe0

ARKAISW commited on

Surgical precision fix: explicitly cast lm_head and embed_tokens to compute_dtype
d6d5c2e

ARKAISW commited on

Fix precision mismatch during generate() by casting model to compute dtype
471d5b7

ARKAISW commited on

Patch warnings_issued attribute to fix TRL crash
13ae551

ARKAISW commited on

Patch missing TRANSFORMERS_CACHE variable for llm_blender compatibility
2e54203

ARKAISW commited on

Refactor train_hf.py to use pure PEFT/Transformers to avoid Unsloth precision bugs
6cb169b

ARKAISW commited on

Fix LoRA dtype mismatch by explicitly coercing to bfloat16
d3f82d2

ARKAISW commited on

Fix Unsloth chunks crash by setting mini batch to 64
326c883

ARKAISW commited on

Fix Unsloth GRPOConfig compilation attribute name
fa91384

ARKAISW commited on

Fix total_memory attribute typo
f731267

ARKAISW commited on

Add standalone HF Jobs GRPO training script (500 steps, 8 gens, sample output logging)
5c3b197

ARKAISW commited on

Rename WRITEUP.md to blog.md and fix README link
1d0aea6

ARKAISW commited on

Remove broken auto-generated plots in favor of live Kaggle training evidence
d9a6265

ARKAISW commited on

Update README with requirements alignment, Colab/Kaggle links, and live Kaggle training evidence
a45e838

ARKAISW commited on

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat
48b2f2f

ARKAISW commited on

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle
96fac64

ARKAISW commited on

Inject unsloth_logit_chunk_multiplier into GRPOConfig
6fe98d9

ARKAISW commited on

Inject all expected Unsloth GRPO args
707a332

ARKAISW commited on

Inject unsloth_num_chunks into GRPOConfig
da2eb01

ARKAISW commited on

Remove unsupported kwargs from GRPOConfig
2f2fb3e

ARKAISW commited on

Disable PatchFastRL to fix Colab OSError
ba90699

ARKAISW commited on

Fix openenv dependency name in notebook
2ed1d89

ARKAISW commited on

Fix notebook dependencies and CUDA attribute
b22b8d5

ARKAISW commited on

Add Colab GRPO training notebook
117a7c7

ARKAISW commited on

Add pyarrow to notebook dependencies to fix colab mismatch
a90f241

ARKAISW commited on

Fix openenv missing from notebook
7331420

ARKAISW commited on

Update latest changes
aec0295

ARKAISW commited on