Spaces:

ARKAISW
/

QuantHive

Sleeping

App Files Files Community

QuantHive

Commit History

Remove blog_temp.md from repo

5172f48

ARKAISW commited on Apr 26

Remove GitHub mention from blog per HF submission requirements

3d75bac

ARKAISW commited on Apr 26

Clean up unused hackathon markdown files and update setup script link

84ccd7d

ARKAISW commited on Apr 26

Update blog with temp blog content

9a6d252

ARKAISW commited on Apr 26

Refine blog.md to improve credibility and tone based on feedback

c66becb

ARKAISW commited on Apr 26

Add final blog post detailing from numeric to semantic reasoning and personal origin

577da98

ARKAISW commited on Apr 26

README: Update final model repository link

29bde0e

ARKAISW commited on Apr 26

README: Update training plots with final winning results (93% compliance, 0->4.5 reward)

2705495

ARKAISW commited on Apr 26

README: Final results update — 93% risk compliance, 88% governance, semantic reasoning v2.0

91407ea

ARKAISW commited on Apr 26

README: Fix ASCII diagram alignment with standard characters

7101ea1

ARKAISW commited on Apr 26

README: Highlight v2.0 Semantic Reasoning (Judge Step 1 Complete)

8ddae74

ARKAISW commited on Apr 26

Semantic observation prompts — rich text replaces raw floats (judge feedback #1)

213c699

ARKAISW commited on Apr 26

Add compulsory Google Colab link to README

ee53741

ARKAISW commited on Apr 26

Update submission blog link to compulsory HF URL

ca3c3ed

ARKAISW commited on Apr 26

Final tweaks to server and requirements

d5dc8ac

ARKAISW commited on Apr 26

Update requirements-space.txt

7164a8e

ARKAISW commited on Apr 26

Final manual tweaks to server imports

5b8d71e

ARKAISW commited on Apr 26

Real ML Demo + Stability Fixes

8489eaa

ARKAISW commited on Apr 26

Final Hackathon Polish: Left-side dashboard layout, SVG connection scaling, and 0.4s simulation speedup

5686d79

ARKAISW commited on Apr 26

Hard Forced FP16 precision patch with surgical head casting and stability limits

c922be6

ARKAISW commited on Apr 26

Apply aggressive GRPO stability patch: LR=1e-5, len=64, norm=0.5, BFloat16

c7fb92b

ARKAISW commited on Apr 26

Update README deliverables with High-Performance training link

67590a7

ARKAISW commited on Apr 26

Disable trainer-level fp16 scaling to resolve GradScaler unscale crash

450aae5

ARKAISW commited on Apr 26

Global Float16 precision patch: align BitsAndBytes, model loading, and GRPOTrainer args

0401fe0

ARKAISW commited on Apr 26

Surgical precision fix: explicitly cast lm_head and embed_tokens to compute_dtype

d6d5c2e

ARKAISW commited on Apr 26

Fix precision mismatch during generate() by casting model to compute dtype

471d5b7

ARKAISW commited on Apr 26

Patch warnings_issued attribute to fix TRL crash

13ae551

ARKAISW commited on Apr 26

Patch missing TRANSFORMERS_CACHE variable for llm_blender compatibility

2e54203

ARKAISW commited on Apr 26

Refactor train_hf.py to use pure PEFT/Transformers to avoid Unsloth precision bugs

6cb169b

ARKAISW commited on Apr 26

Fix LoRA dtype mismatch by explicitly coercing to bfloat16

d3f82d2

ARKAISW commited on Apr 25

Fix Unsloth chunks crash by setting mini batch to 64

326c883

ARKAISW commited on Apr 25

Fix Unsloth GRPOConfig compilation attribute name

fa91384

ARKAISW commited on Apr 25

Fix total_memory attribute typo

f731267

ARKAISW commited on Apr 25

Add standalone HF Jobs GRPO training script (500 steps, 8 gens, sample output logging)

5c3b197

ARKAISW commited on Apr 25

Rename WRITEUP.md to blog.md and fix README link

1d0aea6

ARKAISW commited on Apr 25

Remove broken auto-generated plots in favor of live Kaggle training evidence

d9a6265

ARKAISW commited on Apr 25

Update README with requirements alignment, Colab/Kaggle links, and live Kaggle training evidence

a45e838

ARKAISW commited on Apr 25

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat

48b2f2f

ARKAISW commited on Apr 25

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle

96fac64

ARKAISW commited on Apr 25

Inject unsloth_logit_chunk_multiplier into GRPOConfig

6fe98d9

ARKAISW commited on Apr 25

Inject all expected Unsloth GRPO args

707a332

ARKAISW commited on Apr 25

Inject unsloth_num_chunks into GRPOConfig

da2eb01

ARKAISW commited on Apr 25

Remove unsupported kwargs from GRPOConfig

2f2fb3e

ARKAISW commited on Apr 25

Disable PatchFastRL to fix Colab OSError

ba90699

ARKAISW commited on Apr 25

Fix openenv dependency name in notebook

2ed1d89

ARKAISW commited on Apr 25

Fix notebook dependencies and CUDA attribute

b22b8d5

ARKAISW commited on Apr 25

Add Colab GRPO training notebook

117a7c7

ARKAISW commited on Apr 25

Add pyarrow to notebook dependencies to fix colab mismatch

a90f241

ARKAISW commited on Apr 25

Fix openenv missing from notebook

7331420

ARKAISW commited on Apr 25

Update latest changes

aec0295

ARKAISW commited on Apr 25

Commit History

Remove blog_temp.md from repo 5172f48

Remove GitHub mention from blog per HF submission requirements 3d75bac

Clean up unused hackathon markdown files and update setup script link 84ccd7d

Update blog with temp blog content 9a6d252

Refine blog.md to improve credibility and tone based on feedback c66becb

Add final blog post detailing from numeric to semantic reasoning and personal origin 577da98

README: Update final model repository link 29bde0e

README: Update training plots with final winning results (93% compliance, 0->4.5 reward) 2705495

README: Final results update — 93% risk compliance, 88% governance, semantic reasoning v2.0 91407ea

README: Fix ASCII diagram alignment with standard characters 7101ea1

README: Highlight v2.0 Semantic Reasoning (Judge Step 1 Complete) 8ddae74

Semantic observation prompts — rich text replaces raw floats (judge feedback #1) 213c699

Add compulsory Google Colab link to README ee53741

Update submission blog link to compulsory HF URL ca3c3ed

Final tweaks to server and requirements d5dc8ac

Update requirements-space.txt 7164a8e

Final manual tweaks to server imports 5b8d71e

Real ML Demo + Stability Fixes 8489eaa

Final Hackathon Polish: Left-side dashboard layout, SVG connection scaling, and 0.4s simulation speedup 5686d79

Hard Forced FP16 precision patch with surgical head casting and stability limits c922be6

Apply aggressive GRPO stability patch: LR=1e-5, len=64, norm=0.5, BFloat16 c7fb92b

Update README deliverables with High-Performance training link 67590a7

Disable trainer-level fp16 scaling to resolve GradScaler unscale crash 450aae5

Global Float16 precision patch: align BitsAndBytes, model loading, and GRPOTrainer args 0401fe0

Surgical precision fix: explicitly cast lm_head and embed_tokens to compute_dtype d6d5c2e

Fix precision mismatch during generate() by casting model to compute dtype 471d5b7

Patch warnings_issued attribute to fix TRL crash 13ae551

Patch missing TRANSFORMERS_CACHE variable for llm_blender compatibility 2e54203

Refactor train_hf.py to use pure PEFT/Transformers to avoid Unsloth precision bugs 6cb169b

Fix LoRA dtype mismatch by explicitly coercing to bfloat16 d3f82d2

Fix Unsloth chunks crash by setting mini batch to 64 326c883

Fix Unsloth GRPOConfig compilation attribute name fa91384

Fix total_memory attribute typo f731267

Add standalone HF Jobs GRPO training script (500 steps, 8 gens, sample output logging) 5c3b197

Rename WRITEUP.md to blog.md and fix README link 1d0aea6

Remove broken auto-generated plots in favor of live Kaggle training evidence d9a6265

Update README with requirements alignment, Colab/Kaggle links, and live Kaggle training evidence a45e838

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat 48b2f2f

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle 96fac64

Inject unsloth_logit_chunk_multiplier into GRPOConfig 6fe98d9

Inject all expected Unsloth GRPO args 707a332

Inject unsloth_num_chunks into GRPOConfig da2eb01

Remove unsupported kwargs from GRPOConfig 2f2fb3e

Disable PatchFastRL to fix Colab OSError ba90699

Fix openenv dependency name in notebook 2ed1d89

Fix notebook dependencies and CUDA attribute b22b8d5

Add Colab GRPO training notebook 117a7c7

Add pyarrow to notebook dependencies to fix colab mismatch a90f241

Fix openenv missing from notebook 7331420

Update latest changes aec0295

Remove blog_temp.md from repo

5172f48

Remove GitHub mention from blog per HF submission requirements

3d75bac

Clean up unused hackathon markdown files and update setup script link

84ccd7d

Update blog with temp blog content

9a6d252

Refine blog.md to improve credibility and tone based on feedback

c66becb

Add final blog post detailing from numeric to semantic reasoning and personal origin

577da98

README: Update final model repository link

29bde0e

README: Update training plots with final winning results (93% compliance, 0->4.5 reward)

2705495

README: Final results update — 93% risk compliance, 88% governance, semantic reasoning v2.0

91407ea

README: Fix ASCII diagram alignment with standard characters

7101ea1

README: Highlight v2.0 Semantic Reasoning (Judge Step 1 Complete)

8ddae74

Semantic observation prompts — rich text replaces raw floats (judge feedback #1)

213c699

Add compulsory Google Colab link to README

ee53741

Update submission blog link to compulsory HF URL

ca3c3ed

Final tweaks to server and requirements

d5dc8ac

Update requirements-space.txt

7164a8e

Final manual tweaks to server imports

5b8d71e

Real ML Demo + Stability Fixes

8489eaa

Final Hackathon Polish: Left-side dashboard layout, SVG connection scaling, and 0.4s simulation speedup

5686d79

Hard Forced FP16 precision patch with surgical head casting and stability limits

c922be6

Apply aggressive GRPO stability patch: LR=1e-5, len=64, norm=0.5, BFloat16

c7fb92b

Update README deliverables with High-Performance training link

67590a7

Disable trainer-level fp16 scaling to resolve GradScaler unscale crash

450aae5

Global Float16 precision patch: align BitsAndBytes, model loading, and GRPOTrainer args

0401fe0

Surgical precision fix: explicitly cast lm_head and embed_tokens to compute_dtype

d6d5c2e

Fix precision mismatch during generate() by casting model to compute dtype

471d5b7

Patch warnings_issued attribute to fix TRL crash

13ae551

Patch missing TRANSFORMERS_CACHE variable for llm_blender compatibility

2e54203

Refactor train_hf.py to use pure PEFT/Transformers to avoid Unsloth precision bugs

6cb169b

Fix LoRA dtype mismatch by explicitly coercing to bfloat16

d3f82d2

Fix Unsloth chunks crash by setting mini batch to 64

326c883

Fix Unsloth GRPOConfig compilation attribute name

fa91384

Fix total_memory attribute typo

f731267

Add standalone HF Jobs GRPO training script (500 steps, 8 gens, sample output logging)

5c3b197

Rename WRITEUP.md to blog.md and fix README link

1d0aea6

Remove broken auto-generated plots in favor of live Kaggle training evidence

d9a6265

Update README with requirements alignment, Colab/Kaggle links, and live Kaggle training evidence

a45e838

Delete compiled cache + re-inject Unsloth attrs for Kaggle/Colab compat

48b2f2f

Bypass Unsloth GRPO compilation - fix SymFloat crash on Colab/Kaggle

96fac64

Inject unsloth_logit_chunk_multiplier into GRPOConfig

6fe98d9

Inject all expected Unsloth GRPO args

707a332

Inject unsloth_num_chunks into GRPOConfig

da2eb01

Remove unsupported kwargs from GRPOConfig

2f2fb3e

Disable PatchFastRL to fix Colab OSError

ba90699

Fix openenv dependency name in notebook

2ed1d89

Fix notebook dependencies and CUDA attribute

b22b8d5

Add Colab GRPO training notebook

117a7c7

Add pyarrow to notebook dependencies to fix colab mismatch

a90f241

Fix openenv missing from notebook

7331420

Update latest changes

aec0295