Spaces:

openenv-community
/

test-local-nested-envs

Sleeping

App Files Files Community

test-local-nested-envs

Commit History

Upload minimum_training_script.ipynb

37d5368
verified

KarlLearnsAI commited on Mar 8

Delete notebooks/train_colab.ipynb

1294051
verified

KarlLearnsAI commited on Mar 8

Upload notebooks/train_colab.ipynb with huggingface_hub

3402f44
verified

KarlLearnsAI commited on Mar 8

Upload architecture.png

f676a15
verified

KarlLearnsAI commited on Mar 8

Delete assets/architecture.png

5949c2b
verified

KarlLearnsAI commited on Mar 8

Upload app.py with huggingface_hub

c7dddaa
verified

KarlLearnsAI commited on Mar 8

Upload app.py with huggingface_hub

934b4ac
verified

KarlLearnsAI commited on Mar 8

Upload assets/architecture.png with huggingface_hub

a0b061b
verified

KarlLearnsAI commited on Mar 8

Add training results visualization with reward trend chart

19157df
unverified

Claude commited on Mar 8

Increase training scale: more steps, episodes, and SFT epochs

b1685a6
unverified

Claude commited on Mar 8

Pre-format SFT dataset as text column, drop formatting_func

384df8f
unverified

Claude commited on Mar 8

Fix pickling error in SFT formatting_func closure

20a8ae9
unverified

Claude commited on Mar 8

Fix SFT formatting_func to return list of strings

591804f
unverified

Claude commited on Mar 8

Fix SFT: set completion_only_loss=False for formatting_func compat

44f7f8c
unverified

Claude commited on Mar 8

Fix SFT warm start: add formatting_func for Unsloth SFTTrainer

b8e7dcd
unverified

Claude commited on Mar 8

Cap prompt generation at 512 tokens and add version print

ee71a24
unverified

Claude commited on Mar 8

Add SFT warm start before GRPO and DB connectivity init check

c2dc160
unverified

Claude commited on Mar 8

Merge pull request #13 from KarlLearnsAI/main

0c33e5f
unverified

Karl Johannes commited on Mar 8

Merge pull request #12 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

e2260ca
unverified

Karl Johannes commited on Mar 8

Merge pull request #11 from KarlLearnsAI/main

420a464
unverified

Karl Johannes commited on Mar 8

Move supabase to core dependencies

cc9c9d7
unverified

Claude commited on Mar 8

Add train.sh startup script and assets folder

434c6b1

KarlLearnsAI Claude Sonnet 4.6 commited on Mar 8

Fix Gradio launch to bind 0.0.0.0 for HF Spaces

faad7f2

KarlLearnsAI commited on Mar 8

Replace app with static architecture overview (no LLM calls on startup)

3502162

KarlLearnsAI commited on Mar 8

Add HF Spaces config metadata to README

d08480b

KarlLearnsAI commited on Mar 8

Merge pull request #10 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

24fd771
unverified

Karl Johannes commited on Mar 8

Switch Llama 3.1 8B to ungated unsloth mirror

6506d63
unverified

Claude commited on Mar 8

Merge pull request #9 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

d494210
unverified

Karl Johannes commited on Mar 8

Add local model inference backend for Layer 2

10418d0
unverified

Claude commited on Mar 8

Increase max completion length from 512 to 2048

552e492
unverified

Claude commited on Mar 8

Add 502/504 and hyphenated Time-out to retry list

4ae001d
unverified

Claude commited on Mar 8

Merge pull request #8 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

4b02447
unverified

Karl Johannes commited on Mar 8

Add retry with exponential backoff for HF Inference API calls

3b78637
unverified

Claude commited on Mar 8

Merge pull request #7 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

a0f036e
unverified

Karl Johannes commited on Mar 8

Make Supabase uploads incremental — upload after every step

76f180f
unverified

Claude commited on Mar 8

Add supabase to Dockerfile pip install

726152d
unverified

Claude commited on Mar 8

Merge pull request #6 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

7522d91
unverified

Karl Johannes commited on Mar 8

Add Supabase upload for training results (Storage + DB)

28bcb40
unverified

Claude commited on Mar 8

Add raw training summary output and adjust training scale

71b0977
unverified

Claude commited on Mar 8

Improve reward function to break refuse-everything local minimum and scale training

bd8220a
unverified

Claude commited on Mar 8

Merge pull request #5 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

c74ed51
unverified

Karl Johannes commited on Mar 8

Add volume verification, fsync, and stdout fallback for training outputs

f703ff1
unverified

Claude commited on Mar 8

Merge pull request #4 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

ac22c8b
unverified

Karl Johannes commited on Mar 8

Update output paths to use persistent volume at /workspace/output

46bfd81
unverified

Claude commited on Mar 8

Clean up dead code, unused imports, and move hardcoded values to config.yaml

3dc48b7
unverified

Claude commited on Mar 8

Add --llm-agent and other legacy CLI flags for backwards compatibility

03d9529
unverified

Claude commited on Mar 8

Merge pull request #3 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

97b6de5
unverified

Karl Johannes commited on Mar 8

Reduce episodes_per_candidate from 5 to 3

006c90d
unverified

Claude commited on Mar 8

Reduce GRPO training params to minimum: 2 candidates, 5 steps, 5 episodes

31b8286
unverified

Claude commited on Mar 8

Centralize all training params in config.yaml (single source of truth)

4e2b74e
unverified

Claude commited on Mar 8

Commit History

Upload minimum_training_script.ipynb 37d5368 verified

Delete notebooks/train_colab.ipynb 1294051 verified

Upload notebooks/train_colab.ipynb with huggingface_hub 3402f44 verified

Upload architecture.png f676a15 verified

Delete assets/architecture.png 5949c2b verified

Upload app.py with huggingface_hub c7dddaa verified

Upload app.py with huggingface_hub 934b4ac verified

Upload assets/architecture.png with huggingface_hub a0b061b verified

Add training results visualization with reward trend chart 19157df unverified

Increase training scale: more steps, episodes, and SFT epochs b1685a6 unverified

Pre-format SFT dataset as text column, drop formatting_func 384df8f unverified

Fix pickling error in SFT formatting_func closure 20a8ae9 unverified

Fix SFT formatting_func to return list of strings 591804f unverified

Fix SFT: set completion_only_loss=False for formatting_func compat 44f7f8c unverified

Fix SFT warm start: add formatting_func for Unsloth SFTTrainer b8e7dcd unverified

Cap prompt generation at 512 tokens and add version print ee71a24 unverified

Add SFT warm start before GRPO and DB connectivity init check c2dc160 unverified

Merge pull request #13 from KarlLearnsAI/main 0c33e5f unverified

Merge pull request #12 from KarlLearnsAI/claude/ai-oversight-system-ThVHS e2260ca unverified

Merge pull request #11 from KarlLearnsAI/main 420a464 unverified

Move supabase to core dependencies cc9c9d7 unverified

Add train.sh startup script and assets folder 434c6b1

Fix Gradio launch to bind 0.0.0.0 for HF Spaces faad7f2

Replace app with static architecture overview (no LLM calls on startup) 3502162

Add HF Spaces config metadata to README d08480b

Merge pull request #10 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 24fd771 unverified

Switch Llama 3.1 8B to ungated unsloth mirror 6506d63 unverified

Merge pull request #9 from KarlLearnsAI/claude/ai-oversight-system-ThVHS d494210 unverified

Add local model inference backend for Layer 2 10418d0 unverified

Increase max completion length from 512 to 2048 552e492 unverified

Add 502/504 and hyphenated Time-out to retry list 4ae001d unverified

Merge pull request #8 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 4b02447 unverified

Add retry with exponential backoff for HF Inference API calls 3b78637 unverified

Merge pull request #7 from KarlLearnsAI/claude/ai-oversight-system-ThVHS a0f036e unverified

Make Supabase uploads incremental — upload after every step 76f180f unverified

Add supabase to Dockerfile pip install 726152d unverified

Merge pull request #6 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 7522d91 unverified

Add Supabase upload for training results (Storage + DB) 28bcb40 unverified

Add raw training summary output and adjust training scale 71b0977 unverified

Improve reward function to break refuse-everything local minimum and scale training bd8220a unverified

Merge pull request #5 from KarlLearnsAI/claude/ai-oversight-system-ThVHS c74ed51 unverified

Add volume verification, fsync, and stdout fallback for training outputs f703ff1 unverified

Merge pull request #4 from KarlLearnsAI/claude/ai-oversight-system-ThVHS ac22c8b unverified

Update output paths to use persistent volume at /workspace/output 46bfd81 unverified

Clean up dead code, unused imports, and move hardcoded values to config.yaml 3dc48b7 unverified

Add --llm-agent and other legacy CLI flags for backwards compatibility 03d9529 unverified

Merge pull request #3 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 97b6de5 unverified

Reduce episodes_per_candidate from 5 to 3 006c90d unverified

Reduce GRPO training params to minimum: 2 candidates, 5 steps, 5 episodes 31b8286 unverified

Centralize all training params in config.yaml (single source of truth) 4e2b74e unverified

Upload minimum_training_script.ipynb

37d5368
verified

Delete notebooks/train_colab.ipynb

1294051
verified

Upload notebooks/train_colab.ipynb with huggingface_hub

3402f44
verified

Upload architecture.png

f676a15
verified

Delete assets/architecture.png

5949c2b
verified

Upload app.py with huggingface_hub

c7dddaa
verified

Upload app.py with huggingface_hub

934b4ac
verified

Upload assets/architecture.png with huggingface_hub

a0b061b
verified

Add training results visualization with reward trend chart

19157df
unverified

Increase training scale: more steps, episodes, and SFT epochs

b1685a6
unverified

Pre-format SFT dataset as text column, drop formatting_func

384df8f
unverified

Fix pickling error in SFT formatting_func closure

20a8ae9
unverified

Fix SFT formatting_func to return list of strings

591804f
unverified

Fix SFT: set completion_only_loss=False for formatting_func compat

44f7f8c
unverified

Fix SFT warm start: add formatting_func for Unsloth SFTTrainer

b8e7dcd
unverified

Cap prompt generation at 512 tokens and add version print

ee71a24
unverified

Add SFT warm start before GRPO and DB connectivity init check

c2dc160
unverified

Merge pull request #13 from KarlLearnsAI/main

0c33e5f
unverified

Merge pull request #12 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

e2260ca
unverified

Merge pull request #11 from KarlLearnsAI/main

420a464
unverified

Move supabase to core dependencies

cc9c9d7
unverified

Add train.sh startup script and assets folder

434c6b1

Fix Gradio launch to bind 0.0.0.0 for HF Spaces

faad7f2

Replace app with static architecture overview (no LLM calls on startup)

3502162

Add HF Spaces config metadata to README

d08480b

Merge pull request #10 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

24fd771
unverified

Switch Llama 3.1 8B to ungated unsloth mirror

6506d63
unverified

Merge pull request #9 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

d494210
unverified

Add local model inference backend for Layer 2

10418d0
unverified

Increase max completion length from 512 to 2048

552e492
unverified

Add 502/504 and hyphenated Time-out to retry list

4ae001d
unverified

Merge pull request #8 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

4b02447
unverified

Add retry with exponential backoff for HF Inference API calls

3b78637
unverified

Merge pull request #7 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

a0f036e
unverified

Make Supabase uploads incremental — upload after every step

76f180f
unverified

Add supabase to Dockerfile pip install

726152d
unverified

Merge pull request #6 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

7522d91
unverified

Add Supabase upload for training results (Storage + DB)

28bcb40
unverified

Add raw training summary output and adjust training scale

71b0977
unverified

Improve reward function to break refuse-everything local minimum and scale training

bd8220a
unverified

Merge pull request #5 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

c74ed51
unverified

Add volume verification, fsync, and stdout fallback for training outputs

f703ff1
unverified

Merge pull request #4 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

ac22c8b
unverified

Update output paths to use persistent volume at /workspace/output

46bfd81
unverified

Clean up dead code, unused imports, and move hardcoded values to config.yaml

3dc48b7
unverified

Add --llm-agent and other legacy CLI flags for backwards compatibility

03d9529
unverified

Merge pull request #3 from KarlLearnsAI/claude/ai-oversight-system-ThVHS

97b6de5
unverified

Reduce episodes_per_candidate from 5 to 3

006c90d
unverified

Reduce GRPO training params to minimum: 2 candidates, 5 steps, 5 episodes

31b8286
unverified

Centralize all training params in config.yaml (single source of truth)

4e2b74e
unverified