Commit History

Upload minimum_training_script.ipynb
37d5368
verified

KarlLearnsAI committed on

Delete notebooks/train_colab.ipynb
1294051
verified

KarlLearnsAI committed on

Upload notebooks/train_colab.ipynb with huggingface_hub
3402f44
verified

KarlLearnsAI committed on

Upload architecture.png
f676a15
verified

KarlLearnsAI committed on

Delete assets/architecture.png
5949c2b
verified

KarlLearnsAI committed on

Upload app.py with huggingface_hub
c7dddaa
verified

KarlLearnsAI committed on

Upload app.py with huggingface_hub
934b4ac
verified

KarlLearnsAI committed on

Upload assets/architecture.png with huggingface_hub
a0b061b
verified

KarlLearnsAI committed on

Add training results visualization with reward trend chart
19157df
unverified

Claude committed on

Increase training scale: more steps, episodes, and SFT epochs
b1685a6
unverified

Claude committed on

Pre-format SFT dataset as text column, drop formatting_func
384df8f
unverified

Claude committed on

Fix pickling error in SFT formatting_func closure
20a8ae9
unverified

Claude committed on

Fix SFT formatting_func to return list of strings
591804f
unverified

Claude committed on

Fix SFT: set completion_only_loss=False for formatting_func compat
44f7f8c
unverified

Claude committed on

Fix SFT warm start: add formatting_func for Unsloth SFTTrainer
b8e7dcd
unverified

Claude committed on

Cap prompt generation at 512 tokens and add version print
ee71a24
unverified

Claude committed on

Add SFT warm start before GRPO and DB connectivity init check
c2dc160
unverified

Claude committed on

Merge pull request #13 from KarlLearnsAI/main
0c33e5f
unverified

Karl Johannes committed on

Merge pull request #12 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
e2260ca
unverified

Karl Johannes committed on

Merge pull request #11 from KarlLearnsAI/main
420a464
unverified

Karl Johannes committed on

Move supabase to core dependencies
cc9c9d7
unverified

Claude committed on

Add train.sh startup script and assets folder
434c6b1

KarlLearnsAI and Claude Sonnet 4.6 committed on

Fix Gradio launch to bind 0.0.0.0 for HF Spaces
faad7f2

KarlLearnsAI committed on

Replace app with static architecture overview (no LLM calls on startup)
3502162

KarlLearnsAI committed on

Add HF Spaces config metadata to README
d08480b

KarlLearnsAI committed on

Merge pull request #10 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
24fd771
unverified

Karl Johannes committed on

Switch Llama 3.1 8B to ungated unsloth mirror
6506d63
unverified

Claude committed on

Merge pull request #9 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
d494210
unverified

Karl Johannes committed on

Add local model inference backend for Layer 2
10418d0
unverified

Claude committed on

Increase max completion length from 512 to 2048
552e492
unverified

Claude committed on

Add 502/504 and hyphenated Time-out to retry list
4ae001d
unverified

Claude committed on

Merge pull request #8 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
4b02447
unverified

Karl Johannes committed on

Add retry with exponential backoff for HF Inference API calls
3b78637
unverified

Claude committed on

Merge pull request #7 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
a0f036e
unverified

Karl Johannes committed on

Make Supabase uploads incremental — upload after every step
76f180f
unverified

Claude committed on

Add supabase to Dockerfile pip install
726152d
unverified

Claude committed on

Merge pull request #6 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
7522d91
unverified

Karl Johannes committed on

Add Supabase upload for training results (Storage + DB)
28bcb40
unverified

Claude committed on

Add raw training summary output and adjust training scale
71b0977
unverified

Claude committed on

Improve reward function to break refuse-everything local minimum and scale training
bd8220a
unverified

Claude committed on

Merge pull request #5 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
c74ed51
unverified

Karl Johannes committed on

Add volume verification, fsync, and stdout fallback for training outputs
f703ff1
unverified

Claude committed on

Merge pull request #4 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
ac22c8b
unverified

Karl Johannes committed on

Update output paths to use persistent volume at /workspace/output
46bfd81
unverified

Claude committed on

Clean up dead code, unused imports, and move hardcoded values to config.yaml
3dc48b7
unverified

Claude committed on

Add --llm-agent and other legacy CLI flags for backwards compatibility
03d9529
unverified

Claude committed on

Merge pull request #3 from KarlLearnsAI/claude/ai-oversight-system-ThVHS
97b6de5
unverified

Karl Johannes committed on

Reduce episodes_per_candidate from 5 to 3
006c90d
unverified

Claude committed on

Reduce GRPO training params to minimum: 2 candidates, 5 steps, 5 episodes
31b8286
unverified

Claude committed on

Centralize all training params in config.yaml (single source of truth)
4e2b74e
unverified

Claude committed on