Spaces:
Running on T4
Running on T4
Commit History
Delete notebooks/train_colab.ipynb 1294051 verified
Upload notebooks/train_colab.ipynb with huggingface_hub 3402f44 verified
Upload architecture.png f676a15 verified
Delete assets/architecture.png 5949c2b verified
Upload app.py with huggingface_hub c7dddaa verified
Upload app.py with huggingface_hub 934b4ac verified
Upload assets/architecture.png with huggingface_hub a0b061b verified
Add training results visualization with reward trend chart 19157df unverified
Claude commited on
Increase training scale: more steps, episodes, and SFT epochs b1685a6 unverified
Claude commited on
Pre-format SFT dataset as text column, drop formatting_func 384df8f unverified
Claude commited on
Fix pickling error in SFT formatting_func closure 20a8ae9 unverified
Claude commited on
Fix SFT formatting_func to return list of strings 591804f unverified
Claude commited on
Fix SFT: set completion_only_loss=False for formatting_func compat 44f7f8c unverified
Claude commited on
Fix SFT warm start: add formatting_func for Unsloth SFTTrainer b8e7dcd unverified
Claude commited on
Cap prompt generation at 512 tokens and add version print ee71a24 unverified
Claude commited on
Add SFT warm start before GRPO and DB connectivity init check c2dc160 unverified
Claude commited on
Merge pull request #13 from KarlLearnsAI/main 0c33e5f unverified
Karl Johannes commited on
Merge pull request #12 from KarlLearnsAI/claude/ai-oversight-system-ThVHS e2260ca unverified
Karl Johannes commited on
Merge pull request #11 from KarlLearnsAI/main 420a464 unverified
Karl Johannes commited on
Move supabase to core dependencies cc9c9d7 unverified
Claude commited on
Add train.sh startup script and assets folder 434c6b1
Fix Gradio launch to bind 0.0.0.0 for HF Spaces faad7f2
Replace app with static architecture overview (no LLM calls on startup) 3502162
Add HF Spaces config metadata to README d08480b
Merge pull request #10 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 24fd771 unverified
Karl Johannes commited on
Switch Llama 3.1 8B to ungated unsloth mirror 6506d63 unverified
Claude commited on
Merge pull request #9 from KarlLearnsAI/claude/ai-oversight-system-ThVHS d494210 unverified
Karl Johannes commited on
Add local model inference backend for Layer 2 10418d0 unverified
Claude commited on
Increase max completion length from 512 to 2048 552e492 unverified
Claude commited on
Add 502/504 and hyphenated Time-out to retry list 4ae001d unverified
Claude commited on
Merge pull request #8 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 4b02447 unverified
Karl Johannes commited on
Add retry with exponential backoff for HF Inference API calls 3b78637 unverified
Claude commited on
Merge pull request #7 from KarlLearnsAI/claude/ai-oversight-system-ThVHS a0f036e unverified
Karl Johannes commited on
Make Supabase uploads incremental — upload after every step 76f180f unverified
Claude commited on
Add supabase to Dockerfile pip install 726152d unverified
Claude commited on
Merge pull request #6 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 7522d91 unverified
Karl Johannes commited on
Add Supabase upload for training results (Storage + DB) 28bcb40 unverified
Claude commited on
Add raw training summary output and adjust training scale 71b0977 unverified
Claude commited on
Improve reward function to break refuse-everything local minimum and scale training bd8220a unverified
Claude commited on
Merge pull request #5 from KarlLearnsAI/claude/ai-oversight-system-ThVHS c74ed51 unverified
Karl Johannes commited on
Add volume verification, fsync, and stdout fallback for training outputs f703ff1 unverified
Claude commited on
Merge pull request #4 from KarlLearnsAI/claude/ai-oversight-system-ThVHS ac22c8b unverified
Karl Johannes commited on
Update output paths to use persistent volume at /workspace/output 46bfd81 unverified
Claude commited on
Clean up dead code, unused imports, and move hardcoded values to config.yaml 3dc48b7 unverified
Claude commited on
Add --llm-agent and other legacy CLI flags for backwards compatibility 03d9529 unverified
Claude commited on
Merge pull request #3 from KarlLearnsAI/claude/ai-oversight-system-ThVHS 97b6de5 unverified
Karl Johannes commited on
Reduce episodes_per_candidate from 5 to 3 006c90d unverified
Claude commited on
Reduce GRPO training params to minimum: 2 candidates, 5 steps, 5 episodes 31b8286 unverified
Claude commited on
Centralize all training params in config.yaml (single source of truth) 4e2b74e unverified
Claude commited on