Spaces:

openenv-community
/

test-local-nested-envs

Sleeping

App Files Files Community

test-local-nested-envs

562 kB

Ctrl+K

Ctrl+K

4 contributors

History: 67 commits

KarlLearnsAI's picture

Upload minimum_training_script.ipynb

37d5368 verified 4 months ago

assets
Upload architecture.png 4 months ago
layer0
Improve reward function to break refuse-everything local minimum and scale training 4 months ago
layer1
Pre-format SFT dataset as text column, drop formatting_func 4 months ago
layer2
Switch Llama 3.1 8B to ungated unsloth mirror 4 months ago
personas
Clean up dead code, unused imports, and move hardcoded values to config.yaml 4 months ago
scripts
Make Supabase uploads incremental — upload after every step 4 months ago
tests
Improve reward function to break refuse-everything local minimum and scale training 4 months ago
.gitattributes

60 Bytes
Upload assets/architecture.png with huggingface_hub 4 months ago
.gitignore

126 Bytes
Implement self-improving AI oversight system with nested RL environments 4 months ago
Dockerfile

199 Bytes
Add supabase to Dockerfile pip install 4 months ago
README.md

18.3 kB
Add HF Spaces config metadata to README 4 months ago
app.py

4.83 kB
Upload app.py with huggingface_hub 4 months ago
config.yaml

4.31 kB
Increase training scale: more steps, episodes, and SFT epochs 4 months ago
config_loader.py

5.69 kB
Add SFT warm start before GRPO and DB connectivity init check 4 months ago
minimum_training_script.ipynb

261 kB
Upload minimum_training_script.ipynb 4 months ago
pyproject.toml

917 Bytes
Move supabase to core dependencies 4 months ago
train.sh

1.04 kB
Add train.sh startup script and assets folder 4 months ago