Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
openenv-community
/
test-local-nested-envs
like
0
Running
on
T4
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
test-local-nested-envs
562 kB
Ctrl+K
Ctrl+K
4 contributors
History:
67 commits
KarlLearnsAI
Upload minimum_training_script.ipynb
37d5368
verified
2 months ago
assets
Upload architecture.png
2 months ago
layer0
Improve reward function to break refuse-everything local minimum and scale training
2 months ago
layer1
Pre-format SFT dataset as text column, drop formatting_func
2 months ago
layer2
Switch Llama 3.1 8B to ungated unsloth mirror
2 months ago
personas
Clean up dead code, unused imports, and move hardcoded values to config.yaml
2 months ago
scripts
Make Supabase uploads incremental — upload after every step
2 months ago
tests
Improve reward function to break refuse-everything local minimum and scale training
2 months ago
.gitattributes
Safe
60 Bytes
Upload assets/architecture.png with huggingface_hub
2 months ago
.gitignore
Safe
126 Bytes
Implement self-improving AI oversight system with nested RL environments
2 months ago
Dockerfile
Safe
199 Bytes
Add supabase to Dockerfile pip install
2 months ago
README.md
Safe
18.3 kB
Add HF Spaces config metadata to README
2 months ago
app.py
Safe
4.83 kB
Upload app.py with huggingface_hub
2 months ago
config.yaml
Safe
4.31 kB
Increase training scale: more steps, episodes, and SFT epochs
2 months ago
config_loader.py
Safe
5.69 kB
Add SFT warm start before GRPO and DB connectivity init check
2 months ago
minimum_training_script.ipynb
Safe
261 kB
Upload minimum_training_script.ipynb
2 months ago
pyproject.toml
Safe
917 Bytes
Move supabase to core dependencies
2 months ago
train.sh
Safe
1.04 kB
Add train.sh startup script and assets folder
2 months ago