Spaces:

md896
/

sql-debug-env

Running

App Files Files Community

sql-debug-env

440 kB

Ctrl+K

1 contributor

History: 61 commits

md896

Simplify HF training stack: remove unsloth/vllm path, use plain transformers AutoModel + single OpenEnv reward.

e5262a1 27 days ago

archive
Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment 27 days ago
docs
Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment 27 days ago
scripts
Initial OpenEnv SQL debug environment about 2 months ago
server
Make OpenEnv training+API judge-proof 27 days ago
skills
Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment 27 days ago
tests
Make OpenEnv training+API judge-proof 27 days ago
.dockerignore

130 Bytes
Initial OpenEnv SQL debug environment about 2 months ago
.env.example

124 Bytes
Initial OpenEnv SQL debug environment about 2 months ago
.gitattributes

1.52 kB
initial commit about 2 months ago
.gitignore

411 Bytes
Make OpenEnv training+API judge-proof 27 days ago
Dockerfile

762 Bytes
Initial OpenEnv SQL debug environment about 2 months ago
README.md

6.34 kB
Make OpenEnv training+API judge-proof 27 days ago
colab_pro_training.py

7.3 kB
Deploy: SOTA RL Cartesian Task and Unsloth Scripts 27 days ago
colab_real_world.py

3.88 kB
Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment 27 days ago
inference.py

9.6 kB
Make OpenEnv training+API judge-proof 27 days ago
launch_job.py

3.18 kB
Fix HF Jobs bootstrap (pin transformers/trl, drop torchao stack); add reward and trainer JSONL logging; stabilize launch_job. 27 days ago
openenv.yaml

3.11 kB
Make OpenEnv training+API judge-proof 27 days ago
presentation_graphs.py

3.75 kB
Deploy: SOTA RL Cartesian Task and Unsloth Scripts 27 days ago
pyproject.toml

428 Bytes
Deploy: SOTA RL Cartesian Task and Unsloth Scripts 27 days ago
requirements.txt

132 Bytes
Deploy: SOTA RL Cartesian Task and Unsloth Scripts 27 days ago
spider_chart.py

1.37 kB
Deploy: SOTA RL Cartesian Task and Unsloth Scripts 27 days ago
ultimate_benchmark.py

3.32 kB
Deploy: SOTA RL Cartesian Task and Unsloth Scripts 27 days ago
ultimate_sota_training.py

18.1 kB
Simplify HF training stack: remove unsloth/vllm path, use plain transformers AutoModel + single OpenEnv reward. 27 days ago
uv.lock

237 kB
Initial OpenEnv SQL debug environment about 2 months ago