Spaces:

md896
/

sql-debug-env

Running

App Files Files Community

sql-debug-env

Commit History

README: add model card highlights section and metadata snapshot.

279d788

Running

md896 commited on 10 days ago

README: add Hugging Face model link to Quick Links.

3d1b780

md896 commited on 10 days ago

Use .git GitHub URL in HTML and Gradio links.

72d394b

md896 commited on 10 days ago

Gradio: update W&B line to 'example run' dashboard.

6674354

md896 commited on 10 days ago

Demo copy: drop judges wording; workflow map at a glance for readers.

270cdf0

md896 commited on 10 days ago

Expand problem narrative and Engineering Notes: time-on-SQL, Spider vs prod.

40caa50

md896 commited on 10 days ago

Add GitHub repo link; drop Karpathy, blog, slides, demo placeholders.

d00292e

md896 commited on 10 days ago

HTML demo: fix diagram crop; click-to-zoom lightbox with pan and +/- zoom.

e6d1a8f

md896 commited on 10 days ago

HTML /demo: add Benchmark visuals table and three chart figures.

f5c939b

md896 commited on 10 days ago

Gradio: fix invisible header and body text on HF dark embed.

293388d

md896 commited on 10 days ago

Space home: redirect / to HTML /demo; Gradio at /gradio; fix Gradio hero.

f4ae3f3

md896 commited on 10 days ago

Fix Gradio 5 mount error by moving theme/css to Blocks.

3cc7ab5

md896 commited on 10 days ago

Fix Space pip resolve: fastapi>=0.115.2, python-multipart>=0.0.18, pin gradio 5.50.0.

d9b6c59

md896 commited on 10 days ago

Pin Gradio >=5.7.1 for huggingface_hub 1.x (fixes HfFolder ImportError on Space).

f7153ad

md896 commited on 10 days ago

Point /demo and Gradio at diagram-end-to-end-workflow.png (asset on Hub via Xet).

4c3e70f

md896 commited on 10 days ago

Add workflow diagram via Hub API (Xet)

096e16d
verified

md896 commited on 10 days ago

Restore optimized training diagnostics and reward curve images

35a3454

md896 commited on 10 days ago

Ship polished Space UI with Gradio dashboard and evidence-rich demo.

029f9cf

md896 commited on 10 days ago

Upload artifacts/runs/20260426-064318-sample-rewards-32eval/sample_rewards_final.json with huggingface_hub

4724001
verified

md896 commited on 10 days ago

Upload folder using huggingface_hub

d2042c8
verified

md896 commited on 10 days ago

Add training artifacts 20260426-050707

05bd818
verified

md896 commited on 10 days ago

Upload folder using huggingface_hub

0730521
verified

md896 commited on 11 days ago

Upload folder using huggingface_hub

4ca17fb
verified

md896 commited on 11 days ago

Add training artifacts 20260426-032246

d153683
verified

md896 commited on 11 days ago

Add training artifacts 20260425-202751

471dda7
verified

md896 commited on 11 days ago

Avoid whoami rate-limit during job submission

a1e637f

md896 commited on 11 days ago

Harden HF job token wiring and persist full training outputs

9552aaf

md896 commited on 11 days ago

Fix TRL 0.18 compatibility: remove unsupported generation_kwargs; set safety flags on model.generation_config.

6083a40

md896 commited on 11 days ago

Harden GRPO generation stability on CUDA: bf16 + eager attention + invalid-logit guards.

948530a

md896 commited on 11 days ago

Fix GRPO batch/generation mismatch: auto-adjust num_generations; set launcher default to 2.

af54ccd

md896 commited on 11 days ago

Simplify HF training stack: remove unsloth/vllm path, use plain transformers AutoModel + single OpenEnv reward.

e5262a1

md896 commited on 11 days ago

Fix Unsloth startup: avoid pre-importing trl/transformers; mock vllm as real package modules.

d21de11

md896 commited on 11 days ago

Fix HF job startup: import unsloth first and shim vllm package metadata check.

1fdba13

md896 commited on 11 days ago

Fix HF Job bootstrap: transformers>=4.51 for trl 0.18, datasets<4; simplify to colab-style OpenEnv SQL reward.

ee30276

md896 commited on 11 days ago

Fix HF Jobs bootstrap (pin transformers/trl, drop torchao stack); add reward and trainer JSONL logging; stabilize launch_job.

ceee0e3

md896 commited on 11 days ago

changes in ultimate sota

ac3911c

md896 commited on 11 days ago

Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment

bc20ef9

md896 commited on 11 days ago

Downgrade TRL to 0.22.2 to natively bypass experimental vllm dependencies

2eb9add

md896 commited on 11 days ago

Fix vllm error cleanly by creating fake python module structure

b2ce6c6

md896 commited on 11 days ago

Add vllm to dependencies to fix TRL's hard import requirement

711ae38

md896 commited on 11 days ago

Remove vllm mock to fix importlib find_spec crash in TRL 0.23

97cddc4

md896 commited on 11 days ago

Mock vllm to bypass TRL missing module error

1bc1daa

md896 commited on 11 days ago

Downgrade TRL to <0.24.0 to fix missing dependency chain

397face

md896 commited on 11 days ago

Add TRANSFORMERS_CACHE mock to fix TRL/llm_blender crash

16dd181

md896 commited on 11 days ago

Fix llm-blender ModuleNotFoundError

1cd9ac8

md896 commited on 11 days ago

Ensure TRL GRPO imports by installing mergekit

6d0b5c3

md896 commited on 11 days ago

Prevent torchvision import crashes in HF Jobs

8b3c03a

md896 commited on 11 days ago

Make OpenEnv training+API judge-proof

d061422

md896 commited on 11 days ago

Add --break-system-packages for Ubuntu 24.04 (PEP-668)

830c039

md896 commited on 11 days ago

Fix TRL and Torchao dependency conflicts

d118f9f

md896 commited on 11 days ago

Commit History

README: add model card highlights section and metadata snapshot. 279d788 Running

README: add Hugging Face model link to Quick Links. 3d1b780

Use .git GitHub URL in HTML and Gradio links. 72d394b

Gradio: update W&B line to 'example run' dashboard. 6674354

Demo copy: drop judges wording; workflow map at a glance for readers. 270cdf0

Expand problem narrative and Engineering Notes: time-on-SQL, Spider vs prod. 40caa50

Add GitHub repo link; drop Karpathy, blog, slides, demo placeholders. d00292e

HTML demo: fix diagram crop; click-to-zoom lightbox with pan and +/- zoom. e6d1a8f

HTML /demo: add Benchmark visuals table and three chart figures. f5c939b

Gradio: fix invisible header and body text on HF dark embed. 293388d

Space home: redirect / to HTML /demo; Gradio at /gradio; fix Gradio hero. f4ae3f3

Fix Gradio 5 mount error by moving theme/css to Blocks. 3cc7ab5

Fix Space pip resolve: fastapi>=0.115.2, python-multipart>=0.0.18, pin gradio 5.50.0. d9b6c59

Pin Gradio >=5.7.1 for huggingface_hub 1.x (fixes HfFolder ImportError on Space). f7153ad

Point /demo and Gradio at diagram-end-to-end-workflow.png (asset on Hub via Xet). 4c3e70f

Add workflow diagram via Hub API (Xet) 096e16d verified

Restore optimized training diagnostics and reward curve images 35a3454

Ship polished Space UI with Gradio dashboard and evidence-rich demo. 029f9cf

Upload artifacts/runs/20260426-064318-sample-rewards-32eval/sample_rewards_final.json with huggingface_hub 4724001 verified

Upload folder using huggingface_hub d2042c8 verified

Add training artifacts 20260426-050707 05bd818 verified

Upload folder using huggingface_hub 0730521 verified

Upload folder using huggingface_hub 4ca17fb verified

Add training artifacts 20260426-032246 d153683 verified

Add training artifacts 20260425-202751 471dda7 verified

Avoid whoami rate-limit during job submission a1e637f

Harden HF job token wiring and persist full training outputs 9552aaf

Fix TRL 0.18 compatibility: remove unsupported generation_kwargs; set safety flags on model.generation_config. 6083a40

Harden GRPO generation stability on CUDA: bf16 + eager attention + invalid-logit guards. 948530a

Fix GRPO batch/generation mismatch: auto-adjust num_generations; set launcher default to 2. af54ccd

Simplify HF training stack: remove unsloth/vllm path, use plain transformers AutoModel + single OpenEnv reward. e5262a1

Fix Unsloth startup: avoid pre-importing trl/transformers; mock vllm as real package modules. d21de11

Fix HF job startup: import unsloth first and shim vllm package metadata check. 1fdba13

Fix HF Job bootstrap: transformers>=4.51 for trl 0.18, datasets<4; simplify to colab-style OpenEnv SQL reward. ee30276

Fix HF Jobs bootstrap (pin transformers/trl, drop torchao stack); add reward and trainer JSONL logging; stabilize launch_job. ceee0e3

changes in ultimate sota ac3911c

Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment bc20ef9

Downgrade TRL to 0.22.2 to natively bypass experimental vllm dependencies 2eb9add

Fix vllm error cleanly by creating fake python module structure b2ce6c6

Add vllm to dependencies to fix TRL's hard import requirement 711ae38

Remove vllm mock to fix importlib find_spec crash in TRL 0.23 97cddc4

Mock vllm to bypass TRL missing module error 1bc1daa

Downgrade TRL to <0.24.0 to fix missing dependency chain 397face

Add TRANSFORMERS_CACHE mock to fix TRL/llm_blender crash 16dd181

Fix llm-blender ModuleNotFoundError 1cd9ac8

Ensure TRL GRPO imports by installing mergekit 6d0b5c3

Prevent torchvision import crashes in HF Jobs 8b3c03a

Make OpenEnv training+API judge-proof d061422

Add --break-system-packages for Ubuntu 24.04 (PEP-668) 830c039

Fix TRL and Torchao dependency conflicts d118f9f

README: add model card highlights section and metadata snapshot.

279d788

Running

README: add Hugging Face model link to Quick Links.

3d1b780

Use .git GitHub URL in HTML and Gradio links.

72d394b

Gradio: update W&B line to 'example run' dashboard.

6674354

Demo copy: drop judges wording; workflow map at a glance for readers.

270cdf0

Expand problem narrative and Engineering Notes: time-on-SQL, Spider vs prod.

40caa50

Add GitHub repo link; drop Karpathy, blog, slides, demo placeholders.

d00292e

HTML demo: fix diagram crop; click-to-zoom lightbox with pan and +/- zoom.

e6d1a8f

HTML /demo: add Benchmark visuals table and three chart figures.

f5c939b

Gradio: fix invisible header and body text on HF dark embed.

293388d

Space home: redirect / to HTML /demo; Gradio at /gradio; fix Gradio hero.

f4ae3f3

Fix Gradio 5 mount error by moving theme/css to Blocks.

3cc7ab5

Fix Space pip resolve: fastapi>=0.115.2, python-multipart>=0.0.18, pin gradio 5.50.0.

d9b6c59

Pin Gradio >=5.7.1 for huggingface_hub 1.x (fixes HfFolder ImportError on Space).

f7153ad

Point /demo and Gradio at diagram-end-to-end-workflow.png (asset on Hub via Xet).

4c3e70f

Add workflow diagram via Hub API (Xet)

096e16d
verified

Restore optimized training diagnostics and reward curve images

35a3454

Ship polished Space UI with Gradio dashboard and evidence-rich demo.

029f9cf

Upload artifacts/runs/20260426-064318-sample-rewards-32eval/sample_rewards_final.json with huggingface_hub

4724001
verified

Upload folder using huggingface_hub

d2042c8
verified

Add training artifacts 20260426-050707

05bd818
verified

Upload folder using huggingface_hub

0730521
verified

Upload folder using huggingface_hub

4ca17fb
verified

Add training artifacts 20260426-032246

d153683
verified

Add training artifacts 20260425-202751

471dda7
verified

Avoid whoami rate-limit during job submission

a1e637f

Harden HF job token wiring and persist full training outputs

9552aaf

Fix TRL 0.18 compatibility: remove unsupported generation_kwargs; set safety flags on model.generation_config.

6083a40

Harden GRPO generation stability on CUDA: bf16 + eager attention + invalid-logit guards.

948530a

Fix GRPO batch/generation mismatch: auto-adjust num_generations; set launcher default to 2.

af54ccd

Simplify HF training stack: remove unsloth/vllm path, use plain transformers AutoModel + single OpenEnv reward.

e5262a1

Fix Unsloth startup: avoid pre-importing trl/transformers; mock vllm as real package modules.

d21de11

Fix HF job startup: import unsloth first and shim vllm package metadata check.

1fdba13

Fix HF Job bootstrap: transformers>=4.51 for trl 0.18, datasets<4; simplify to colab-style OpenEnv SQL reward.

ee30276

Fix HF Jobs bootstrap (pin transformers/trl, drop torchao stack); add reward and trainer JSONL logging; stabilize launch_job.

ceee0e3

changes in ultimate sota

ac3911c

Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment

bc20ef9

Downgrade TRL to 0.22.2 to natively bypass experimental vllm dependencies

2eb9add

Fix vllm error cleanly by creating fake python module structure

b2ce6c6

Add vllm to dependencies to fix TRL's hard import requirement

711ae38

Remove vllm mock to fix importlib find_spec crash in TRL 0.23

97cddc4

Mock vllm to bypass TRL missing module error

1bc1daa

Downgrade TRL to <0.24.0 to fix missing dependency chain

397face

Add TRANSFORMERS_CACHE mock to fix TRL/llm_blender crash

16dd181

Fix llm-blender ModuleNotFoundError

1cd9ac8

Ensure TRL GRPO imports by installing mergekit

6d0b5c3

Prevent torchvision import crashes in HF Jobs

8b3c03a

Make OpenEnv training+API judge-proof

d061422

Add --break-system-packages for Ubuntu 24.04 (PEP-668)

830c039

Fix TRL and Torchao dependency conflicts

d118f9f