Spaces: junaid0600/sql-db-engineer-agent

Commit History

Update env/environment.py

2acdf7a
verified

Soham105 committed on Apr 25

Update training/colab_notebook.py

e5aa2dc
verified

junaid0600 committed on Apr 25

Update env/environment.py

cf6d807
verified

junaid0600 committed on Apr 25

Update training/train_agent.py

44e9354
verified

junaid0600 committed on Apr 25

Update training/train_agent.py

86abfc1
verified

junaid0600 committed on Apr 25

Update training/train_agent.py

987f2db
verified

junaid0600 committed on Apr 25

Upload reward_curve.png

a7802a8
verified

junaid0600 committed on Apr 23

Add Colab training notebook link

9b80a84

junaid0600 committed on Apr 23

changes evalute_agent

15e9605

junaid0600 committed on Apr 23

updated readme

b96d42b

junaid0600 committed on Apr 22

corrected

8778707

junaid0600 committed on Apr 22

Reward curve: strategic +36.7pts vs random +0.0pts

842e560

junaid0600 committed on Apr 22

changes

b2742eb

junaid0600 committed on Apr 21

updated requirements

e4126f3

junaid0600 committed on Apr 21

updated readme

f004baa

junaid0600 committed on Apr 21

pyproject.toml and readme updated

809345d

junaid0600 committed on Apr 21

Final Round 2: all checks passing, openenv validate OK

f30d05a

junaid0600 committed on Apr 21

Fix gitignore - exclude pycache

a28e8c9

junaid0600 committed on Apr 20

Force add all env, dataset, api files - gitignore fix

399f4c5

junaid0600 committed on Apr 20

Add db_simulator and scenario files

ff10d5b

junaid0600 committed on Apr 20

Round 2: SQL Database Engineer Agent - 24/24 tests passing

8cb206e

junaid0600 committed on Apr 20

changed

1eef47f

junaid0600 committed on Apr 10

changed

deb9d37

junaid0600 committed on Apr 10

changed

f5f1b7a

junaid0600 committed on Apr 10

changed

ea504bf

junaid0600 committed on Apr 10

changed

dcaf698

junaid0600 committed on Apr 10

changed

f23139f

junaid0600 committed on Apr 10

Use real LLM call for proxy check + baseline scores for task validation

5e3e79e

junaid0600 committed on Apr 10

Use real LLM calls through API_BASE_URL proxy

d8cba4f

junaid0600 committed on Apr 10

Clean inference.py using baseline scores strictly between 0 and 1

b02ec3c

junaid0600 committed on Apr 10

Normalize all rewards to strictly (0.001, 0.999) range in step()

42a1cbd

junaid0600 committed on Apr 10

Fix GraderResponse schema example score from 1 to 0.75

1a89fae

junaid0600 committed on Apr 10

Clamp all reward scores strictly between 0.001 and 0.999

ef20791

junaid0600 committed on Apr 10

Clamp grader scores strictly between 0.001 and 0.999 in endpoint and model

f2d88cb

junaid0600 committed on Apr 10

Fix rewards never exactly 0.0 or 1.0 using proper normalization

7dff36b

junaid0600 committed on Apr 10

Clamp all step rewards strictly between 0.001 and 0.999

11dd1d6

junaid0600 committed on Apr 10

Fix score - shift rewards to positive range, never 0.0 or 1.0

888871f

junaid0600 committed on Apr 9

Ensure score never below 0.1 to fix out of range error

e15627e

junaid0600 committed on Apr 9

Fix score strictly between 0.001 and 0.999 - never 0.0 or 1.0

6e703c0

junaid0600 committed on Apr 9

corrected everything

2146d9e

junaid0600 committed on Apr 9

again fixed graders

d4b572f

junaid0600 committed on Apr 9

fixed

59746b9

junaid0600 committed on Apr 9

again changed

95c7542

junaid0600 committed on Apr 9

new fixed

b4cf41e

junaid0600 committed on Apr 9

done

95b11e6

junaid0600 committed on Apr 9

fixed

49552e4

junaid0600 committed on Apr 9

done

265556a

junaid0600 committed on Apr 9

done

3db1e7f

junaid0600 committed on Apr 9

fixed new error

0ac8fe8

junaid0600 committed on Apr 9

fixed

81abf49

junaid0600 committed on Apr 9