Spaces:

ceoavinash
/

codearena-rl

Sleeping

App Files Files Community

codearena-rl

Commit History

Add YouTube video link

e448fed

havinashpatil commited on Apr 26

Force add all results images

ecfc00b

havinashpatil commited on Apr 26

Add all 7 results charts to README and BLOG

b098526

havinashpatil commited on Apr 26

Add detailed results charts to README and BLOG

7f4c57d

havinashpatil commited on Apr 26

Clean notebook outputs and add Colab warning note

5dffd52

havinashpatil commited on Apr 26

Add comprehensive LLM finetuning analysis with 7 visualization graphs

9d429ce

havinashpatil commited on Apr 26

Add blog link to HF discussion post

9143510

havinashpatil commited on Apr 26

Final hackathon submission: polished README + detailed blog writeup

90be6c7

havinashpatil commited on Apr 26

Polish README for Hackathon judging criteria

5e35378

havinashpatil commited on Apr 26

Add Ollama fallback support to ai_fixer

05d943b

havinashpatil commited on Apr 26

Add Hugging Face Inference API fallback for AI fixer

18261cc

havinashpatil commited on Apr 26

Fix builtin syntax fixer for inline one-line headers and validate TGI fixes

0c0a8ff

havinashpatil commited on Apr 26

Fix HF Spaces TGI launch entrypoint to avoid unexpected argument error

27a8fcf

havinashpatil commited on Apr 26

Fix TGI endpoint plumbing and use full TGI runtime image in Dockerfile

a8bc575

havinashpatil commited on Apr 26

Add TGI integration for cloud LLM serving

434afdf

havinashpatil commited on Apr 26

Add AI coding system with local Hugging Face LLM integration

271cc02

havinashpatil commited on Apr 26

Update to Node 20 for better React 19 compatibility

402970c

havinashpatil commited on Apr 26

Fix Dockerfile with multi-stage build for frontend

43ecf30

havinashpatil commited on Apr 26

Update Dockerfile to build frontend for Spaces deployment

da3a03d

havinashpatil commited on Apr 26

Resolve merge conflict and update HF Space URL

99213a1

havinashpatil commited on Apr 26

initial commit

52752ed
verified

ceoavinash commited on Apr 26

Serve React frontend from FastAPI backend for one-click access

62c7e95

havinashpatil commited on Apr 26

Fix emoji in README

034343c

havinashpatil commited on Apr 26

Update README with HF Space metadata

c40cb2b

havinashpatil commited on Apr 26

Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment

03a7eb9

havinashpatil commited on Apr 26

fix: clamp reward to [0.01,0.99] so .2f never rounds to 0.00 or 1.00

59fd9d3

havinashpatil commited on Apr 25

fix: removed invalid openenv-py package from notebook install cell

82e39c9

havinashpatil commited on Apr 25

feat: use m-a-p/Code-Feedback dataset for GRPO training

9204c04

havinashpatil commited on Apr 25

chore: update dependencies and include training results for README

8599a81

havinashpatil commited on Apr 25

docs: Rewrite README for hackathon submission

3f9399a

havinashpatil commited on Apr 25

Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge

a448db8

havinashpatil commited on Apr 25

Upload meta

03defc2

havinashpatil commited on Apr 25

fix: reset task_id parsing, grader tuple crash fallback, and inference score output

646409d

adityanaikhpt commited on Apr 8

fix: exclusive score clamping (0.001, 0.999) for evaluator

2d8d38c

adityanaikhpt commited on Apr 8

fix: clamp scores to strict (0.001, 0.999) range

b3485c6

adityanaikhpt commited on Apr 8

Rewrite inference.py for strict OpenEnv parsing + add httpx

eb60bd2

adityanaikhpt commited on Apr 8

fix: openenv.yaml tasks graders + strict score bounds

9967cb5

adityanaikhpt commited on Apr 8

fix: add safe_reward() clamp at every reward return point

f14f8d9

adityanaikhpt commited on Apr 8

Minimal patch: standalone proxy ping + reward clamped to (0,1)

74bfde0

adityanaikhpt commited on Apr 8

Add normalize_reward(); enforce strict (0,1) on all paths

b53855f

adityanaikhpt commited on Apr 8

Update grading logic to strict (0,1) ratio fallback

78b7e6b

adityanaikhpt commited on Apr 8

fix: use API_BASE_URL/API_KEY for LiteLLM proxy — always make API call (Phase 2)

51fdbe8

adityanaikhpt commited on Apr 8

fix: make inference.py crash-proof when OPENAI_API_KEY is missing (Phase 2)

1fe26af

adityanaikhpt commited on Apr 8

fix: OpenEnv multi-mode compliance — add main() entrypoint and uv.lock

e92bfc1

adityanaikhpt commited on Apr 8

Add pyproject.toml for OpenEnv validation compliance

652f070

adityanaikhpt commited on Apr 8

Production-ready: add server/app.py with fallback-safe /reset, fix Dockerfile, add HF metadata, add task JSON files

dcc8fa3

adityanaikhpt commited on Apr 8

Initial commit

54a19c9

adityanaikhpt commited on Apr 8

Commit History

Add YouTube video link e448fed

Force add all results images ecfc00b

Add all 7 results charts to README and BLOG b098526

Add detailed results charts to README and BLOG 7f4c57d

Clean notebook outputs and add Colab warning note 5dffd52

Add comprehensive LLM finetuning analysis with 7 visualization graphs 9d429ce

Add blog link to HF discussion post 9143510

Final hackathon submission: polished README + detailed blog writeup 90be6c7

Polish README for Hackathon judging criteria 5e35378

Add Ollama fallback support to ai_fixer 05d943b

Add Hugging Face Inference API fallback for AI fixer 18261cc

Fix builtin syntax fixer for inline one-line headers and validate TGI fixes 0c0a8ff

Fix HF Spaces TGI launch entrypoint to avoid unexpected argument error 27a8fcf

Fix TGI endpoint plumbing and use full TGI runtime image in Dockerfile a8bc575

Add TGI integration for cloud LLM serving 434afdf

Add AI coding system with local Hugging Face LLM integration 271cc02

Update to Node 20 for better React 19 compatibility 402970c

Fix Dockerfile with multi-stage build for frontend 43ecf30

Update Dockerfile to build frontend for Spaces deployment da3a03d

Resolve merge conflict and update HF Space URL 99213a1

initial commit 52752ed verified

Serve React frontend from FastAPI backend for one-click access 62c7e95

Fix emoji in README 034343c

Update README with HF Space metadata c40cb2b

Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment 03a7eb9

fix: clamp reward to [0.01,0.99] so .2f never rounds to 0.00 or 1.00 59fd9d3

fix: removed invalid openenv-py package from notebook install cell 82e39c9

feat: use m-a-p/Code-Feedback dataset for GRPO training 9204c04

chore: update dependencies and include training results for README 8599a81

docs: Rewrite README for hackathon submission 3f9399a

Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge a448db8

Upload meta 03defc2

fix: reset task_id parsing, grader tuple crash fallback, and inference score output 646409d

fix: exclusive score clamping (0.001, 0.999) for evaluator 2d8d38c

fix: clamp scores to strict (0.001, 0.999) range b3485c6

Rewrite inference.py for strict OpenEnv parsing + add httpx eb60bd2

fix: openenv.yaml tasks graders + strict score bounds 9967cb5

fix: add safe_reward() clamp at every reward return point f14f8d9

Minimal patch: standalone proxy ping + reward clamped to (0,1) 74bfde0

Add normalize_reward(); enforce strict (0,1) on all paths b53855f

Update grading logic to strict (0,1) ratio fallback 78b7e6b

fix: use API_BASE_URL/API_KEY for LiteLLM proxy — always make API call (Phase 2) 51fdbe8

fix: make inference.py crash-proof when OPENAI_API_KEY is missing (Phase 2) 1fe26af

fix: OpenEnv multi-mode compliance — add main() entrypoint and uv.lock e92bfc1

Add pyproject.toml for OpenEnv validation compliance 652f070

Production-ready: add server/app.py with fallback-safe /reset, fix Dockerfile, add HF metadata, add task JSON files dcc8fa3

Initial commit 54a19c9

Add YouTube video link

e448fed

Force add all results images

ecfc00b

Add all 7 results charts to README and BLOG

b098526

Add detailed results charts to README and BLOG

7f4c57d

Clean notebook outputs and add Colab warning note

5dffd52

Add comprehensive LLM finetuning analysis with 7 visualization graphs

9d429ce

Add blog link to HF discussion post

9143510

Final hackathon submission: polished README + detailed blog writeup

90be6c7

Polish README for Hackathon judging criteria

5e35378

Add Ollama fallback support to ai_fixer

05d943b

Add Hugging Face Inference API fallback for AI fixer

18261cc

Fix builtin syntax fixer for inline one-line headers and validate TGI fixes

0c0a8ff

Fix HF Spaces TGI launch entrypoint to avoid unexpected argument error

27a8fcf

Fix TGI endpoint plumbing and use full TGI runtime image in Dockerfile

a8bc575

Add TGI integration for cloud LLM serving

434afdf

Add AI coding system with local Hugging Face LLM integration

271cc02

Update to Node 20 for better React 19 compatibility

402970c

Fix Dockerfile with multi-stage build for frontend

43ecf30

Update Dockerfile to build frontend for Spaces deployment

da3a03d

Resolve merge conflict and update HF Space URL

99213a1

initial commit

52752ed
verified

Serve React frontend from FastAPI backend for one-click access

62c7e95

Fix emoji in README

034343c

Update README with HF Space metadata

c40cb2b

Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment

03a7eb9

fix: clamp reward to [0.01,0.99] so .2f never rounds to 0.00 or 1.00

59fd9d3

fix: removed invalid openenv-py package from notebook install cell

82e39c9

feat: use m-a-p/Code-Feedback dataset for GRPO training

9204c04

chore: update dependencies and include training results for README

8599a81

docs: Rewrite README for hackathon submission

3f9399a

Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge

a448db8

Upload meta

03defc2

fix: reset task_id parsing, grader tuple crash fallback, and inference score output

646409d

fix: exclusive score clamping (0.001, 0.999) for evaluator

2d8d38c

fix: clamp scores to strict (0.001, 0.999) range

b3485c6

Rewrite inference.py for strict OpenEnv parsing + add httpx

eb60bd2

fix: openenv.yaml tasks graders + strict score bounds

9967cb5

fix: add safe_reward() clamp at every reward return point

f14f8d9

Minimal patch: standalone proxy ping + reward clamped to (0,1)

74bfde0

Add normalize_reward(); enforce strict (0,1) on all paths

b53855f

Update grading logic to strict (0,1) ratio fallback

78b7e6b

fix: use API_BASE_URL/API_KEY for LiteLLM proxy — always make API call (Phase 2)

51fdbe8

fix: make inference.py crash-proof when OPENAI_API_KEY is missing (Phase 2)

1fe26af

fix: OpenEnv multi-mode compliance — add main() entrypoint and uv.lock

e92bfc1

Add pyproject.toml for OpenEnv validation compliance

652f070

Production-ready: add server/app.py with fallback-safe /reset, fix Dockerfile, add HF metadata, add task JSON files

dcc8fa3

Initial commit

54a19c9