Spaces:

mahithakur
/

PRobe

Runtime error

App Files Files Community

PRobe

Commit History

Fix health check: use Python instead of curl

abb18c6

mahithakur Claude Haiku 4.5 commited on Apr 26

Fix HuggingFace Space deployment: install curl for health checks and increase startup timeout

49487a6

mahithakur Claude Haiku 4.5 commited on Apr 26

Fix healthcheck to use correct port with PORT environment variable

3eeae84

mahithakur commited on Apr 26

Fix root path 404: add redirect from / to /ui/

1ab43d1

mahithakur commited on Apr 26

Fix HuggingFace Space deployment: use same-origin WebSocket and dynamic port

840c18a

mahithakur Claude Haiku 4.5 commited on Apr 26

Fix WebSocket protocol for HTTPS deployment: use wss:// for secure connections

d8bca9a

mahithakur Claude Haiku 4.5 commited on Apr 26

Rewrite README with philosophical, accessible prose for broader audience

7b76f88

mahithakur commited on Apr 26

formated readme

df53ef9

mahithakur commited on Apr 26

Clean up submission links table formatting

b936ff4

mahithakur commited on Apr 26

Update README with HF Space URL and submission links

d69b3cd

mahithakur commited on Apr 26

Remove VS cache files, add to gitignore

b8587e5

mahithakur commited on Apr 26

Add blog post and HF submission checklist for hackathon

6ba15c6

mahithakur commited on Apr 26

Model name update

9d6a95e

mahithakur commited on Apr 26

Updated readme

02fa6e2

mahithakur commited on Apr 26

Document Colab 100-step training summary in README

759305e

mahithakur commited on Apr 25

Readme cleanup

4bb4e67

mahithakur commited on Apr 25

Add 100-step Colab GRPO training results

d25e8b9

mahithakur commited on Apr 25

Fix eval decode prompt length slicing

fd3e88c

mahithakur commited on Apr 25

Fix GRPO reward mapping and evaluation generation

b2874f4

mahithakur commited on Apr 25

Include prompt column for GRPOTrainer

4fb3c37

mahithakur commited on Apr 25

Use finite datasets.Dataset for GRPOTrainer compatibility

3476016

mahithakur commited on Apr 25

Yield tensors from GRPO dataset generator

e817e69

mahithakur commited on Apr 25

Fix GRPO dataloader batching on Kaggle

8568d9f

mahithakur commited on Apr 25

Fix GRPO sample id mapping and Kaggle training setup

2066092

mahithakur commited on Apr 25

Add sample training results and judge report for hackathon submission

95658b4

mahithakur commited on Apr 25

Fix division by zero in final avg improvement calculation

022be04

mahithakur commited on Apr 25

Fix category enum: use lowercase for ProbeAction categories

b91005f

mahithakur commited on Apr 25

Fix eval_report.py: use step() return value and extract reward from observation

b3ab235

mahithakur commited on Apr 25

Remove unsupported data_collator argument from GRPOTrainer

97ddd73

mahithakur commited on Apr 25

Add data collator to GRPOTrainer for on-the-fly prompt tokenization

f2d68bf

mahithakur commited on Apr 25

Revert to raw prompt format—let GRPOTrainer handle tokenization

e49866a

mahithakur commited on Apr 25

Convert tokenized outputs to torch tensors for proper batching

9ba7512

mahithakur commited on Apr 25

Tokenize prompts in dataset generator for GRPOTrainer compatibility

75be7bc

mahithakur commited on Apr 25

Fix prompt format: convert from chat list to string for GRPOTrainer compatibility

77793ce

mahithakur commited on Apr 25

Fix dataset format: only yield 'prompt' field to avoid tensor concatenation errors

80c91ac

mahithakur commited on Apr 25

Remove tokenizer argument from GRPOTrainer—not supported in installed TRL version

4c16e6a

mahithakur commited on Apr 25

Fix path bootstrap in train_grpo.py: use parent.parent to reach project root

bc2ac25

mahithakur commited on Apr 25

Fix GRPO batch size config for Kaggle P100: batch_size=4, grad_accum=2 (global=8, divisible by num_generations=2)

feefe4a

mahithakur commited on Apr 25

Add eval_report.py for before/after training comparison

4e029fe

mahithakur commited on Apr 25

Add pre-training baseline results and graphs

754af78

mahithakur commited on Apr 25

Fix JSON parsing and environment bugs

c22ceaa

mahithakur commited on Apr 25

BlogPost for Huggeing later use

02eeb03

Thakur, Mahipal commited on Apr 24

Updated readme

cd9d2e3

Thakur, Mahipal commited on Apr 24

UI Integration

44bd7bd

Thakur, Mahipal commited on Apr 24

Added Meaning ful comments

4ec7361

Thakur, Mahipal commited on Apr 24

Code improvenets

fa66cd4

Thakur, Mahipal commited on Apr 24

Updated readme file

ab07180

Thakur, Mahipal commited on Apr 24

refactor: remove legacy architecture, promote clean structure to repo root

85fab7b

Thakur, Mahipal commited on Apr 24

Worked on folder structure

bb51474

Thakur, Mahipal commited on Apr 24

Judge matching chnages with name changes

104c835

Thakur, Mahipal commited on Apr 24

Commit History

Fix health check: use Python instead of curl abb18c6

Fix HuggingFace Space deployment: install curl for health checks and increase startup timeout 49487a6

Fix healthcheck to use correct port with PORT environment variable 3eeae84

Fix root path 404: add redirect from / to /ui/ 1ab43d1

Fix HuggingFace Space deployment: use same-origin WebSocket and dynamic port 840c18a

Fix WebSocket protocol for HTTPS deployment: use wss:// for secure connections d8bca9a

Rewrite README with philosophical, accessible prose for broader audience 7b76f88

formated readme df53ef9

Clean up submission links table formatting b936ff4

Update README with HF Space URL and submission links d69b3cd

Remove VS cache files, add to gitignore b8587e5

Add blog post and HF submission checklist for hackathon 6ba15c6

Model name update 9d6a95e

Updated readme 02fa6e2

Document Colab 100-step training summary in README 759305e

Readme cleanup 4bb4e67

Add 100-step Colab GRPO training results d25e8b9

Fix eval decode prompt length slicing fd3e88c

Fix GRPO reward mapping and evaluation generation b2874f4

Include prompt column for GRPOTrainer 4fb3c37

Use finite datasets.Dataset for GRPOTrainer compatibility 3476016

Yield tensors from GRPO dataset generator e817e69

Fix GRPO dataloader batching on Kaggle 8568d9f

Fix GRPO sample id mapping and Kaggle training setup 2066092

Add sample training results and judge report for hackathon submission 95658b4

Fix division by zero in final avg improvement calculation 022be04

Fix category enum: use lowercase for ProbeAction categories b91005f

Fix eval_report.py: use step() return value and extract reward from observation b3ab235

Remove unsupported data_collator argument from GRPOTrainer 97ddd73

Add data collator to GRPOTrainer for on-the-fly prompt tokenization f2d68bf

Revert to raw prompt format—let GRPOTrainer handle tokenization e49866a

Convert tokenized outputs to torch tensors for proper batching 9ba7512

Tokenize prompts in dataset generator for GRPOTrainer compatibility 75be7bc

Fix prompt format: convert from chat list to string for GRPOTrainer compatibility 77793ce

Fix dataset format: only yield 'prompt' field to avoid tensor concatenation errors 80c91ac

Remove tokenizer argument from GRPOTrainer—not supported in installed TRL version 4c16e6a

Fix path bootstrap in train_grpo.py: use parent.parent to reach project root bc2ac25

Fix GRPO batch size config for Kaggle P100: batch_size=4, grad_accum=2 (global=8, divisible by num_generations=2) feefe4a

Add eval_report.py for before/after training comparison 4e029fe

Add pre-training baseline results and graphs 754af78

Fix JSON parsing and environment bugs c22ceaa

BlogPost for Huggeing later use 02eeb03

Updated readme cd9d2e3

UI Integration 44bd7bd

Added Meaning ful comments 4ec7361

Code improvenets fa66cd4

Updated readme file ab07180

refactor: remove legacy architecture, promote clean structure to repo root 85fab7b

Worked on folder structure bb51474

Judge matching chnages with name changes 104c835

Fix health check: use Python instead of curl

abb18c6

Fix HuggingFace Space deployment: install curl for health checks and increase startup timeout

49487a6

Fix healthcheck to use correct port with PORT environment variable

3eeae84

Fix root path 404: add redirect from / to /ui/

1ab43d1

Fix HuggingFace Space deployment: use same-origin WebSocket and dynamic port

840c18a

Fix WebSocket protocol for HTTPS deployment: use wss:// for secure connections

d8bca9a

Rewrite README with philosophical, accessible prose for broader audience

7b76f88

formated readme

df53ef9

Clean up submission links table formatting

b936ff4

Update README with HF Space URL and submission links

d69b3cd

Remove VS cache files, add to gitignore

b8587e5

Add blog post and HF submission checklist for hackathon

6ba15c6

Model name update

9d6a95e

Updated readme

02fa6e2

Document Colab 100-step training summary in README

759305e

Readme cleanup

4bb4e67

Add 100-step Colab GRPO training results

d25e8b9

Fix eval decode prompt length slicing

fd3e88c

Fix GRPO reward mapping and evaluation generation

b2874f4

Include prompt column for GRPOTrainer

4fb3c37

Use finite datasets.Dataset for GRPOTrainer compatibility

3476016

Yield tensors from GRPO dataset generator

e817e69

Fix GRPO dataloader batching on Kaggle

8568d9f

Fix GRPO sample id mapping and Kaggle training setup

2066092

Add sample training results and judge report for hackathon submission

95658b4

Fix division by zero in final avg improvement calculation

022be04

Fix category enum: use lowercase for ProbeAction categories

b91005f

Fix eval_report.py: use step() return value and extract reward from observation

b3ab235

Remove unsupported data_collator argument from GRPOTrainer

97ddd73

Add data collator to GRPOTrainer for on-the-fly prompt tokenization

f2d68bf

Revert to raw prompt format—let GRPOTrainer handle tokenization

e49866a

Convert tokenized outputs to torch tensors for proper batching

9ba7512

Tokenize prompts in dataset generator for GRPOTrainer compatibility

75be7bc

Fix prompt format: convert from chat list to string for GRPOTrainer compatibility

77793ce

Fix dataset format: only yield 'prompt' field to avoid tensor concatenation errors

80c91ac

Remove tokenizer argument from GRPOTrainer—not supported in installed TRL version

4c16e6a

Fix path bootstrap in train_grpo.py: use parent.parent to reach project root

bc2ac25

Fix GRPO batch size config for Kaggle P100: batch_size=4, grad_accum=2 (global=8, divisible by num_generations=2)

feefe4a

Add eval_report.py for before/after training comparison

4e029fe

Add pre-training baseline results and graphs

754af78

Fix JSON parsing and environment bugs

c22ceaa

BlogPost for Huggeing later use

02eeb03

Updated readme

cd9d2e3

UI Integration

44bd7bd

Added Meaning ful comments

4ec7361

Code improvenets

fa66cd4

Updated readme file

ab07180

refactor: remove legacy architecture, promote clean structure to repo root

85fab7b

Worked on folder structure

bb51474

Judge matching chnages with name changes

104c835