sakha / src

Commit History

final push
5a8662c
unverified

atharva-again commited on

feat(grpo): enhance action response parsing by removing reasoning blocks and refining regex handling
5202cdc

Bemohit commited on

feat(grpo): update max completion length and refine prompt handling for improved evaluation
ff1f7a0

Bemohit commited on

feat(grpo): adjust training parameters and disable thinking mode for consistent action calls
8f1e9fc

Bemohit commited on

feat(grpo): enhance training dynamics with new replay policies and update state steps
264ee3d

Bemohit commited on

feat(grpo): update max sequence length and refine prompt formatting in training scripts
79bced7

Bemohit commited on

refactor: remove SakhaEnvWrapper class and streamline reward function in GRPO training script
097c9e4

Bemohit commited on

feat(rubric): integrate SakhaRubric into SakhaEnvironment step/reward path
1bdc498
unverified

atharva-again commited on

feat(rubric): add composable rubric scaffolding wrapping existing reward logic
237c898
unverified

atharva-again commited on

fix(inference): align checklist compliance and structured run logs
284ec94
unverified

atharva-again commited on

feat(formatters): add structured output formatting system
5dd1b3a
unverified

atharva-again commited on

feat(graders): implement new scoring logic
ba99d71
unverified

atharva-again commited on

feat(env): implement workflow-based simulation
0816fd4
unverified

atharva-again commited on

feat(models): added new action types and pending task system
3ab55b0
unverified

atharva-again commited on

fix: heuristic takeover when llm fails, step reward/penalty derived from grader, some other fixes
52b4770
unverified

atharva-again commited on

fix: copy README to Docker, enable web UI, and enhance documentation
30220ed
unverified

atharva-again commited on