Spaces:

jester1177
/

mutant-hunter-env

Sleeping

App Files Files Community

mutant-hunter-env / scripts

Commit History

Drop GRPO temp to 0.3, bump max_new_tokens to 2048, add inference smoke test

576dfc3

Krishna1107 Claude Opus 4.7 commited on Apr 26

Add inference-time retry loop with feedback for self-correction

10feced

Krishna1107 Claude Opus 4.7 commited on Apr 26

Add in-context demonstration learning support

968797f

Krishna1107 Claude Opus 4.7 commited on Apr 26

Fix make_plots.py to accept --wandb-run-id separately from --training-log-json

640263f

Krishna1107 Claude Opus 4.7 commited on Apr 26

pytorch fixes

fe6913c

Krishna1107 commited on Apr 26

no unsloth

361a1fc

Krishna1107 commited on Apr 26

Retry torch.cuda.is_available() in fresh python procs; NVML ready != cuInit ready

27e2f37

Krishna1107 commited on Apr 26

Rewrite run_hf_job_7b.sh end-to-end: nvidia-smi GPU poll, clean phase structure

8e61ac4

Krishna1107 commited on Apr 26

Drop venv approach; force-reinstall torch stack over base image to actually upgrade past 2.5.1

1110d0a

Krishna1107 commited on Apr 26

Phase 0: build fresh venv + restore CUDA poll + restore upfront huggingface_hub install

c4df898

Krishna1107 commited on Apr 26

Install torchaudio alongside torch/torchvision to clear pinned-base-image conflict

7ab4e76

Krishna1107 commited on Apr 26

Drop version pins on training extras and torch/torchvision install

241a3bd

Krishna1107 commited on Apr 26

Hard reset HF Job env: install torch+torchvision together at fixed versions

bd78955

Krishna1107 commited on Apr 26

Upgrade torchvision/torchaudio alongside torch to fix nms op registration

94a6b52

Krishna1107 commited on Apr 26

Upgrade torch to 2.5+ in Phase 0; pin trl>=0.14 for GRPOTrainer

ce6481e

Krishna1107 commited on Apr 26

Make 7B HF Job Phase 0 resilient: poll for CUDA readiness, install hf_hub up front for trap

27a9b93

Krishna1107 Claude Opus 4.7 commited on Apr 26

Use --env instead of --secrets KEY=VALUE for WANDB_API_KEY

571af06

Krishna1107 Claude Opus 4.7 commited on Apr 26

Fix wandb login in 7B HF Job: verify env var present, use CLI not Python API

cb93805

Krishna1107 Claude Opus 4.7 commited on Apr 26

Add 7B variant of HF Job for stronger cold-start.

f249cf5

Krishna1107 Claude Opus 4.7 commited on Apr 26

Add reduced-scope HF Job variant for time-constrained scenarios

51e8315

Krishna1107 Claude Opus 4.7 commited on Apr 26

Switch HF Job base image to pytorch:2.5.1-cuda12.4 to fix CUDA13 ABI bleed

12ecfb5

Krishna1107 Claude Opus 4.7 commited on Apr 25

Fix torch/torchvision ABI mismatch in HF Job Phase 0

efb52e5

Krishna1107 Claude Opus 4.7 commited on Apr 25

Fix GitHub username in HF Job scripts: jester1177 -> melohub-xbit

082986e

Krishna1107 Claude Opus 4.7 commited on Apr 25

Fix submit_hf_job.sh syntax: image as positional arg

900c737

Krishna1107 Claude Opus 4.7 commited on Apr 25

Pre-submission HF Job patches: Python version compat + skip torch reinstall

b5d30d0

Krishna1107 Claude Opus 4.7 commited on Apr 25

Add HF Job training pipeline: persistence-aware run script, judge-facing demo notebook, baseline JSON output

e01ee6d

Krishna1107 Claude Opus 4.7 commited on Apr 25

Re-apply training prep work on top of validated Layers 1-5 base

1b83cd4

Krishna1107 commited on Apr 25

Initial commit: MutantHunter — RL env for mutation-score-rewarded test generation

91487c9

Krishna1107 commited on Apr 25

Commit History

Drop GRPO temp to 0.3, bump max_new_tokens to 2048, add inference smoke test 576dfc3

Add inference-time retry loop with feedback for self-correction 10feced

Add in-context demonstration learning support 968797f

Fix make_plots.py to accept --wandb-run-id separately from --training-log-json 640263f

pytorch fixes fe6913c

no unsloth 361a1fc

Retry torch.cuda.is_available() in fresh python procs; NVML ready != cuInit ready 27e2f37

Rewrite run_hf_job_7b.sh end-to-end: nvidia-smi GPU poll, clean phase structure 8e61ac4

Drop venv approach; force-reinstall torch stack over base image to actually upgrade past 2.5.1 1110d0a

Phase 0: build fresh venv + restore CUDA poll + restore upfront huggingface_hub install c4df898

Install torchaudio alongside torch/torchvision to clear pinned-base-image conflict 7ab4e76

Drop version pins on training extras and torch/torchvision install 241a3bd

Hard reset HF Job env: install torch+torchvision together at fixed versions bd78955

Upgrade torchvision/torchaudio alongside torch to fix nms op registration 94a6b52

Upgrade torch to 2.5+ in Phase 0; pin trl>=0.14 for GRPOTrainer ce6481e

Make 7B HF Job Phase 0 resilient: poll for CUDA readiness, install hf_hub up front for trap 27a9b93

Use --env instead of --secrets KEY=VALUE for WANDB_API_KEY 571af06

Fix wandb login in 7B HF Job: verify env var present, use CLI not Python API cb93805

Add 7B variant of HF Job for stronger cold-start. f249cf5

Add reduced-scope HF Job variant for time-constrained scenarios 51e8315

Switch HF Job base image to pytorch:2.5.1-cuda12.4 to fix CUDA13 ABI bleed 12ecfb5

Fix torch/torchvision ABI mismatch in HF Job Phase 0 efb52e5

Fix GitHub username in HF Job scripts: jester1177 -> melohub-xbit 082986e

Fix submit_hf_job.sh syntax: image as positional arg 900c737

Pre-submission HF Job patches: Python version compat + skip torch reinstall b5d30d0

Add HF Job training pipeline: persistence-aware run script, judge-facing demo notebook, baseline JSON output e01ee6d

Re-apply training prep work on top of validated Layers 1-5 base 1b83cd4

Initial commit: MutantHunter — RL env for mutation-score-rewarded test generation 91487c9

Drop GRPO temp to 0.3, bump max_new_tokens to 2048, add inference smoke test

576dfc3

Add inference-time retry loop with feedback for self-correction

10feced

Add in-context demonstration learning support

968797f

Fix make_plots.py to accept --wandb-run-id separately from --training-log-json

640263f

pytorch fixes

fe6913c

no unsloth

361a1fc

Retry torch.cuda.is_available() in fresh python procs; NVML ready != cuInit ready

27e2f37

Rewrite run_hf_job_7b.sh end-to-end: nvidia-smi GPU poll, clean phase structure

8e61ac4

Drop venv approach; force-reinstall torch stack over base image to actually upgrade past 2.5.1

1110d0a

Phase 0: build fresh venv + restore CUDA poll + restore upfront huggingface_hub install

c4df898

Install torchaudio alongside torch/torchvision to clear pinned-base-image conflict

7ab4e76

Drop version pins on training extras and torch/torchvision install

241a3bd

Hard reset HF Job env: install torch+torchvision together at fixed versions

bd78955

Upgrade torchvision/torchaudio alongside torch to fix nms op registration

94a6b52

Upgrade torch to 2.5+ in Phase 0; pin trl>=0.14 for GRPOTrainer

ce6481e

Make 7B HF Job Phase 0 resilient: poll for CUDA readiness, install hf_hub up front for trap

27a9b93

Use --env instead of --secrets KEY=VALUE for WANDB_API_KEY

571af06

Fix wandb login in 7B HF Job: verify env var present, use CLI not Python API

cb93805

Add 7B variant of HF Job for stronger cold-start.

f249cf5

Add reduced-scope HF Job variant for time-constrained scenarios

51e8315

Switch HF Job base image to pytorch:2.5.1-cuda12.4 to fix CUDA13 ABI bleed

12ecfb5

Fix torch/torchvision ABI mismatch in HF Job Phase 0

efb52e5

Fix GitHub username in HF Job scripts: jester1177 -> melohub-xbit

082986e

Fix submit_hf_job.sh syntax: image as positional arg

900c737

Pre-submission HF Job patches: Python version compat + skip torch reinstall

b5d30d0

Add HF Job training pipeline: persistence-aware run script, judge-facing demo notebook, baseline JSON output

e01ee6d

Re-apply training prep work on top of validated Layers 1-5 base

1b83cd4

Initial commit: MutantHunter — RL env for mutation-score-rewarded test generation

91487c9