Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
jester1177
/
mutant-hunter-env
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
mutant-hunter-env
/
training
89 kB
Ctrl+K
Ctrl+K
3 contributors
History:
10 commits
Krishna1107
Drop GRPO temp to 0.3, bump max_new_tokens to 2048, add inference smoke test
576dfc3
about 1 month ago
data
Add in-context demonstration learning support
about 1 month ago
__init__.py
Safe
41 Bytes
Initial commit: MutantHunter — RL env for mutation-score-rewarded test generation
about 1 month ago
baseline_eval.py
Safe
6.53 kB
Add HF Job training pipeline: persistence-aware run script, judge-facing demo notebook, baseline JSON output
about 1 month ago
mine_demonstrations.py
19.6 kB
Add in-context demonstration learning support
about 1 month ago
prompts.py
13.6 kB
Prompt fix: include full module source + grounding rule + corpus example; skip baseline recompute when cached
about 1 month ago
smoke_grpo_inference.py
8.48 kB
Drop GRPO temp to 0.3, bump max_new_tokens to 2048, add inference smoke test
about 1 month ago
smoke_reward_fn.py
3.55 kB
Fix GRPO reward routing: correct seed lookup + markdown fence stripping
about 1 month ago
train_grpo.ipynb
7.94 kB
Fix verification gaps: add README links, rename blog, fix Colab badge, set author
about 1 month ago
train_grpo.py
16.4 kB
Drop GRPO temp to 0.3, bump max_new_tokens to 2048, add inference smoke test
about 1 month ago