Commit History

Deploy selfplay trainer from b3202c405e19f6b4f9defecc748ed0358111eef2
2bd30a8
verified

Jyo-K committed on

Deploy selfplay trainer from 9062ef625693685dfe748a2444f2c6baea9eb8ea
c79f97c
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
391e42c
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
4f609e4
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
1bcd53c
verified

Jyo-K committed on

Deploy selfplay trainer from 312af1c2877300e47381e4ca80a195f5906fe623
e754391
verified

Jyo-K committed on

Deploy selfplay trainer from e7d94bf79fc5c66fbd4b3b11607712a4c86ef753
7f23d42
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
bd368b1
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
8c8d67a
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
6e27d58
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
24ae974
verified

Jyo-K committed on

Deploy selfplay trainer from d3356ff330ec4809bb60f3fb8e35595b8e2ea282
8f825d7
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
79be77c
verified

Jyo-K committed on

Local sync: darkguard-selfplay-trainer
7b0e4c3
verified

Jyo-K committed on

Deploy selfplay trainer from 18ea3b212e9bcaa7f61732b6d91eca80594678c4
f58b43e
verified

Jyo-K committed on

Sync darkguard-selfplay-trainer from RL_Env (curriculum + fixes)
9998abe
verified

Jyo-K committed on

Rebalance designer challenge_delta toward challenging but solvable
21288b2
verified

Jyo-K committed on

Avoid consumer penalty double-counting in reward routing
e253d33
verified

Jyo-K committed on

Remove deprecated Blocks theme arg for Gradio 6
5592ba2
verified

Jyo-K committed on

Remove deprecated wandb start_method setting
e7f6dec
verified

Jyo-K committed on

Wire router toggle into policy runtime
68faf45
verified

Jyo-K committed on

Disable heavy local router by default and clean generation args
b455c9c
verified

Jyo-K committed on

Expose local action router checkbox in GUI
e92422b
verified

Jyo-K committed on

Add local action router toggle default off
fd1b3f2
verified

Jyo-K committed on

Set default base models to unsloth Qwen3 FP8
694ce46
verified

Jyo-K committed on

Add local Qwen-based action routing layer with safe fallback
d90afca
verified

Jyo-K committed on

Wire env retry/pacing config into trainer client
9cff197
verified

Jyo-K committed on

Add connection retry and pacing config defaults
8a8a554
verified

Jyo-K committed on

Add rate-limit pacing and retry backoff for env requests
35c13c0
verified

Jyo-K committed on

Preserve secret tokens when GUI fields blank and auto-enable W&B
52b208e
verified

Jyo-K committed on

Improve W&B auth fallback and init reliability
08eb5b4
verified

Jyo-K committed on

Read HF/W&B tokens from Space secrets by default
70c6802
verified

Jyo-K committed on

Log rollout warnings instead of fatal crash
b9dabfd
verified

Jyo-K committed on

Handle reset/step failures without crashing trainer
159a2e1
verified

Jyo-K committed on

Improve remote env HTTP error diagnostics
21f7f3e
verified

Jyo-K committed on

Fix connection/start callback errors with safe parsing
8c096ef
verified

Jyo-K committed on

Always clear running state via trainer finally block
5155724
verified

Jyo-K committed on

Recover stale running state on Start/Stop
f0870ce
verified

Jyo-K committed on

Improve stop button UX when idle
2195bee
verified

Jyo-K committed on

Allow holdout evaluation to stop early
f05f776
verified

Jyo-K committed on

Fix stop button responsiveness in phase loops
f5eab46
verified

Jyo-K committed on

Update GUI default model to unsloth Qwen3 4B FP8
729f398
verified

Jyo-K committed on

Set default base models to unsloth Qwen3 4B FP8
31bbeb9
verified

Jyo-K committed on

Initial self-play trainer Space sync
6ff67e5
verified

Jyo-K committed on