Deploy selfplay trainer from b3202c405e19f6b4f9defecc748ed0358111eef2 2bd30a8 verified Jyo-K commited on Apr 26
Deploy selfplay trainer from 9062ef625693685dfe748a2444f2c6baea9eb8ea c79f97c verified Jyo-K commited on Apr 26
Deploy selfplay trainer from 312af1c2877300e47381e4ca80a195f5906fe623 e754391 verified Jyo-K commited on Apr 26
Deploy selfplay trainer from e7d94bf79fc5c66fbd4b3b11607712a4c86ef753 7f23d42 verified Jyo-K commited on Apr 26
Deploy selfplay trainer from d3356ff330ec4809bb60f3fb8e35595b8e2ea282 8f825d7 verified Jyo-K commited on Apr 25
Deploy selfplay trainer from 18ea3b212e9bcaa7f61732b6d91eca80594678c4 f58b43e verified Jyo-K commited on Apr 25
Sync darkguard-selfplay-trainer from RL_Env (curriculum + fixes) 9998abe verified Jyo-K commited on Apr 25
Rebalance designer challenge_delta toward challenging but solvable 21288b2 verified Jyo-K commited on Apr 25
Disable heavy local router by default and clean generation args b455c9c verified Jyo-K commited on Apr 25
Add local Qwen-based action routing layer with safe fallback d90afca verified Jyo-K commited on Apr 25
Preserve secret tokens when GUI fields blank and auto-enable W&B 52b208e verified Jyo-K commited on Apr 25