I used PGX and MCTX to train AlphaZero on Kuhn Poker. It ran on a TPU v2-8(courtesy of the TPU Research Cloud Program) for ~3.5 days. Code can be found here.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support