ArbitrAgent / training

Commit History

Update arbitragent Colab GRPO config
16362ff

AbeBhatti commited on

Colab: GRPOConfig bf16/fp16=False, try/except for use_bf16
78c2390

AbeBhatti commited on

fix colab repo url
da7bca3

AbeBhatti commited on

negotiation bluff classifier + message cleaner
6858719

AbeBhatti commited on

Add test selfplay states for HF Spaces
10d346d

AbeBhatti commited on

Add test selfplay states for HF Spaces
906a1b8

AbeBhatti commited on

Clean repo — code only, no weights or training data
b4e7ad1

AbeBhatti commited on

Add all code, exclude large model weights
6017516

AbeBhatti commited on

Add arbitragent_colab.ipynb — end-to-end Colab notebook for hackathon
d9a59d1

AbeBhatti commited on

Initial commit: ArbitrAgent with README, agent loop, envs, demo, training
bf0a450

AbeBhatti commited on