metadata
tags:
  - reinforcement-learning
  - '2048'
  - wandb
library_name: generic

🎮 2048 RL Agent (W&B Serverless)

This agent was trained with W&B Serverless RL on CoreWeave infrastructure. It learned to play 2048 through reinforcement learning.
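For readers unfamiliar with the game, the core 2048 rule the agent has to master can be sketched in a few lines. This is only an illustration of the standard game mechanics. The function names `merge_row_left` and `move_left` are hypothetical and are not taken from this repository's training or evaluation code, which may represent the board differently.

```python
from typing import List


def merge_row_left(row: List[int]) -> List[int]:
    """Slide one row to the left, merging each pair of equal neighbours once."""
    tiles = [v for v in row if v != 0]  # drop empty cells (0 = empty)
    merged: List[int] = []
    i = 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            merged.append(tiles[i] * 2)  # two equal tiles combine into one
            i += 2                       # a merged tile cannot merge again this move
        else:
            merged.append(tiles[i])
            i += 1
    # pad with empty cells so the row keeps its original width
    return merged + [0] * (len(row) - len(merged))


def move_left(board: List[List[int]]) -> List[List[int]]:
    """Apply a 'left' move to every row of the board (hypothetical helper)."""
    return [merge_row_left(r) for r in board]
```

The other three moves can be obtained by reversing or transposing the board before and after calling `move_left`.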

📂 Files

  • adapter_model.safetensors: The trained LoRA weights.
  • evaluation_script.py: Script to evaluate the agent.
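To check what adapter_model.safetensors contains without loading any model, one option is to read only the file header. The sketch below relies on the safetensors file layout (an 8-byte little-endian header length followed by a JSON header) and uses only the Python standard library. The helper name `read_safetensors_header` is hypothetical and is not part of this repository.

```python
import json
import struct
from typing import Dict


def read_safetensors_header(path: str) -> Dict[str, dict]:
    """Return {tensor_name: {"dtype", "shape", "data_offsets"}} from a .safetensors file."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the JSON header size as an unsigned little-endian integer.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    header.pop("__metadata__", None)
    return header


# Example use (assumes the file has been downloaded to the working directory):
# for name, info in read_safetensors_header("adapter_model.safetensors").items():
#     print(name, info["dtype"], info["shape"])
```

Because only the header is read, this works even on machines that lack the libraries needed to run the agent.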

🚀 Usage

Check evaluation_script.py for inference details.