StanislavKo28/DSP_Bidder_2_rules


Intro

This is a reinforcement learning (RL) model, trained with Deep Q-Learning (DQN), for the digital out-of-home (DOOH) demand-side platform (DSP) bidding problem. The model should respect two rules:

even pacing over time
a desired publisher distribution (which can differ from the publisher distribution in the raw bid request flow)
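
To make the two rules concrete, here is a minimal sketch of how deviation from each rule could be measured and combined into a single penalty. It is not taken from the training script: the function names (`pacing_error`, `distribution_error`, `two_rule_reward`), the weights, and the choice of a linear pacing target and total variation distance are all assumptions made for illustration.

```python
def pacing_error(bids_placed, bid_budget, elapsed_fraction):
    """Gap between the share of budget used and the share of time elapsed.

    Assumes "even pacing" means linear spend: after 50% of the campaign,
    roughly 50% of the bid budget should have been used.
    """
    return abs(bids_placed / bid_budget - elapsed_fraction)


def distribution_error(bids_by_publisher, desired_share):
    """Total variation distance between achieved and desired publisher shares."""
    total = sum(bids_by_publisher.values())
    if total == 0:
        return 0.0  # nothing bid yet, so no measurable deviation
    publishers = set(bids_by_publisher) | set(desired_share)
    return 0.5 * sum(
        abs(bids_by_publisher.get(p, 0) / total - desired_share.get(p, 0.0))
        for p in publishers
    )


def two_rule_reward(bids_placed, bid_budget, elapsed_fraction,
                    bids_by_publisher, desired_share,
                    w_pacing=1.0, w_distribution=1.0):
    """Hypothetical reward: negative weighted sum of both deviations."""
    return -(
        w_pacing * pacing_error(bids_placed, bid_budget, elapsed_fraction)
        + w_distribution * distribution_error(bids_by_publisher, desired_share)
    )


# Example: halfway through the campaign, 80 of 100 bids already used,
# all of them on publisher "A" although a 50/50 split was desired.
example = two_rule_reward(
    bids_placed=80, bid_budget=100, elapsed_fraction=0.5,
    bids_by_publisher={"A": 80}, desired_share={"A": 0.5, "B": 0.5},
)
```

A reward of 0 would mean both rules are met exactly; the more negative the value, the further the bidder has drifted from either target. The weights let one rule be prioritized over the other.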

Requirements.txt

torch==2.10.0
matplotlib==3.10.8
ipython==8.0.0
torchrl==0.11.1
tensordict==0.11.0
numpy==2.4.2
pandas==2.3.3

Training process

Data flow

Python all-in-one files

dsp_bidder_2_training.py - training
dsp_bidder_2_inference.py - testing
