File size: 1,914 Bytes
f619940 aa28c6b | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 | ---
license: unknown
---
# Intro
It's RL (Reinforcement Learning) DQN (Deep Q-Learning) model for DOOH DSP Bidder problem.
The model should respect 4 rules:
- even pacing over time
- desired publishers distribution (which can be different from publishers distribution in raw bid requests flow).
- desired venue types distribution (which can be different from venue types distribution in raw bid requests flow).
- desired household sizes distribution (which can be different from household sizes distribution in raw bid requests flow).
# Requirements.txt
```
torch==2.10.0
matplotlib==3.10.8
ipython==8.0.0
torchrl==0.11.1
tensordict==0.11.0
numpy==2.4.2
pandas==2.3.3
```
# Training process

# Data flow

# Python all-in-one files
- [dsp_bidder_4_training.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/dsp_bidder_4_training.py) - training
- [dsp_bidder_4_inference.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/dsp_bidder_4_inference.py) - testing
# Python more clean structured files
- [p410_environment.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/p410_environment.py) - PyTorch RL environment class
- [p420_bid_requests.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/p420_bid_requests.py) - Campaign bid requests flow emulation. 1680 bid requests per 1 week. Number of weeks can be changed.
- [p400_dsp_bidder_4_training.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/p400_dsp_bidder_4_training.py) - PyTorch RL DQN training code
- [p440_bidder_inference.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/p440_bidder_inference.py) - Inference/Testing code
- [p450_functions.py](https://huggingface.co/StanislavKo28/DSP_Bidder_4_rules/blob/main/p450_functions.py) - Auxiliary functions
|