StanislavKo28 commited on
Commit
a75bcf3
·
verified ·
1 Parent(s): 24acc4c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -3
README.md CHANGED
@@ -1,3 +1,36 @@
1
- ---
2
- license: unknown
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: unknown
3
+ ---
4
+
5
+ # Intro
6
+
7
+ It's RL (Reinforcement Learning) DQN (Deep Q-Learning) model for DOOH DSP Bidder problem.
8
+ The model should respect 3 rules:
9
+ - even pacing over time
10
+ - desired publishers distribution (which can be different from publishers distribution in raw bid requests flow).
11
+ - desired venue types distribution (which can be different from venue types distribution in raw bid requests flow).
12
+
13
+ # Requirements.txt
14
+
15
+ ```
16
+ torch==2.10.0
17
+ matplotlib==3.10.8
18
+ ipython==8.0.0
19
+ torchrl==0.11.1
20
+ tensordict==0.11.0
21
+ numpy==2.4.2
22
+ pandas==2.3.3
23
+ ```
24
+
25
+ # Training process
26
+
27
+ ![alt](training_200_036_250_GOOD_3.png)
28
+
29
+ # Data flow
30
+
31
+ ![alt](bidder_transormer_3_001.png)
32
+
33
+ # Python all-in-one files
34
+
35
+ - [dsp_bidder_3_training.py](https://huggingface.co/StanislavKo28/DSP_Bidder_3_rules/blob/main/dsp_bidder_3_training.py) - training
36
+ - [dsp_bidder_3_inference.py](https://huggingface.co/StanislavKo28/DSP_Bidder_3_rules/blob/main/dsp_bidder_3_inference.py) - testing