Saifoufa commited on
Commit
196b68f
·
verified ·
1 Parent(s): a4fd33e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -1
README.md CHANGED
@@ -10,4 +10,33 @@ pinned: false
10
  short_description: Compares the effectiveness of slogans generated using RL.
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  short_description: Compares the effectiveness of slogans generated using RL.
11
  ---
12
 
13
+ # Gym Campaign Slogan Comparator
14
+
15
+ This application compares the effectiveness of slogans generated by two models for a gym campaign. Each model generates slogans along with a predicted effectiveness score. The app calculates the success rate of each model by comparing their slogans pairwise.
16
+
17
+ ## Features
18
+
19
+ - Compare two lists of slogans with their effectiveness scores.
20
+ - Determine which model performs better overall.
21
+ - Visualize the success rate of each model in a simple interface.
22
+ - Shareable Gradio app for easy use.
23
+
24
+ ## Inputs
25
+
26
+ - **Model X Slogans:** A list of slogans with scores (format: `slogan,score` per line).
27
+ - **Model Y Slogans:** Same format for the second model.
28
+
29
+ ## Outputs
30
+
31
+ - **Model X Success Rate:** Percentage of pairwise wins for Model X.
32
+ - **Model Y Success Rate:** Percentage of pairwise wins for Model Y.
33
+
34
+ ## Reinforcement Learning Context
35
+
36
+ Reinforcement Learning (RL) can be applied to optimize slogan generation over time. In this app:
37
+
38
+ - Each slogan generation by the models is considered an **action**.
39
+ - The effectiveness score acts as a **reward signal**.
40
+ - By comparing slogans and updating the model to maximize wins (success rate), RL techniques like policy gradients can be used to improve the quality of slogans over iterations.
41
+
42
+ This app provides a simple interface to visualize these comparisons and can serve as a foundation for RL-based optimization of advertising campaigns.