Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -25,26 +25,26 @@ The objective was to improve serendipity (Tail Percentage, Low Popularity) while
|
|
| 25 |
- **tailpercentage@10**: 0.0004
|
| 26 |
|
| 27 |
### Best Valid Results (GRPO)
|
| 28 |
-
- **ndcg@10**: 0.
|
| 29 |
-
- **hit@10**: 0.
|
| 30 |
-
- **averagepopularity@10**:
|
| 31 |
-
- **giniindex@10**: 0.
|
| 32 |
-
- **itemcoverage@10**: 0.
|
| 33 |
-
- **shannonentropy@10**: 0.
|
| 34 |
- **tailpercentage@10**: 0.0
|
| 35 |
|
| 36 |
### Test Results (GRPO)
|
| 37 |
-
- **ndcg@10**: 0.
|
| 38 |
-
- **hit@10**: 0.
|
| 39 |
-
- **averagepopularity@10**:
|
| 40 |
-
- **giniindex@10**: 0.
|
| 41 |
-
- **itemcoverage@10**: 0.
|
| 42 |
-
- **shannonentropy@10**: 0.
|
| 43 |
- **tailpercentage@10**: 0.0005
|
| 44 |
|
| 45 |
## RL Hyperparameters
|
| 46 |
-
- **Alpha**: 0.
|
| 47 |
-
- **KL Beta**: 0.
|
| 48 |
- **Group Size**: 16
|
| 49 |
- **Learning Rate**: 5e-05
|
| 50 |
|
|
|
|
| 25 |
- **tailpercentage@10**: 0.0004
|
| 26 |
|
| 27 |
### Best Valid Results (GRPO)
|
| 28 |
+
- **ndcg@10**: 0.0539
|
| 29 |
+
- **hit@10**: 0.1171
|
| 30 |
+
- **averagepopularity@10**: 217.3145
|
| 31 |
+
- **giniindex@10**: 0.9117
|
| 32 |
+
- **itemcoverage@10**: 0.2114
|
| 33 |
+
- **shannonentropy@10**: 0.0222
|
| 34 |
- **tailpercentage@10**: 0.0
|
| 35 |
|
| 36 |
### Test Results (GRPO)
|
| 37 |
+
- **ndcg@10**: 0.049
|
| 38 |
+
- **hit@10**: 0.1044
|
| 39 |
+
- **averagepopularity@10**: 174.0842
|
| 40 |
+
- **giniindex@10**: 0.9072
|
| 41 |
+
- **itemcoverage@10**: 0.2665
|
| 42 |
+
- **shannonentropy@10**: 0.0179
|
| 43 |
- **tailpercentage@10**: 0.0005
|
| 44 |
|
| 45 |
## RL Hyperparameters
|
| 46 |
+
- **Alpha**: 0.1 (Weight for Useful Reward vs Unexpected Reward)
|
| 47 |
+
- **KL Beta**: 0.5
|
| 48 |
- **Group Size**: 16
|
| 49 |
- **Learning Rate**: 5e-05
|
| 50 |
|