kkknight commited on
Commit
0125fb6
·
verified ·
1 Parent(s): ae3b485

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +125 -1
README.md CHANGED
@@ -12,4 +12,128 @@ tags:
12
  - llm
13
  - large-language-model
14
  - recommender-system
15
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  - llm
13
  - large-language-model
14
  - recommender-system
15
+ ---
16
+
17
+ # 🌟 MiniOneRec · Generative Recommender Checkpoints
18
+
19
+ <p align="center">
20
+ <img src="https://raw.githubusercontent.com/AkaliKong/MiniOneRec/main/assets/logo.png" width="450"/>
21
+ </p>
22
+
23
+ **MiniOneRec** is the *first fully-open generative recommendation framework*.
24
+ It provides an end-to-end workflow covering **Semantic-ID (SID) construction**, **Supervised Fine-Tuning (SFT)** and **Recommendation-oriented Reinforcement Learning (RL)**.
25
+
26
+ These checkpoints accompany the paper:
27
+
28
+ > **MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation**
29
+ > <a href="https://arxiv.org/abs/2510.24431">📄 Technical Report</a> |📦 <a href="https://github.com/AkaliKong/MiniOneRec"> Github</a>|<a href="https://modelscope.cn/models/k925238839/MiniOneRec">🤖 Modelscope</a>
30
+
31
+ ---
32
+
33
+ ## 🗺️ Table of Contents
34
+ 1. Model Summary
35
+ 2. Intended Uses & Limitations
36
+ 3. Quick Start
37
+ 4. Training & Evaluation Details
38
+ 5. Citation
39
+ 6. Acknowledgements
40
+
41
+ ---
42
+
43
+ ## 1️⃣ Model Summary
44
+
45
+ MiniOneRec rewrites every catalogue item into a discrete **SID token**:
46
+
47
+ 1. **Text Encoder** (frozen PLM) →
48
+ 2. **3-level Residual Quantisation (RQ-VAE / RQ-KMeans)** → SID.
49
+
50
+ User history ≙ SID sequence.
51
+ Training pipeline:
52
+
53
+ |
54
+ Stage
55
+ |
56
+ Objective
57
+ |
58
+ Notes
59
+ |
60
+ |-------|-----------|-------|
61
+ | **SFT** | Next-SID prediction + language alignment | inherits world knowledge while grounding in item space |
62
+ | **RL (GRPO)** | KL-regularised policy optimisation | constrained beam search over the closed SID set |
63
+
64
+ ### Released checkpoints (examples)
65
+
66
+ |
67
+ Checkpoint
68
+ |
69
+ Base LLM
70
+ |
71
+ #
72
+ Params
73
+ |
74
+ Precision
75
+ |
76
+ Stage
77
+ |
78
+ |-------------------------------------|---------------------|---------|-----------|-----------|
79
+ | `MiniOneRec-SFT-industrial` | Qwen-7B | 7 B | bf16 | SFT |
80
+ | `MiniOneRec-RL-industrial` | Qwen-7B | 7 B | bf16 | SFT+RL |
81
+
82
+ *(Replace with the exact repo names you upload.)*
83
+
84
+ ---
85
+
86
+ ## 2️⃣ Intended Uses & Limitations
87
+
88
+ ### ✅ Intended
89
+
90
+ * Next-item prediction in e-commerce / content platforms.
91
+ * Research on generative recommendation and RL-from-human-feedback variants.
92
+
93
+ ### ❌ Out-of-Scope
94
+
95
+ * Safety-critical deployments without exhaustive evaluation.
96
+ * Domains whose item catalogue is not covered by the released SID vocabulary.
97
+ * Generation of content that violates the Apache-2.0 license or local regulations.
98
+
99
+ ### ⚖️ Ethical Considerations
100
+
101
+ The model may inherit bias from the training corpus (user behaviour, language model).
102
+ Please **audit for fairness, privacy and potential leakage** before production use.
103
+
104
+ ---
105
+
106
+
107
+ ## 3️⃣ Citation
108
+
109
+ ```
110
+ @misc{MiniOneRec,
111
+ title = {MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation},
112
+ author = {Xiaoyu Kong and Leheng Sheng and Junfei Tan and Yuxin Chen and Jiancan Wu and An Zhang and Xiang Wang and Xiangnan He},
113
+ year = {2025},
114
+ eprint = {2510.24431},
115
+ archivePrefix = {arXiv},
116
+ primaryClass = {cs.IR}
117
+ }
118
+
119
+ @article{ReRe,
120
+ title = {Reinforced Preference Optimization for Recommendation},
121
+ author = {Junfei Tan and Yuxin Chen and An Zhang and Junguang Jiang and Bin Liu and Ziru Xu and Han Zhu and Jian Xu and Bo Zheng and Xiang Wang},
122
+ journal = {arXiv preprint arXiv:2510.12211},
123
+ year = {2025}
124
+ }
125
+ ```
126
+ ## 4️⃣ Acknowledgements
127
+ This repository reuses or adapts portions of code from the following open-source projects. We gratefully acknowledge their authors and contributors:
128
+
129
+ - [ReRe](https://github.com/sober-clever/ReRe)
130
+ - [LC-Rec](https://github.com/zhengbw0324/LC-Rec)
131
+
132
+
133
+ ## 5️⃣ Institutions <!-- omit in toc -->
134
+
135
+ This project is developed by the following institutions:
136
+
137
+ - <img src="assets/lds.png" width="28px"> [LDS](https://data-science.ustc.edu.cn/_upload/tpl/15/04/5380/template5380/index.html)
138
+ - <img src="assets/alphalab.jpg" width="28px"> [AlphaLab](https://alphalab-ustc.github.io/index.html)
139
+ - <img src="assets/next.jpg" width="28px"> [NExT](https://www.nextcenter.org/)