kkknight
/

MiniOneRec

 - llm
 - large-language-model
 - recommender-system
+---
+# 🌟 MiniOneRec · Generative Recommender Checkpoints
+<p align="center">
+  <img src="https://raw.githubusercontent.com/AkaliKong/MiniOneRec/main/assets/logo.png" width="450"/>
+</p>
+**MiniOneRec** is the *first fully-open generative recommendation framework*.
+It provides an end-to-end workflow covering **Semantic-ID (SID) construction**, **Supervised Fine-Tuning (SFT)** and **Recommendation-oriented Reinforcement Learning (RL)**.
+These checkpoints accompany the paper:
+> **MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation**
+> <a href="https://arxiv.org/abs/2510.24431">📄 Technical Report</a> |📦 <a href="https://github.com/AkaliKong/MiniOneRec"> Github</a>|<a href="https://modelscope.cn/models/k925238839/MiniOneRec">🤖  Modelscope</a>
+---
+## 🗺️ Table of Contents
+1. Model Summary
+2. Intended Uses & Limitations
+3. Quick Start
+4. Training & Evaluation Details
+5. Citation
+6. Acknowledgements
+---
+## 1️⃣ Model Summary
+MiniOneRec rewrites every catalogue item into a discrete **SID token**:
+1. **Text Encoder** (frozen PLM) →
+2. **3-level Residual Quantisation (RQ-VAE / RQ-KMeans)** → SID.
+User history ≙ SID sequence.
+Training pipeline:
+|
+ Stage
+|
+ Objective
+|
+ Notes
+|
+|-------|-----------|-------|
+| **SFT** | Next-SID prediction + language alignment | inherits world knowledge while grounding in item space |
+| **RL (GRPO)** | KL-regularised policy optimisation | constrained beam search over the closed SID set |
+### Released checkpoints (examples)
+|
+ Checkpoint
+|
+ Base LLM
+|
+#
+Params
+|
+ Precision
+|
+ Stage
+|
+|-------------------------------------|---------------------|---------|-----------|-----------|
+| `MiniOneRec-SFT-industrial`         | Qwen-7B             | 7 B     | bf16      | SFT       |
+| `MiniOneRec-RL-industrial`          | Qwen-7B             | 7 B     | bf16      | SFT+RL    |
+*(Replace with the exact repo names you upload.)*
+---
+## 2️⃣ Intended Uses & Limitations
+### ✅ Intended
+* Next-item prediction in e-commerce / content platforms.
+* Research on generative recommendation and RL-from-human-feedback variants.
+### ❌ Out-of-Scope
+* Safety-critical deployments without exhaustive evaluation.
+* Domains whose item catalogue is not covered by the released SID vocabulary.
+* Generation of content that violates the Apache-2.0 license or local regulations.
+### ⚖️ Ethical Considerations
+The model may inherit bias from the training corpus (user behaviour, language model).
+Please **audit for fairness, privacy and potential leakage** before production use.
+---
+## 3️⃣ Citation
+```
+@misc{MiniOneRec,
+  title  = {MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation},
+  author = {Xiaoyu Kong and Leheng Sheng and Junfei Tan and Yuxin Chen and Jiancan Wu and An Zhang and Xiang Wang and Xiangnan He},
+  year   = {2025},
+  eprint = {2510.24431},
+  archivePrefix = {arXiv},
+  primaryClass  = {cs.IR}
+}
+@article{ReRe,
+  title   = {Reinforced Preference Optimization for Recommendation},
+  author  = {Junfei Tan and Yuxin Chen and An Zhang and Junguang Jiang and Bin Liu and Ziru Xu and Han Zhu and Jian Xu and Bo Zheng and Xiang Wang},
+  journal = {arXiv preprint arXiv:2510.12211},
+  year    = {2025}
+}
+```
+## 4️⃣ Acknowledgements
+This repository reuses or adapts portions of code from the following open-source projects. We gratefully acknowledge their authors and contributors:
+- [ReRe](https://github.com/sober-clever/ReRe)
+- [LC-Rec](https://github.com/zhengbw0324/LC-Rec)
+## 5️⃣ Institutions  <!-- omit in toc -->
+This project is developed by the following institutions:
+- <img src="assets/lds.png" width="28px"> [LDS](https://data-science.ustc.edu.cn/_upload/tpl/15/04/5380/template5380/index.html)
+- <img src="assets/alphalab.jpg" width="28px"> [AlphaLab](https://alphalab-ustc.github.io/index.html)
+- <img src="assets/next.jpg" width="28px"> [NExT](https://www.nextcenter.org/)