Add pipeline tag and link to paper/code
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,13 +1,18 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
datasets:
|
| 4 |
- AL-GR/Item-EMB
|
| 5 |
language:
|
| 6 |
- zh
|
|
|
|
|
|
|
| 7 |
---
|
|
|
|
| 8 |
# Forge-SID-Model
|
| 9 |
|
| 10 |
-
This repository contains a pre-trained **RQVAE (Residual Quantized Variational Autoencoder)** model designed for **SID (
|
|
|
|
|
|
|
|
|
|
| 11 |
|
| 12 |
The model weights are stored in `final_sid_rq_model.pth`.
|
| 13 |
|
|
@@ -55,4 +60,17 @@ CKPT_PATH = './Forge-SID-Model/final_sid_rq_model.pth'
|
|
| 55 |
|
| 56 |
---
|
| 57 |
|
| 58 |
-
For more details about the training setup or the FORGE framework, please refer to the main repository: [AL-GR/FORGE](https://github.com/AL-GR/FORGE).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
datasets:
|
| 3 |
- AL-GR/Item-EMB
|
| 4 |
language:
|
| 5 |
- zh
|
| 6 |
+
license: apache-2.0
|
| 7 |
+
pipeline_tag: feature-extraction
|
| 8 |
---
|
| 9 |
+
|
| 10 |
# Forge-SID-Model
|
| 11 |
|
| 12 |
+
This repository contains a pre-trained **RQVAE (Residual Quantized Variational Autoencoder)** model designed for **SID (Semantic Identifier) generation** tasks. It is part of the [FORGE](https://github.com/AL-GR/FORGE) ecosystem, introduced in the paper [FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets](https://huggingface.co/papers/2509.20904).
|
| 13 |
+
|
| 14 |
+
- **Project Page:** [https://huggingface.co/AL-GR](https://huggingface.co/AL-GR)
|
| 15 |
+
- **Code:** [https://github.com/selous123/al_sid](https://github.com/selous123/al_sid)
|
| 16 |
|
| 17 |
The model weights are stored in `final_sid_rq_model.pth`.
|
| 18 |
|
|
|
|
| 60 |
|
| 61 |
---
|
| 62 |
|
| 63 |
+
For more details about the training setup or the FORGE framework, please refer to the main repository: [AL-GR/FORGE](https://github.com/AL-GR/FORGE).
|
| 64 |
+
|
| 65 |
+
## Citation
|
| 66 |
+
|
| 67 |
+
If you find this work helpful, please cite:
|
| 68 |
+
|
| 69 |
+
```bibtex
|
| 70 |
+
@article{fu2025forge,
|
| 71 |
+
title={FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets},
|
| 72 |
+
author={Fu, Kairui and Zhang, Tao and Xiao, Shuwen and Wang, Ziyang and Zhang, Xinming and Zhang, Chenchi and Yan, Yuliang and Zheng, Junjun and others},
|
| 73 |
+
journal={arXiv preprint arXiv:2509.20904},
|
| 74 |
+
year={2025}
|
| 75 |
+
}
|
| 76 |
+
```
|