Feature Extraction
Chinese

Add pipeline tag and link to paper/code

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -1,13 +1,18 @@
1
  ---
2
- license: apache-2.0
3
  datasets:
4
  - AL-GR/Item-EMB
5
  language:
6
  - zh
 
 
7
  ---
 
8
  # Forge-SID-Model
9
 
10
- This repository contains a pre-trained **RQVAE (Residual Quantized Variational Autoencoder)** model designed for **SID (Speaker Identity/Structure) generation** tasks. It is part of the [FORGE](https://github.com/AL-GR/FORGE) ecosystem.
 
 
 
11
 
12
  The model weights are stored in `final_sid_rq_model.pth`.
13
 
@@ -55,4 +60,17 @@ CKPT_PATH = './Forge-SID-Model/final_sid_rq_model.pth'
55
 
56
  ---
57
 
58
- For more details about the training setup or the FORGE framework, please refer to the main repository: [AL-GR/FORGE](https://github.com/AL-GR/FORGE).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  datasets:
3
  - AL-GR/Item-EMB
4
  language:
5
  - zh
6
+ license: apache-2.0
7
+ pipeline_tag: feature-extraction
8
  ---
9
+
10
  # Forge-SID-Model
11
 
12
+ This repository contains a pre-trained **RQVAE (Residual Quantized Variational Autoencoder)** model designed for **SID (Semantic Identifier) generation** tasks. It is part of the [FORGE](https://github.com/AL-GR/FORGE) ecosystem, introduced in the paper [FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets](https://huggingface.co/papers/2509.20904).
13
+
14
+ - **Project Page:** [https://huggingface.co/AL-GR](https://huggingface.co/AL-GR)
15
+ - **Code:** [https://github.com/selous123/al_sid](https://github.com/selous123/al_sid)
16
 
17
  The model weights are stored in `final_sid_rq_model.pth`.
18
 
 
60
 
61
  ---
62
 
63
+ For more details about the training setup or the FORGE framework, please refer to the main repository: [AL-GR/FORGE](https://github.com/AL-GR/FORGE).
64
+
65
+ ## Citation
66
+
67
+ If you find this work helpful, please cite:
68
+
69
+ ```bibtex
70
+ @article{fu2025forge,
71
+ title={FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets},
72
+ author={Fu, Kairui and Zhang, Tao and Xiao, Shuwen and Wang, Ziyang and Zhang, Xinming and Zhang, Chenchi and Yan, Yuliang and Zheng, Junjun and others},
73
+ journal={arXiv preprint arXiv:2509.20904},
74
+ year={2025}
75
+ }
76
+ ```