Sungyeon commited on
Commit
949beaa
·
verified ·
1 Parent(s): 33a371c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -172,13 +172,13 @@ wget https://huggingface.co/TIGER-Lab/UniIR/resolve/main/checkpoint/CLIP_SF/clip
172
  # Clone the GENIUS checkpoints (Stage 1 and 2)
173
  git clone https://huggingface.co/Sungyeon/GENIUS
174
  ```
175
- ### Each Checkpoints
176
  - **CLIP-SF Model** (Stage 0): [`clip_sf_large.pth`](https://huggingface.co/TIGER-Lab/UniIR/blob/main/checkpoint/CLIP_SF/clip_sf_large.pth)
177
  - **Residual Quantization Model** (Stage 1): [`rq_clip_large.pth`](https://huggingface.co/Sungyeon/GENIUS/blob/main/checkpoint/rq_clip_large.pth)
178
  - **Generator Model** (Stage 2): [`GENIUS_t5small.pth`](https://huggingface.co/Sungyeon/GENIUS/blob/main/checkpoint/GENIUS_t5small.pth)
179
 
180
- > Note: All three models are required for full functionality. The CLIP-SF model is used for feature extraction, the Residual Quantization model for ID encoding, and the Generator model for retrieval.
181
-
182
  ## 📈 Performance
183
 
184
  > The results in parentheses denote scores from our reimplemented checkpoints, as the originals were lost during server migration. While close to the paper, slight variations may occur due to retraining randomness.
@@ -218,7 +218,7 @@ When the candidate pool grows, embedding‐based retrieval (e.g., CLIP + nearest
218
  If you find this work useful, please cite:
219
 
220
  ```bibtex
221
- @article{kim2024genius,
222
  title={GENIUS: A Generative Framework for Universal Multimodal Search},
223
  author={Kim, Sungyeon and Zhu, Xinliang and Lin, Xiaofan and Bastan, Muhammet and Gray, Douglas and Kwak, Suha},
224
  journal={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
 
172
  # Clone the GENIUS checkpoints (Stage 1 and 2)
173
  git clone https://huggingface.co/Sungyeon/GENIUS
174
  ```
175
+ ### Each Component Checkpoints
176
  - **CLIP-SF Model** (Stage 0): [`clip_sf_large.pth`](https://huggingface.co/TIGER-Lab/UniIR/blob/main/checkpoint/CLIP_SF/clip_sf_large.pth)
177
  - **Residual Quantization Model** (Stage 1): [`rq_clip_large.pth`](https://huggingface.co/Sungyeon/GENIUS/blob/main/checkpoint/rq_clip_large.pth)
178
  - **Generator Model** (Stage 2): [`GENIUS_t5small.pth`](https://huggingface.co/Sungyeon/GENIUS/blob/main/checkpoint/GENIUS_t5small.pth)
179
 
180
+ > Note: All three models are required for full functionality.
181
+ >
182
  ## 📈 Performance
183
 
184
  > The results in parentheses denote scores from our reimplemented checkpoints, as the originals were lost during server migration. While close to the paper, slight variations may occur due to retraining randomness.
 
218
  If you find this work useful, please cite:
219
 
220
  ```bibtex
221
+ @inproceedings{kim2024genius,
222
  title={GENIUS: A Generative Framework for Universal Multimodal Search},
223
  author={Kim, Sungyeon and Zhu, Xinliang and Lin, Xiaofan and Bastan, Muhammet and Gray, Douglas and Kwak, Suha},
224
  journal={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},