shahidul034's picture
Add files using upload-large-folder tool
034cb04 verified

Reproduce the paper results

Download the dataset

huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/nq_hotpotqa_train

Launch the local search engine

(1) Download the indexing and corpus.

save_path=/the/path/to/save
python scripts/download.py --save_path $save_path
cat $save_path/part_* > $save_path/e5_Flat.index
gzip -d $save_path/wiki-18.jsonl.gz

(2) Launch a local retrieval server.

conda activate retriever
bash retrieval_launch.sh

Run PPO training

bash train_ppo.sh

Run GRPO training

bash train_grpo.sh

Run evaluation

bash evaluate.sh

You can change $BASE_MODEL to the path of the model you would like to evaluate.