All baselines are stored in examples/pertqa, pubmedqa, antibiotic_pred, and hotpotqa, and each can be found in the corresponding xxx_baseline.ipynb notebook. pertqa contains three datasets: Adamson, Norman, and Replogle. antibiotic_pred contains one dataset, MolQA. The only exception is the AFlow baseline, which is not run from a notebook and should be run with test_aflow.sh.