A family of models and datasets presented in "Generalise or Memorise? Benchmarking Ligand-Conditioned Protein Generation from Sequence-Only Data".