Annerial/predicted_protein_complex_structures_datasets

predicted_protein_complex_structures_datasets / README.txt

	'id': The ID number in our dataset. Homodimers are prefixed with "EID_", while heterodimers are prefixed with "ID_".
	'sequence': The sequences of the two proteins in the pair, formatted as "Sequence A: Sequence B", with a colon separating the two sequences.
	'len1': The length of Sequence A.
	'len2': The length of Sequence B.
	'plddt': The pLDDT (predicted local distance difference test) score for the predicted complex.
	'ptm': The pTM (predicted TM-score) score for the predicted complex.
	'iptm': The ipTM (interface predicted TM-score) score for the predicted complex.
	'gene_id1': The gene ID for Sequence A.
	'gene_id2': The gene ID for Sequence B.
	'protein_names1': The protein name for Sequence A.
	'protein_names2': The protein name for Sequence B.
	'organism1': The name of the species to which Sequence A belongs.
	'organism2': The name of the species to which Sequence B belongs.
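
The fields above can be read along the following lines. This is a minimal sketch, not the dataset authors' own loader: it assumes each record is available as a Python dict keyed by the field names listed above, and that the two sequences are separated by a single colon. The function name describe_record and the example values are hypothetical and are not taken from the dataset.

```python
def describe_record(record):
    """Summarize one record that uses the fields described above."""
    # Assumption: Sequence A and Sequence B are separated by one colon;
    # surrounding whitespace, if any, is stripped.
    seq_a, seq_b = (part.strip() for part in record["sequence"].split(":", 1))

    # "EID_" marks homodimers and "ID_" marks heterodimers.
    if record["id"].startswith("EID_"):
        kind = "homodimer"
    elif record["id"].startswith("ID_"):
        kind = "heterodimer"
    else:
        kind = "unknown"

    # Check that len1 and len2 agree with the actual sequence lengths.
    lengths_match = (
        len(seq_a) == int(record["len1"]) and len(seq_b) == int(record["len2"])
    )

    return {
        "kind": kind,
        "seq_a": seq_a,
        "seq_b": seq_b,
        "lengths_match": lengths_match,
    }


# Made-up record for illustration only (not real dataset content).
example = {"id": "EID_1", "sequence": "MKT:MKT", "len1": 3, "len2": 3}
summary = describe_record(example)
print(summary["kind"], summary["lengths_match"])
```

Splitting on the first colon only keeps the sketch tolerant of stray whitespace around the separator; how the records are actually stored and loaded depends on the dataset files themselves.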