mirror of
https://github.com/junliu621/PPLM.git
synced 2026-06-04 14:24:22 +08:00
Datasets
- You can download the sequence pair dataset used to train PPLM through https://zhanggroup.org/PPLM/bin/databases/PPLM_Dataset.tar.gz or google_drive
- You can access the original protein-protein interaction dataset from D-SCRIPT. The corrected pair lists by remove duplicate, erroneous, and invalid negative samples are provided in the ppi folder.
- You can access the original protein-protein binding affinity dataset from PPB-Affinity. To prevent potential data leakage, we resplited the five-fold cross-validation list by considering the structure similarity, and the list of PDB IDs for each fold is provided in the affinity folder.
- You can download the inter-protein contact prediction dataset through https://zhanggroup.org/PPLM/bin/databases/PPLM-Contact_Dataset.tar.gz or google_drive.