Files
PPLM/data
2026-05-28 10:27:11 +08:00
..
2025-12-05 17:12:29 +08:00

Datasets

  1. You can download the sequence pair dataset used to train PPLM through pplm_dataset
  2. You can access the original protein-protein interaction dataset from D-SCRIPT. The corrected pair lists by remove duplicate, erroneous, and invalid negative samples are provided in the ppi folder.
  3. You can access the original protein-protein binding affinity dataset from PPB-Affinity. To prevent potential data leakage, we resplited the five-fold cross-validation list by considering the structure similarity, and the list of PDB IDs for each fold is provided in the affinity folder.
  4. You can download the inter-protein contact prediction dataset through contact_dataset.