Ziqi Gao: Data Science and Analytics, The Hong Kong University of Science and Technology, Guangzhou, 511400, China. ORCID
Chenran Jiang: Pingshan Translational Medicine Center, Shenzhen Bay Laboratory, Shenzhen, 518118, China.
Jiawen Zhang: Data Science and Analytics, The Hong Kong University of Science and Technology, Guangzhou, 511400, China.
Xiaosen Jiang: The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Chinese Academy of Sciences, Hangzhou, 310022, China.
Lanqing Li: AI Lab, Tencent, Shenzhen, 518000, China. ORCID
Peilin Zhao: AI Lab, Tencent, Shenzhen, 518000, China.
Huanming Yang: The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Chinese Academy of Sciences, Hangzhou, 310022, China. ORCID
Yong Huang: Department of Chemistry, The Hong Kong University of Science and Technology, Hong Kong SAR, China. yonghuang@ust.hk. ORCID
Jia Li: Data Science and Analytics, The Hong Kong University of Science and Technology, Guangzhou, 511400, China. jialee@ust.hk. ORCID
Protein-Protein Interactions (PPIs) are fundamental means of functions and signalings in biological systems. The massive growth in demand and cost associated with experimental PPI studies calls for computational tools for automated prediction and understanding of PPIs. Despite recent progress, in silico methods remain inadequate in modeling the natural PPI hierarchy. Here we present a double-viewed hierarchical graph learning model, HIGH-PPI, to predict PPIs and extrapolate the molecular details involved. In this model, we create a hierarchical graph, in which a node in the PPI network (top outside-of-protein view) is a protein graph (bottom inside-of-protein view). In the bottom view, a group of chemically relevant descriptors, instead of the protein sequences, are used to better capture the structure-function relationship of the protein. HIGH-PPI examines both outside-of-protein and inside-of-protein of the human interactome to establish a robust machine understanding of PPIs. This model demonstrates high accuracy and robustness in predicting PPIs. Moreover, HIGH-PPI can interpret the modes of action of PPIs by identifying important binding and catalytic sites precisely. Overall, "HIGH-PPI [ https://github.com/zqgao22/HIGH-PPI ]" is a domain-knowledge-driven and interpretable framework for PPI prediction studies.