Bringing Your Own View: Graph Contrastive Learning without Prefabricated Data Augmentations.

Advanced Search

Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen

Author Information

Yuning You: Texas A&M University.
Tianlong Chen: University of Texas at Austin.
Zhangyang Wang: University of Texas at Austin.
Yang Shen: Texas A&M University.

PMID: 35647617 DOI: 10.1145/3488560.3498416

Self-supervision is recently surging at its new frontier of graph learning. It facilitates graph representations beneficial to downstream tasks; but its success could hinge on domain knowledge for handcraft or the often expensive trials and errors. Even its state-of-the-art representative, graph contrastive learning (GraphCL), is not completely free of those needs as GraphCL uses a prefabricated prior reflected by the ad-hoc manual selection of graph data augmentations. Our work aims at advancing GraphCL by answering the following questions: Accordingly, we have extended the prefabricated discrete prior in the augmentation set, to a learnable continuous prior in the parameter space of graph generators, assuming that graph priors , similar to the concept of image manifolds, can be learned by data generation. Furthermore, to form contrastive views without collapsing to trivial solutions due to the prior learnability, we have leveraged both principles of information minimization (InfoMin) and information bottleneck (InfoBN) to regularize the learned priors. Eventually, contrastive learning, InfoMin, and InfoBN are incorporated organically into one framework of bi-level optimization. Our principled and automated approach has proven to be competitive against the state-of-the-art graph self-supervision methods, including GraphCL, on benchmarks of small graphs; and shown even better generalizability on large-scale graphs, without resorting to human expertise or downstream validation. Our code is publicly released at https://github.com/Shen-Lab/GraphCL_Automated.

Graph contrastive learning graph generative model information bottleneck information minimization

IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):2747-2758 [PMID: 35895656]
Adv Neural Inf Process Syst. 2022 Dec;35:1909-1922 [PMID: 37192934]
Chem Sci. 2017 Oct 31;9(2):513-530 [PMID: 29629118]
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii68-ii74 [PMID: 36124802]
Proc Mach Learn Res. 2020 Jul;119:10871-10880 [PMID: 33283198]
J Chem Inf Model. 2015 Nov 23;55(11):2324-37 [PMID: 26479676]
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2412-2429 [PMID: 35476575]

R35 GM124952/NIGMS NIH HHS

Journal Article

A Good View for Graph Contrastive Learning.Cross-modality and self-supervised protein embedding for compound-protein affinity and contact prediction.Detecting anomalous proteins using deep representations.GDCL-NcDA: identifying non-coding RNA-disease associations via contrastive learning between deep graph learning and deep matrix factorization.

OpenLB
Open Library of Bioscience