Xiaopeng Xu, Chencheng Xu, Wenjia He, Lesong Wei, Haoyang Li, Juexiao Zhou, Ruochi Zhang, Yu Wang, Yuanpeng Xiong, Xin Gao
MOTIVATION: Macrocyclic peptides hold great promise as therapeutics targeting intracellular proteins. This stems from their remarkable ability to bind flat protein surfaces with high affinity and specificity while potentially traversing the cell membrane. Research has already explored their use in developing inhibitors for intracellular proteins, such as KRAS, a well-known driver in various cancers. However, computational approaches for de novo macrocyclic peptide design remain largely unexplored.
RESULTS: Here, we introduce HELM-GPT, a novel method that combines the strength of the hierarchical editing language for macromolecules (HELM) representation and generative pre-trained transformer (GPT) for de novo macrocyclic peptide design. Through reinforcement learning (RL), our experiments demonstrate that HELM-GPT has the ability to generate valid macrocyclic peptides and optimize their properties. Furthermore, we introduce a contrastive preference loss during the RL process, further enhanced the optimization performance. Finally, to co-optimize peptide permeability and KRAS binding affinity, we propose a step-by-step optimization strategy, demonstrating its effectiveness in generating molecules fulfilling both criteria. In conclusion, the HELM-GPT method can be used to identify novel macrocyclic peptides to target intracellular proteins.
AVAILABILITY AND IMPLEMENTATION: The code and data of HELM-GPT are freely available on GitHub (https://github.com/charlesxu90/helm-gpt).
J Chem Inf Model. 2012 Oct 22;52(10):2796-806
[PMID:
22947017]
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):1043-1053
[PMID:
37364719]
Trends Pharmacol Sci. 2022 Mar;43(3):234-248
[PMID:
34911657]
J Chem Inf Model. 2020 Dec 28;60(12):5918-5922
[PMID:
33118816]
Methods Mol Biol. 2015;1248:39-53
[PMID:
25616324]
Chem Sci. 2019 Feb 11;10(12):3567-3572
[PMID:
30996948]
Proc Natl Acad Sci U S A. 2021 Mar 23;118(12):
[PMID:
33723038]
Cell. 2022 Sep 15;185(19):3520-3532.e26
[PMID:
36041435]
J Am Chem Soc. 2006 Mar 1;128(8):2510-1
[PMID:
16492015]
J Am Chem Soc. 2006 Nov 1;128(43):14073-80
[PMID:
17061890]
Expert Opin Drug Discov. 2020 Jul;15(7):833-852
[PMID:
32345066]
Front Oncol. 2022 Nov 17;12:992171
[PMID:
36465350]
F1000Res. 2024 Feb 20;12:757
[PMID:
38434657]
J Chem Inf Model. 2023 Apr 10;63(7):2240-2250
[PMID:
36930969]
J Cheminform. 2017 Sep 4;9(1):48
[PMID:
29086083]
Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940
[PMID:
30398643]
Curr Opin Struct Biol. 2022 Feb;72:226-236
[PMID:
34963082]
Front Pharmacol. 2020 Dec 18;11:565644
[PMID:
33390943]
Drug Discov Today. 2021 Nov;26(11):2707-2715
[PMID:
34082136]