HELM-GPT: de novo macrocyclic peptide design using generative pre-trained transformer.

Xiaopeng Xu, Chencheng Xu, Wenjia He, Lesong Wei, Haoyang Li, Juexiao Zhou, Ruochi Zhang, Yu Wang, Yuanpeng Xiong, Xin Gao
Author Information
  1. Xiaopeng Xu: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia. ORCID
  2. Chencheng Xu: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia.
  3. Wenjia He: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia.
  4. Lesong Wei: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia.
  5. Haoyang Li: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia. ORCID
  6. Juexiao Zhou: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia. ORCID
  7. Ruochi Zhang: Syneron Technology, Guangzhou 510000, China.
  8. Yu Wang: Syneron Technology, Guangzhou 510000, China.
  9. Yuanpeng Xiong: Syneron Technology, Guangzhou 510000, China.
  10. Xin Gao: Computer Science Program, Computer, Electrical and Mathematical Science and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Makkah, Kingdom of Saudi Arabia. ORCID

Abstract

MOTIVATION: Macrocyclic peptides hold great promise as therapeutics targeting intracellular proteins. This stems from their remarkable ability to bind flat protein surfaces with high affinity and specificity while potentially traversing the cell membrane. Research has already explored their use in developing inhibitors for intracellular proteins, such as KRAS, a well-known driver in various cancers. However, computational approaches for de novo macrocyclic peptide design remain largely unexplored.
RESULTS: Here, we introduce HELM-GPT, a novel method that combines the strength of the hierarchical editing language for macromolecules (HELM) representation and generative pre-trained transformer (GPT) for de novo macrocyclic peptide design. Through reinforcement learning (RL), our experiments demonstrate that HELM-GPT has the ability to generate valid macrocyclic peptides and optimize their properties. Furthermore, we introduce a contrastive preference loss during the RL process, further enhanced the optimization performance. Finally, to co-optimize peptide permeability and KRAS binding affinity, we propose a step-by-step optimization strategy, demonstrating its effectiveness in generating molecules fulfilling both criteria. In conclusion, the HELM-GPT method can be used to identify novel macrocyclic peptides to target intracellular proteins.
AVAILABILITY AND IMPLEMENTATION: The code and data of HELM-GPT are freely available on GitHub (https://github.com/charlesxu90/helm-gpt).

References

J Chem Inf Model. 2012 Oct 22;52(10):2796-806 [PMID: 22947017]
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):1043-1053 [PMID: 37364719]
Trends Pharmacol Sci. 2022 Mar;43(3):234-248 [PMID: 34911657]
J Chem Inf Model. 2020 Dec 28;60(12):5918-5922 [PMID: 33118816]
Methods Mol Biol. 2015;1248:39-53 [PMID: 25616324]
Chem Sci. 2019 Feb 11;10(12):3567-3572 [PMID: 30996948]
Proc Natl Acad Sci U S A. 2021 Mar 23;118(12): [PMID: 33723038]
Cell. 2022 Sep 15;185(19):3520-3532.e26 [PMID: 36041435]
J Am Chem Soc. 2006 Mar 1;128(8):2510-1 [PMID: 16492015]
J Am Chem Soc. 2006 Nov 1;128(43):14073-80 [PMID: 17061890]
Expert Opin Drug Discov. 2020 Jul;15(7):833-852 [PMID: 32345066]
Front Oncol. 2022 Nov 17;12:992171 [PMID: 36465350]
F1000Res. 2024 Feb 20;12:757 [PMID: 38434657]
J Chem Inf Model. 2023 Apr 10;63(7):2240-2250 [PMID: 36930969]
J Cheminform. 2017 Sep 4;9(1):48 [PMID: 29086083]
Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940 [PMID: 30398643]
Curr Opin Struct Biol. 2022 Feb;72:226-236 [PMID: 34963082]
Front Pharmacol. 2020 Dec 18;11:565644 [PMID: 33390943]
Drug Discov Today. 2021 Nov;26(11):2707-2715 [PMID: 34082136]

Grants

  1. URF/1/4352-01-01/King Abdullah University of Science and Technology

MeSH Term

Peptides, Cyclic
Computational Biology
Drug Design
Peptides
Humans
Algorithms
Software

Chemicals

Peptides, Cyclic
Peptides

Word Cloud

Similar Articles

Cited By