Automated design of multi-target ligands by generative deep learning.
Laura Isigkeit, Tim Hörmann, Espen Schallmayer, Katharina Scholz, Felix F Lillich, Johanna H M Ehrler, Benedikt Hufnagel, Jasmin Büchner, Julian A Marschner, Jörg Pabel, Ewgenij Proschak, Daniel Merk
Author Information
Laura Isigkeit: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany. ORCID
Tim Hörmann: Ludwig-Maximilians-Universität München, Department of Pharmacy, 81377, Munich, Germany. ORCID
Espen Schallmayer: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany.
Katharina Scholz: Ludwig-Maximilians-Universität München, Department of Pharmacy, 81377, Munich, Germany. ORCID
Felix F Lillich: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany. ORCID
Johanna H M Ehrler: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany. ORCID
Benedikt Hufnagel: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany.
Jasmin Büchner: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany.
Julian A Marschner: Ludwig-Maximilians-Universität München, Department of Pharmacy, 81377, Munich, Germany.
Jörg Pabel: Ludwig-Maximilians-Universität München, Department of Pharmacy, 81377, Munich, Germany. ORCID
Ewgenij Proschak: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany.
Daniel Merk: Goethe University Frankfurt, Institute of Pharmaceutical Chemistry, 60438, Frankfurt, Germany. daniel.merk@cup.lmu.de. ORCID
Generative deep learning models enable data-driven de novo design of molecules with tailored features. Chemical language models (CLM) trained on string representations of molecules such as SMILES have been successfully employed to design new chemical entities with experimentally confirmed activity on intended targets. Here, we probe the application of CLM to generate multi-target ligands for designed polypharmacology. We capitalize on the ability of CLM to learn from small fine-tuning sets of molecules and successfully bias the model towards designing drug-like molecules with similarity to known ligands of target pairs of interest. Designs obtained from CLM after pooled fine-tuning are predicted active on both proteins of interest and comprise pharmacophore elements of ligands for both targets in one molecule. Synthesis and testing of twelve computationally favored CLM designs for six target pairs reveals modulation of at least one intended protein by all selected designs with up to double-digit nanomolar potency and confirms seven compounds as designed dual ligands. These results corroborate CLM for multi-target de novo design as source of innovation in drug discovery.
101040355/EC | EU Framework Programme for Research and Innovation H2020 | H2020 Priority Excellent Science | H2020 European Research Council (H2020 Excellent Science - European Research Council)