Communications in Information and Systems

Volume 19 (2019)

Number 3

Generative network complex (GNC) for drug discovery

Pages: 241 – 277

DOI: https://dx.doi.org/10.4310/CIS.2019.v19.n3.a2

Authors

Christopher Grow (Department of Mathematics, Michigan State University, East Lansing, Mi., U.S.A.)

Kaifu Gao (Department of Mathematics, Michigan State University, East Lansing, Mi., U.S.A.)

Duc Duy Nguyen (Department of Mathematics, Michigan State University, East Lansing, Mi., U.S.A.)

Guo-Wei Wei (Department of Mathematics, Michigan State University, East Lansing, Mi., U.S.A.)

Abstract

It remains a challenging task to generate a vast variety of novel compounds with desirable pharmacological properties. In this work, a generative network complex (GNC) is proposed as a new platform for designing novel compounds, predicting their physical and chemical properties, and selecting potential drug candidates that fulfill various druggable criteria such as binding affinity, solubility, partition coefficient, etc. We combine a SMILES string generator, which consists of an encoder, a drug-property controlled or regulated latent space, and a decoder, with verification deep neural networks, a target-specific three-dimensional (3D) pose generator, and mathematical deep learning networks to generate new compounds, predict their drug properties, construct 3D poses associated with target proteins, and reevaluate druggability, respectively. New compounds were generated in the latent space by either randomized output, controlled output, or optimized output. In our demonstration, 2.08 million and 2.8 million novel compounds are generated respectively for Cathepsin S and BACE targets. These new compounds are very different from the seeds and cover a larger chemical space. For potentially active compounds, their 3D poses are generated using a state-of-the-art method. The resulting 3D complexes are further evaluated for druggability by a championing deep learning algorithm based on algebraic topology, differential geometry, and algebraic graph theories. Performed on supercomputers, the whole process took less than one week. Therefore, our GNC is an efficient new paradigm for discovering new drug candidates.

The research of D.D. Nguyen was supported in part by Bristol-Myers Squibb and by Pfizer.

The research of G.-W. Wei was supported in part by NSF Grants DMS-1721024, DMS-1761320, and IIS1900473, NIH grant GM126189, by Bristol-Myers Squibb, and by Pfizer.

Published 6 December 2019