Du, YuanqiJamasb, Arian R.Guo, JeffFu, TianfanHarris, CharlesWang, YinghengDuan, ChenruLio, PietroSchwaller, PhilippeBlundell, Tom L.2024-07-032024-07-032024-07-032024-06-1810.1038/s42256-024-00843-5https://infoscience.epfl.ch/handle/20.500.14299/209100WOS:001249357700001Machine learning has provided a means to accelerate early-stage drug discovery by combining molecule generation and filtering steps in a single architecture that leverages the experience and design preferences of medicinal chemists. However, designing machine learning models that can achieve this on the fly to the satisfaction of medicinal chemists remains a challenge owing to the enormous search space. Researchers have addressed de novo design of molecules by decomposing the problem into a series of tasks determined by design criteria. Here we provide a comprehensive overview of the current state of the art in molecular design using machine learning models as well as important design decisions, such as the choice of molecular representations, generative methods and optimization strategies. Subsequently, we present a collection of practical applications in which the reviewed methodologies have been experimentally validated, encompassing both academic and industrial efforts. Finally, we draw attention to the theoretical, computational and empirical challenges in deploying generative machine learning and highlight future opportunities to better align such approaches to achieve realistic drug discovery end points.|Data-driven generative methods have the potential to greatly facilitate molecular design tasks for drug design.TechnologyDrug DiscoveryNeural-NetworkTransformerExplorationDerivativesSolubilityStrategiesInhibitorsAlgorithmModelMachine learning-aided generative molecular designtext::journal::journal article::review article