Machine learning-aided generative molecular design
Machine learning has provided a means to accelerate early-stage drug discovery by combining molecule generation and filtering steps in a single architecture that leverages the experience and design preferences of medicinal chemists. However, designing machine learning models that can achieve this on the fly to the satisfaction of medicinal chemists remains a challenge owing to the enormous search space. Researchers have addressed de novo design of molecules by decomposing the problem into a series of tasks determined by design criteria. Here we provide a comprehensive overview of the current state of the art in molecular design using machine learning models as well as important design decisions, such as the choice of molecular representations, generative methods and optimization strategies. Subsequently, we present a collection of practical applications in which the reviewed methodologies have been experimentally validated, encompassing both academic and industrial efforts. Finally, we draw attention to the theoretical, computational and empirical challenges in deploying generative machine learning and highlight future opportunities to better align such approaches to achieve realistic drug discovery end points.
Data-driven generative methods have the potential to greatly facilitate molecular design tasks for drug design.
WOS:001249357700001
2024-06-18
REVIEWED
Funder | Grant Number |
Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (Swiss National Science Foundation) | 180544 |
NCCR Catalysis | |
National Centre of Competence in Research - Swiss National Science Foundation | BB/M011194/1 |
Biotechnology and Biological Sciences Research Council (BBSRC) DTP studentship | |
Cornell Presidential Life Science Fellowship | |