ARBRE: Computational resource to predict pathways towards industrially important aromatic compounds
Synthetic biology and metabolic engineering rely on computational search tools for predictions of novel biosynthetic pathways to industrially important compounds, many of which are derived from aromatic amino acids. Pathway search tools vary in their scope of covered reactions and compounds, as well as in metrics for ranking and evaluation. In this work, we present a new computational resource called ARBRE: Aromatic compounds RetroBiosynthesis Repository and Explorer. It consists of a comprehensive biochemical reaction network centered around aromatic amino acid biosynthesis and a computational toolbox for navigating this network. ARBRE encompasses over 33 ' 000 known and 390 ' 000 novel reactions predicted with generalized enzymatic reactions rules and over 74 ' 000 compounds, of which 19 ' 000 are known to biochemical databases and 55 ' 000 only to PubChem. Over 1 ' 000 molecules that were solely part of the PubChem database before and were previously impossible to integrate into a biochemical network are included into the ARBRE reaction network by assigning enzymatic reactions. ARBRE can be applied for pathway search, enzyme annotation, pathway ranking, visualization, and network expansion around known biochemical pathways and products of lignin degradation to predict valuable compound derivations. In line with the standards of open science, we have made the toolbox freely available to the scientific community on git (https://github.com/EPFL-LCSB/ARBRE) and we provide the web-version at http://lcsb-databases.epfl.ch/arbre/. We envision that ARBRE will provide the community with a new computational resource and comprehensive search tool to predict and rank pathways towards industrially important aromatic compounds.
WOS:000793674600001
2022-07-01
72
259
274
REVIEWED
Relation | URL/DOI |
IsSupplementedBy | |