systems biology based metabolic engineering for non-natural … · 2019. 4. 17. · mcs minimal cut...
TRANSCRIPT
TSpace Research Repository tspace.library.utoronto.ca
Systems biology based metabolic engineering for non-natural chemicals
Alessandra Biz, Scott Proulx, Zhiqing Xu, Kavya Siddartha, Alex
Mulet Indrayanti, Radhakrishnan Mahadevan
Version Post-print/Accepted Manuscript
Citation (published version)
Publisher’s Statement
Biz, A., Proulx, S., Xu, Z., Siddartha, K., Indrayanti, A.M. and Mahadevan, R., 2019. Systems biology based metabolic engineering for non-natural chemicals. Biotechnology Advances. The article has been published in final form at [10.1016/j.biotechadv.2019.04.001]
Copyright / License This work is licensed under the Creative Commons
Attribution-NonCommercial-NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
How to cite TSpace items
Always cite the published version, so the author(s) will receive recognition through services that track
citation counts, e.g. Scopus. If you need to cite the page number of the author manuscript from TSpace because you cannot access the published version, then cite the TSpace version in addition to the published
version using the permanent URI (handle) found on the record page.
This article was made openly accessible by U of T Faculty. Please tell us how this access benefits you. Your story matters.
Accepted Manuscript
Systems biology based metabolic engineering for non-naturalchemicals
Alessandra Biz, Scott Proulx, Zhiqing Xu, Kavya Siddartha, AlexMulet Indrayanti, Radhakrishnan Mahadevan
PII: S0734-9750(19)30060-6DOI: https://doi.org/10.1016/j.biotechadv.2019.04.001Reference: JBA 7379
To appear in: Biotechnology Advances
Received date: 4 December 2018Revised date: 23 February 2019Accepted date: 1 April 2019
Please cite this article as: A. Biz, S. Proulx, Z. Xu, et al., Systems biology based metabolicengineering for non-natural chemicals, Biotechnology Advances, https://doi.org/10.1016/j.biotechadv.2019.04.001
This is a PDF file of an unedited manuscript that has been accepted for publication. Asa service to our customers we are providing this early version of the manuscript. Themanuscript will undergo copyediting, typesetting, and review of the resulting proof beforeit is published in its final form. Please note that during the production process errors maybe discovered which could affect the content, and all legal disclaimers that apply to thejournal pertain.
ACC
EPTE
D M
ANU
SCR
IPT
Systems Biology Based Metabolic Engineering for Non-natural Chemicals
Alessandra Biz1, Scott Proulx
1, Zhiqing Xu
1, Kavya Siddartha
1, Alex Mulet Indrayanti
1,
Radhakrishnan Mahadevan1,*
1Department of Chemical Engineering and Applied Chemistry, University of Toronto,
Toronto, ON, Canada
*Corresponding Author.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Abstract
Production of chemicals in microorganisms is no longer restricted to products arising from
native metabolic potential. In this review, we highlight the evolution of metabolic
engineering studies, from the production of natural chemicals fermented from biomass
hydrolysates, to the engineering of microorganisms for the production of non-natural
chemicals. Advances in synthetic biology are accelerating the successful development of
microbial cell factories to directly produce value-added chemicals. Here we outline the
emergence of novel computational tools for the creation of synthetic pathways, for designing
artificial enzymes for non-natural reactions and for re-wiring host metabolism to increase the
metabolic flux to products. We also highlight exciting opportunities for applying directed
evolution of enzymes, dynamic control of growth and production, growth-coupling strategies
as well as decoupled strategies based on orthogonal pathways in the context of non-natural
chemicals.
Keywords: systems biology; non-natural chemicals; in-silico pathway design; directed
evolution; de novo protein engineering; genome-scale models; flux balance analysis; dynamic
control; growth-coupled production; orthogonality
Abbreviations
FBA Flux Balance Analysis
MOMA Minimization of Metabolic Adjustment
FDCA 2,5-Furandicarboxylic acid
5-HMF Hydroxymethylfurfural
PEF polyethylene furanoate
PLA polylactic acid
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
BDO butanediol
GSMMs Genome-Scale Metabolic Models
ER enoate reductases
1,3 PDO 1,3 pentanediol
CBM Constraint-Based Modeling
COBRA Constraint-Based Reconstruction and Analysis
CSOMs Computational Strain Optimization Methods
MCS minimal cut sets
EMs elementary flux modes
NCD nicotinamide cytosine dinucleotide
EMA Elementary Mode Analysis
MOVE Metabolic Valve Enumerator
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
1. Introduction
In 2004, when the U. S. Department of Energy released the report “Top Value-Added
Chemicals from Biomass” (Werpy and Petersen, 2004) the idea was to screen platform
chemicals that were naturally produced by microorganisms from sugars, which, in turn, could
be chemically transformed into molecules of industrial relevance. In fact, currently there are
several companies producing bulk and fine chemicals from renewable sources, although, in
most cases, via chemical routes (Table 1).
Now, instead of producing these platform chemicals, more studies are being driven by the
idea that any chemical, including non-natural chemicals, could be potentially made directly in
microbial cell factories (Lee et al., 2012). The technology that is accelerating this shift is the
integration of systems biology with metabolic engineering (Jensen and Keasling, 2018).
Non-natural chemicals are molecules, such as, fuels, materials and pharmaceuticals, that
rarely occur in nature and are primarily being made chemically by using petroleum as starting
material (Lee et al., 2012). As they are not made naturally by any organism, there are no
known natural pathways that lead to the production of these compounds. However, recent
advances in systems biology based metabolic engineering are being used to uncover synthetic
pathways and rewire host metabolism for their biochemical production from sugars (Jensen
and Keasling, 2018). These studies, and the insights learned from them, are the focus of this
review.
Table 2 shows that the production of non-natural chemicals is already a reality, with more
than 40 different compounds being produced in different hosts, in particular, Escherichia
coli, Pseudomonas putida, Corynebacterium glutamicum and Saccharomyces cerevisiae. The
major synthetic pathways for producing non-natural compounds are highlighted in Figure 1.
Recently, a few of those compounds have reached the commercial scale. One noteworthy
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
example is the development of a process for the production of 1,4 butanediol (1,4 BDO)
directly in E. coli by the company Genomatica (San Diego, CA), which reached the
production scale of 30000 tons/year.
Systems biology is generally defined by the integration of computational techniques and
biological Big Data, acquired by using high-throughput technologies, to unravel the
principles of complex biological systems (Dai and Nielsen, 2015). Recently, there has been a
focus on synthetic biology, which involves the use of engineering fundamentals, such as
systems abstraction, part standardization, characterization and modeling, for the design,
construction and optimization of genetic circuits and synthetic pathways. Thus, systems
biology based metabolic engineering is the field of study that combines systems and synthetic
biology to develop processes for making non-natural chemicals biologically (Lee et al.,
2011). Such strategies are based on, 1) identifying de novo pathways that could provide a
route to produce the proposed chemical, 2) identifying the enzyme candidates that could be
used for the proposed reactions of the pathway, and 3) determining and implementing the
genetic interventions that can be made in the host for increasing flux to products (Lee et al.,
2011). The general outline of how systems biology based metabolic engineering is used to
design microbes for non-natural chemicals production is shown in Figure 2.
In this review, we will briefly present each of these concepts, including the latest
achievements in systems biology based strategies to design microbial cell factories and how
these technologies are being used for producing non-natural chemicals. Important aspects that
will be covered in this review are: de novo pathway design, engineering of enzymes through
directed evolution, model-based designs for both growth-coupled synthetic pathways and
decoupled growth and production, as well as dynamic control and orthogonality concepts
based on metabolic valves.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
2. In-silico pathway design
In-silico pathway design is the preliminary step in the development of novel biosynthetic
pathways. In the past decade, a large number of pathway design tools were developed,
making use of divergent approaches (i.e., different molecule encodings, pathway network
topologies, pruning methods, search algorithms, pathway prioritization metrics etc.) (Wang et
al., 2017; Hadadi and Hatzimanikatis, 2015). However, only a small portion of those
algorithms allow prediction of de novo pathways with non-natural intermediates and
chemicals, while most tools have focused on searching the reaction networks of metabolites
in natural biochemical processes.
A generalized workflow for de novo pathway design involves the following critical steps: (a)
the identification, from a list of starting materials, of all chemically plausible pathways to the
desired target compound on the basis of chemical structure transformation, (b) the pathway
prioritization based on various criteria, including thermodynamic feasibility, number of non-
natural reaction steps, enzyme docking and cellular resources cost etc., and (c) the selection
of the most suitable host organism and designing the synthetic expression constructs
(pathway integration).
In this section, we will briefly discuss how such steps are linked together to build a pipeline
for the discovery of new bio-pathways, as well as the recently developed computational tools
designed for that task, and the main challenges that need to be overcome regarding pathway
prediction.
2.1 Pathway identification
Recent efforts for de novo pathway discovery involve two closely-related approaches to
pathway identification, one of which is the construction of a putative reaction network using
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
generalized reaction transformations of promiscuous enzymes, which is often referred to as
“graph-based” pathway finding methods (BNICE, SimPheny, ReactPRED, GEM-path,
RetroPath 2.0) (Wang et al., 2017a). The other approach is the creation of an extended
pathway database (XTMS, MINE, ATLAS of Biochemistry) which allows a fast search for
pathways with non-natural intermediates and reactions (Carbonell et al., 2014). Both
strategies explore generalized reaction transformations of putative promiscuous enzymes that
can act upon structurally similar molecules (Figure 3A).
Different molecular and reaction encoding systems (Figure 3C) have been proposed to solve
the pathway identification in a retrosynthesis context. SimPheny (Yim et al., 2011) and
BNICE (Hatzimanikatis et al., 2005) use the third level EC number to guide their reaction
rules encoding where molecules and reaction operators are represented in the form of BEM
(bond-electron matrix), while GEM-path (Campodonico et al., 2014) and RetroPath2.0
(Delépine et al., 2017) utilize the simpler SMILES/SMARTS and SMIRKS system to encode
the compounds and transformations (Figure 3C). Most of the pathfinding algorithms use in-
house sets of transformation rules, for example, the latest version of BNICE has 361
generalized reaction rules, covering ~90% of KEGG 2014 (with 9000+ reactions).
Retropath2.0 and ReactPRED (Sivakumar et al., 2016) take a more data-driven approach.
ReactPRED automates the generation of transformation rules from user-defined reactions and
RetroPath2.0 systematically extracts thousands of transformation rules (expressed in
SMARTS/SMIRKS format) from 31527 reactions in the MetaNetX database, automatically
identifying the reaction centre and generating generalized reaction operator for each reaction
in the database. These automatically generated rules have specified more neighboring atoms
to the reaction centre and therefore are relatively less “general” as compared to reaction rules
in other methods (i.e., BNICE).
An update of the reaction rules database used in RetroPath2.0, RetroRules (Duigou et
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
al.,2018) extract over 400,000 rules from multiple public databases including MetaNetX
(Moretti et al., 2016), Brenda (Placzek et al., 2017), BiGG (King et al., 2015), KEGG
(Kanehisa and Goto, 2000), MetaCyc (Caspi et al., 2016), MetRxn (Kumar et al., 2012) etc.
While most currently used reaction rules include only substructures of the substrates and
products, the recently developed RING (Gupta et al., 2018), sets more constraints in the
transformation rules, for example, upper and lower bounds of size of each substrate, what
substructure must not be included in each reactant, etc.
Various network pruning methods have also been developed to reduce the redundant branches
(reactions) and nodes (substrates) in the pathway network. SimIndex and SimZyme (Pertusi
et al., 2014) screen intermediate compounds based on their structural similarities to the target
compound and the enzyme’s native substrates, respectively. GEM-Path uses estimated
reaction thermodynamics to guide the network search. SimPheny (BioPathway Predictor) sets
a bound for molecule size. Other approaches include calculating a penalty score based on
enzyme sequence consistency in Retropath2.0.
The network search algorithm (Figure 3D) most commonly used by different de novo
pathway prediction tools is breadth-first search (some with pathway finding heuristics)
starting from the target compound (as used in RetroPath 2.0 and Delepine et al., 2017). This
retrosynthetic approach works well for relatively large molecular structures. However, since
transformation rules cannot be used to predict the reactants and products at the same time,
retrosynthesis does not work as well for pathways that consist of decomposition reactions. In
addition to this retrosynthetic approach, breadth-first search starting from the source
compounds, depth-first search (as used in BNICE, Hatzimanikatis et al., 2005) and double
direction search from the both the source and target chemicals (used in Uri Alon
Biopathfinder, Noor et al., 2010) are also used in different pathway identification algorithms.
Breadth-first search starting from the source compounds (as used in ReTrace, Pitkanen et al.,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
2009) is applicable to identify pathways with complex pathway topology. The bottleneck of
this approach is the fast growth of the number of compounds (i.e., combinatorial explosion in
the hypergraph case) and relatively heavier computational burden. Likewise, double-direction
search from both the source and target chemicals (as used in Uri Alon Biopathfinder, Noor et
al., 2010) is a combination of substrates-side breadth-first-search and product-side
retrosynthesis and allows identification of pathways with complicated structures while saving
runtime to a great extent. Finally, depth-first search (as used in BNICE, Hatzimanikatis et al.,
2005) is an alternative method used by some pathway identification algorithms, yet it is
limited to simple graph pathway networks. To our knowledge, there is no existing pathway
finding algorithm that uses depth-first search to explore a hypergraph reaction network.
Several pathfinding webservers extended the search space of existing databases by applying
the graph-based pathway finding techniques, such as XTMS, ATLAS and MINE. XTMS
(Carbonell et al., 2014), for example, collects metabolites and reactions from MetaCyc and
EcoCyc (Caspi et al., 2016, Keseler et al., 2017) and then uses Retropath (Carbonell et al.,
2011; Carbonell et al., 2013) to identify intermediate heterologous reaction steps. On the
other hand, ATLAS (Hadadi et al., 2016) and MINE (Jeffryes et al., 2015) are hypothetical
databases built based on KEGG and BNICE computational frameworks.
One of the main challenges that still remain concerning pathway identification is to enhance
the prediction of linear pathways and to recognize branched pathways (using a hypergraph or
a bipartite graph network representation) (Figure 3B). Earlier pathway finding algorithms,
i.e., that use simple edges to represent reactions from one compound to another, are limited in
identifying linear pathways and are not capable of predicting branched pathways with more
complex topologies, such as, condensation or synthetic reaction steps. Retropath2.0 is already
an advance in this sense as it uses hypergraph network representation of the pathway
network, where the details of a reaction can be stored perfectly by a directed hyper-edge
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
connecting multiple substrates and products (Klamt et al., 2009). However, the limitation is
that it allows each promiscuous enzymatic reaction to take only one non-native substrate.
There are also other algorithms that try to identify branched pathways through merging linear
pathways, such as LPAT (Heath et al., 2011). However, to our knowledge, there is no current
computational method that uses hyper-edge reaction rules which would allow multiple non-
native substrates in the reaction. Though using hyper-edge rules will heavily increase the
computational burden, hyper-edge reaction rules will allow identification of literally all
potentially reachable compounds and broaden the chemical repertoire in the putative network.
2.2 Pathway Ranking and Selection
Pathway ranking, as shown in Figure 3E, is another important step in the biosynthetic
pathway design workflow. Resulting pathways from the identification step are usually very
numerous and cannot be analysed manually. These pathways are, therefore, ranked based on
pathway length, number of non-native reaction steps, and thermodynamic feasibility, in
which a more negative delta G is more favorable (Li et al., 2004). GSMM (Genome-scale
metabolic models) and FBA (flux balance analysis), which will be discussed in section 4.1,
are also performed in some methods to estimate the maximum product yield.
Some recently developed pathway design methods combine different pathway restraints and
factors and sum them into an overall score for pathway prioritization. For example, the
method proposed by Calhoun et al. (2018) includes molecular docking, cheminformatics,
similarity ensemble approach (SEA) and chemical transformation feasibility in the scoring
function they have developed for identifying pathways. Other possible approaches for
pathway ranking include assessing the toxicity of intermediate compounds and binding site
covalence (i.e., Cho et al., 2010).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Finally, artificial intelligence may bring new solutions to pathway discovery and ranking. A
recent computer-aided tool for chemical synthesis used a deep neural network trained on
millions of chemical reactions to rank transformation patterns and determine the feasible
synthesis method (Segler et al., 2018). Such tool uses Monte Carlo tree search and symbolic
artificial intelligence to learn from the database and rank the most promising retrosynthetic
steps, simulating the decision-making process of an expert. Although it was used in the
context of designing retrosynthesis steps for organic chemistry, it could also be applied for de
novo biochemical pathway design.
3. Enzyme discovery and optimization
A subsequent step following in-silico pathway design is to find suitable enzymes that will
make the consecutive chemical transformations predicted by the computational tools.
However, even considering that most of the prediction tools apply methods based on
experimentally identified enzymatic reaction rules, it is very likely that most of the pathways
will contain at least one unknown enzyme, which is especially the case concerning pathways
for non-natural chemicals (Campodonico et al., 2014).
In some scenarios, researchers will have an idea of which enzyme, or set of enzymes, is the
best candidate for making the target chemical (Lee et al., 2012). However, besides this
rational design, recent advances in enzyme engineering expanded the possibilities of
implementation of the in-silico pathway designs in microbial hosts. For example, directed
evolution is used to further optimize natural enzymes for increased activity and substrate
specificity (Arnold, 2017). Additionally, even if no enzyme is found for the proposed
reaction, related enzymes can be modified via rational design (Tiwari et al., 2012), or could,
potentially, be created de novo by using protein simulation algorithms (Zanghellini, 2014).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Nevertheless, it is worth to mention that while directed evolution is at an advanced stage of
development, there are not many successful cases of enzymes being created de novo despite
the recent progress in the field.
In this section, we will discuss genome mining for enzyme selection, de novo protein
engineering and directed evolution, focusing on the cutting-edge techniques that are allowing
the production of non-natural chemicals in microorganisms.
3.1 Genome mining for enzyme selection and optimization
A starting point for the construction of the pathway is to evaluate the subclass of the enzyme
predicted by the prediction algorithms and search in the available databases for enzymes that
would perform the proposed reaction. If a suitable enzyme for a determined substrate is not
found, there is still the possibility for the discovery of enzymes that may react,
promiscuously, with the desired substrate, since many enzymes act on multiple substrates
(Prather and Martin, 2008). Searching for enzymes in genome databases using only genetic
information and without a structure at hand is often called “genome mining” (Ziemert et al.,
2016) and precedes the process of expression and screening for performance. The best
candidates are then selected for combinatorial expression in a heterologous host.
One example of this concept of enzyme promiscuity screening being applied to pathways for
producing non-natural chemicals is the production of adipic acid via the muconic acid
pathway. Until recently, the enzyme needed to reduce both double bonds of muconic acid to
make adipic acid was unknown. To this end, Joo et al. (2017) cloned and screened about 25
genes from putative old yellow enzymes and 6 genes from putative enoate reductases (ER)
from GenBank database for activity against muconic acid. They found that at least 2 of such
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
ERs, from Bacillus coagulans and Clostidium acetobutylicum had excellent activity reducing
muconic acid, discovering the missing link of that pathway.
Another example is the 1,4 BDO production in E. coli using the synthetic pathway predicted
in-silico by SimPheny (Yim et al., 2011), already mentioned in section 2.1. The 7 step
pathway required 5 heterologous enzymes, that had to be screened from different
microorganisms, including cat2, sucD and 4hbd, from Porphyromonas gingivalis, sucA from
Mycobacterium bovis, and adhE2 from Clostridium acetobutylicum (Yim et al., 2011).
In-silico pathway prediction models can aid in this “screen and select” approach by
identifying and ranking candidate enzymes. Carbonell and Faulon (2010) created a method
for predicting catalytic and substrate promiscuity using gene sequence similarities and
supervised machine learning algorithms. In the same manner, the genome-wide method
PROPER (enzyme PROmiscuity PrEdictoR) predicts promiscuous activities of metabolic
genes (Oberhardt et al., 2016). For that, it uses a permissive PSI-BLAST approach to search
for distant gene sequence similarities and analyze phylogenetic trees to assign secondary
activities to enzymes. This method was used in combination with genome-scale metabolic
modeling (GEM-PROPER) to predict the effect of exchanging enzymes with promiscuous
analogues from alternative metabolic pathways. A next step to further advance those models
would be to validate the promiscuity score based on gene sequence correspondence by using
similarities in binding site topography.
3.2 Directed evolution and rational design of enzymes
If a suitable promiscuous enzyme cannot be found to complete the metabolic pathway, it is
necessary to design novel enzymes for such reactions that have not been identified to occur in
nature, for example, using directed evolution and rational design on enzymes (Figure 4A).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Directed evolution is an experimental technique used for protein engineering that imitates the
natural process of evolution, but in a considerably faster manner. For that, a large library
made from a starting enzyme mutant undergoes several rounds of iterative mutagenesis under
a selection pressure to elect the ones capable of best performing a desired function (Cobb et
al., 2013).
Directed evolution is a powerful tool in metabolic engineering as it allows researchers to
select for the enzyme with the right features, such as, improved activity in non-native
substrates, changing enantioselectivity, or even to create a new non-natural enzyme activity,
without having to devote resources into elucidating why those features make the enzyme
behave as desired (Packer and Liu, 2015; Arnold, 2017).
The first stage of directed evolution is obtaining the library of mutants. Several mutagenesis
methods have been developed for this purpose, including, random mutagenesis, site-directed
mutagenesis and site-saturated mutagenesis (Marcheschi et al., 2013). DNA shuffling, or
recombination, can also be used to make random combinatorial libraries. It involves
randomly fragmenting and rejoining DNA sequences and is often used in place of or in
addition to error-prone PCR to generate a diverse combinatorial library (Cobb et al., 2013).
Other random mutagenesis methods include transcription factor engineering, multiplex
genome engineering (MAGE), and are covered in the review by Marcheschi et al. (2013).
Directed evolution was used to adapt a series of enzymes to perform totally novel non-natural
reactions, including the enantioselective cyclopropanation of styrenes (Coelho et al., 2013), a
metal-oxo–mediated anti-Markovnikov oxidation of styrenes (Hammer et al., 2017) and an
asymmetric cyclopropanation using evolved myoglobin (Bajaj et al., 2016). The use of
directed evolution has also been used for the production of non-natural chemicals, such as the
non-natural carotenoid C50 astaxanthin (Furubayashi et al., 2015).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
The primary obstacle to any directed evolution work is the development of an adequately
sensitive and robust screen for the enzyme activity that is being selected for (Cobb et al.,
2013). The current gold standards for such measurements, gas and liquid chromatography
methods (GC and LC) and mass spectrometry (MS), are not scalable for high-throughput
operations due to factors such as reagent cost and signal to noise ratio (Dietrich et. al, 2010).
As the libraries generated for directed evolution are large (102 – 10
6 variants) (Dietrich et. al,
2010), these methods of measurement become the bottleneck of the process. While this
limitation can be overcome through the reduction of libraries by using rational design prior to
directed evolution (Cobb et al, 2013), such libraries may still be considered large for LC-MS
based methods.
Rational protein design, instead of creating random mutants straightaway, proposes a
preliminary in-silico screening, in which many combinations of amino acid exchanges would
be tested beforehand by computational simulation to identify the changes that are most likely
to succeed, thus creating a much smaller library of mutants to screen compared to random
mutagenesis. This method allows solutions with the mutation of dozens of amino acids at a
time. Several Molecular Dynamics (MD) simulations tools are available to analyse the
tridimensional enzyme structures and guide rational designs.
Recently, Walther et al. (2017) demonstrated the usefulness of rational design of enzymes in
the context of non-natural chemicals. They used molecular modeling and structural analysis
for engineering the enzymes needed for the production of the non-natural chemical 2,4
dehydroxybutyric acid (2,4 DHB): malate kinase, malate semialdehyde dehydrogenase and
malate semialdehyde reductase. Similarly, Tai et al. (2016) and Wang et al. (2017b) also used
structural analysis to redesign native 2-ketoacid decarboxylase and diol dehydratase,
respectively, for heterologous production of 1,4 BDO.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
3.3 High-throughput library screening with biosensors
As the systems biology-based techniques rely on the acquisition of large amounts of data, a
significant bottleneck is the screening and analysis. While the generation of mutant libraries
can be very high throughput, their characterization is limited by the use of lower throughput
techniques such as chromatography (e.g. HPLC, GC) and mass spectrometry (Rogers et al.,
2016; Skjoedt et al., 2016). An alternative method to overcome this limitation in screening is
the development of biosensors to enable the screening or selection of massive libraries, for
example, in directed evolution (Rogers and Church, 2016; Rogers et al., 2016).
Biosensors are based on the use of biological mechanism of molecular recognition for a target
chemical. In principle, a signal molecule, that can be detected by color, fluorescence, or
which ensures cell fitness, is coupled with the presence of the target chemical (Rogers et al.,
2016).
Thus, biosensors can be used to screen libraries of enzymes when searching for novel
biocatalysts for the purposes of host development and optimization (Figure 4B). For example,
biosensors expressed in vivo can help identify the highest producers of a certain non-natural
product in a set of clones obtained from metabolic engineering strategies, such as, the
introduction of a heterologous production pathway (Snoek et al., 2018). Additionally,
biosensors can be applied to the identification of the best-performing enzymes to react on a
given substrate (Ho et al., 2018).
Most biosensors are based on either a transcription factor or a sensor-actuator system. For
example, when applied to the production of non-natural chemicals, the chemical produced
serves as the input signal, which upon transcription factor recognition will result in the
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
expression of an actuator component such as a reporter gene (e.g. GFP) or an antibiotic
resistance gene that will enable survival (Dietrich et al., 2013; Li et al., 2015; Leavitt et al.,
2017; Snoek et al., 2018).
Ligand-recognition orthogonality is a key feature of any biosensor. When a biosensor is
designed to respond to a non-natural chemical that is being produced, the system has an
inherent element of orthogonality since, in theory, no other chemical should be detected.
However, cross-reactivity with highly similar molecules is possible and needs to be addressed
through methods such as negative selection. Dixon et al. (2010) strategically mutated a
purine-binding riboswitch native to E. coli by targeting the most important base pairs
involved in ligand binding. They were able to construct two mutant switches with high
selectivity for ammeline and azacytosine (two non-natural ligands in said organism) and
virtually none towards the original purine ligands (Dixon et al., 2010).
Another biosensor feature that is promising for future high-throughput screens is the addition
of levels of control by changing media condition at a given time point. Snoek et al. (2018)
developed a biosensor-selector system which responds to cis, cis-muconic acid concentration
by inducing a G418 resistance-conferring gene, with the false positive rate being altered by
changing the media conditions. Analogously, Chen et al. (2018) showed in their study that
response time of their putrescine-detecting biosensor was dependent on the nutrients
available in the media, particularly with regards to nitrogen source. De Paepe et al. (2017)
and Zhang et al. (2015) provide an in-depth review of the different biosensor strategies and
engineering techniques that can be used to alter biosensor binding specificity as well as the
operational and dynamic ranges.
Higher biosensor efficiency and response times are also possible by using cis-acting RNA
biosensors, also known as riboswitches. These systems work on a faster time scale, since only
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
transcription of the encoding RNA is necessary and there are no regulatory proteins to be
translated. Essentially, riboswitches consist of three main components: an aptamer (sensor), a
reporter or selector gene (actuator), and a bridge that links the two and promotes structural
stability (Qian and Cirino, 2016). Following a binding event, a conformational change occurs
in the three-dimensional structure of the aptamer component, leading to an exposed
ribosomal binding site (RBS) upstream of the actuator locus (Qian and Cirino, 2016). Gene
expression is hence activated by the production of the target chemical, which could be
cofactors, metal ions or amino-acids.
Recently, efforts from Jang et al. (2017) resulted in the development of a bacterial riboswitch
which activates the expression of a GFP reporter upon detection of naringenin, a flavonoid
molecule of plant origin. This riboswitch features a tetracycline resistance marker upstream
of the reporter, thus adding another layer of control (Jang et al., 2017). By adding tetracycline
to the media, the system is thus able to report which strains are the highest producers while
enriching the resultant library of clones for those which contain the encoding construct (Jang
et al., 2017).
Encoding the biosensor in a separate construct can be exploited to achieve higher biosensor
modularity, making the strain optimization faster. Building on the naringenin riboswitch, Xiu
et al. (2017) developed a co-culture version of the biosensor with the biosynthetic pathway
and biosensor components divided into two separate modules. Thus, each of the two bacterial
strains in the community carries one of the two components, minimizing the negative impact
on one of the components when the other is altered or optimized.
3.4 De novo protein engineering
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Considering that the average size of a protein is 200 amino acid residues, and that there are
20 different natural amino acids, the number of diverse amino acid sequences that could
possibly occur is in the order of 20200
. However, the number of distinct proteins that are
produced by existing organisms is only about 1012
. From this, one can deduce that evolution
has explored and selected just a small fraction of the array of all possible protein structures
(Huang et al., 2015).
In this context, de novo protein engineering is useful to generate and explore proteins that do
not exist in nature, including entirely novel enzymes to catalyse non-natural reactions. This
promising approach is an ultimate tool to fill the biochemical gaps in the in-silico pathway
designs that could not be completed by standard enzyme searching or re-engineering existing
enzymes.
De novo protein engineering attempts to create artificial proteins using computational tools
that predict chemical reactions, interactions and tridimensional structures using the principles
of molecular dynamics, quantum mechanics and protein biophysics. Several reviews take a
deeper look at this topic, including Kiss et al. (2013), Zanghellini (2014), Wolfson et al.
(2015) and Huang et al. (2015).
One of the most relevant approach for de novo enzyme design is the “inside out” method, as
revised by Kiss et al. (2013) and Zanghellini (2014). This method consists of four steps,
illustrated in Figure 4C. The first step is to elect an appropriate catalytic mechanism, establish
the transition state of the reaction, and then use quantum mechanics calculations to choose
the tridimensional arrangements of amino acids residues around the transition state that
maximizes its stability. This arrangement is, thus, an idealized active site, also named
“theozyme”, or theoretical enzyme (Kiss et al., 2013). Next, RosettaMatch algorithm is used
to match the putative active site with different protein backbone scaffolds, stored in its
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
database, which can support the geometry of the theoretical structural pocket. Subsequently,
RosettaDesign (Liu and Kuhlman, 2006) algorithm is used to optimize the identity and
position of the amino acid residues surrounding the transition state to both stabilize its
conformation and to maximize the affinities of the residues with the transition state. Finally,
as several possible designs are created for each transition state structure, the designs are
ranked based on geometrical compatibility with substrates and products, as well as on
transition-state binding energy.
The pioneering studies of Jiang et al., (2008) and Röthlisberger et al. (2008) used this
approach to successfully expand the range of possible enzymatic reactions by creating totally
novel enzymes. Röthlisberger et al. (2008) computationally designed a protein catalyst for
Kemp elimination. Their designed enzyme performs the Kemp elimination of 5-
nitrobenzisoxazole, where a proton (H) transfer from carbon, simultaneously with a cut of the
nitrogen–oxygen (N-O) bond, resulted in the product cyanophenol. Jiang et al., (2008)
designed retro-aldolases that use four different catalytic motifs to catalyze the breaking of a
carbon-carbon bond in a non-natural substrate, in this case, 4-hydroxy-4-(6-methoxy-2-
naphthyl)-2-butanone. Since then, the same “inside out” approach was used to create a vast
range of enzymes for non-natural reactions, including the Diels-Alder reaction (Siegel et al.,
2010) and Morita−Baylis−Hillman reactions (Bjelic et al., 2013). Zastrow and Pecoraro
(2014) further extended the range of novel catalytic proteins to the more complex
metalloenzymes, by creating an enzyme for p-nitrophenyl acetate hydrolysis that uses two
different metal ions: a Zn(II) ion, which is important for catalytic activity, and a Hg(II) ion,
which provides structural stability.
Two of the latest challenges for de novo protein design, the creation of novel β-barrel proteins
and the introduction of binding sites for small molecules, were also recently addressed. Dou
et al (2018) remarkably created for the first time a novel and functional β-barrel protein, as
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
well as the first full de novo design of a small-molecule-binding protein. Their protein
construct binds a chromophore DFHBI (3,5-difluoro-4-hydroxy-benzylidene imidazolinone)
and was successfully expressed in E. coli, yeast and mammalian cells. Besides the proven
usefulness of the designed protein for fluorescence-based imaging and sensing applications,
their work was a proof of concept that the field of de novo protein design is accelerating
towards increasingly advanced ligand-binding proteins, sensors and catalysts.
In conclusion, while rational design and de novo protein engineering are still the most
difficult ways to obtain a new enzyme due to the requirement of sophisticated computational
methods, recent examples of successfully applied rational and de novo protein designs are
bringing new prospects for the field. We also expect that further advances in the automated
workflows based on biosensors and robotics in recently established biofoundries will aid the
analysis required for the discovery and engineering of novel catalysts by these different
protein engineering techniques.
4. Host optimization
Host optimization is critical for transferring the production concept from laboratory scale to
commercialization. Even after prediction, ranking and selection of a suitable synthetic
pathway, with the most favorable thermodynamic and minimal reaction steps, and, despite
properly expressing the pathway enzymes and optimizing enzyme activity, frequently,
synthetic pathways for non-natural chemicals are not economically feasible. Recurrent
problems that challenge those pathways are related to energy and cofactors imbalance,
carbons diverted to by-products, cell burden caused by overexpression of the pathway,
toxicity of products and intermediates and so forth. A more comprehensive view of the cell’s
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
interactions and behaviours is thus warranted and hence, there is need to leverage genomic
scale metabolic models (Mahadevan et al., 2005).
Ideally, the host strain would divert as much flux as possible to the desired production
pathway. However, the excess demand for carbon, cofactors and energy for the pathway may
impose a burden for the cell, decreasing growth and cell fitness (Chubukov et al., 2016).
Thus, cofactor usage and generation, substrate consumption, strain oxygenation conditions
and energy balances are parameters that must be considered.
Aside from computational methods such as metabolic modeling, there are various concepts
within systems biology which can be experimentally implemented. Several strategies can be
used in the framework of GSMMs, including finding growth-coupled production (section
4.1.2; Klamt and Mahadevan, 2015) or orthogonality strategies (section 4.4; Pandit et al.,
2017). On the other hand, to address the toxicity of products and intermediates, a good
strategy is to engineer the regulation machinery of the cell by applying the concept of
dynamic control (section 4.3; Chen and Liu, 2018), while also balancing the pathway
expression.
One example of the importance of host optimization is demonstrated by the work of Yim et
al. (2011). They were the first to demonstrate the production of the non-natural chemical 1,4
BDO in vivo, with the initial strain producing 1.2 g/L in 50 h. However, their research group
raised the productivity to about 80-90 g/L by applying genome-scale metabolic models
(GSMMs) and flux balance analysis (FBA) to identify targets for gene deletion, increasing
the flux to products (Burgard et al., 2016; Andreozzi et al., 2016). In this section, we review
how these systems biology concepts can be applied for host optimization.
4.1 Genome-scale models and analysis
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
With the onset of whole-genome sequencing arose the development of genome-scale
metabolic models (GSMMs), which are powerful tools that facilitate the in-silico study of
metabolism. The models are mathematically represented by the stoichiometry of all the
reactions in the metabolic network of an organism (O’Brien et al., 2015; Lee and Kim, 2015).
Developments in next-generation sequencing have allowed faster sequencing than ever
before, and combined with other technologies, has greatly increased the availability and
speed at which GSMMs are constructed (Monk et al., 2014; Henry et al., 2010). Due to the
relative simplicity of GSMMs, exogenous metabolites, reactions and genes may be added
relatively easily using model manipulation platforms such as COBRA Toolbox and may
include gene-protein-reactions (GPRs) (Reed et al., 2003). In addition, large libraries of
GSMMs, such as BiGG Models (King et al., 2015), and open source platforms such as KBase
(Arkin et al., 2018), have made accessing and manipulating metabolic models a simple task.
4.1.1 Phenotype Simulation Methods
An important step in the pipeline for synthetic biology systems is phenotype simulation.
Here, GSMMs are used along with constraint-based modeling (CBM) methods to predict
phenotypic traits such as growth rate and reaction fluxes under various conditions. At this
point, all CBM methods rely on the assumption of an intracellular steady state in terms of
metabolite concentrations. The most commonly used CBM method for phenotype simulations
is Flux Balance Analysis (FBA) (Orth et al. 2010; O’Brien et al., 2015; Lewis et al., 2012).
This method is often used under the assumption that organisms will naturally operate with the
objective of maximizing growth. Other methods that assume different objective functions,
including minimization of metabolic adjustment (MOMA) (Segre et al., 2002), regulatory
on/off minimization (ROOM) (Shlomi et al., 2005), relative change (RELATCH) (Kim et al.,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
2011), and maximum entropy (MAXENT) (Martino and Martino, 2018) have also been
shown to be effective. There are additionally several variants of FBA that help to achieve
different objectives, including minimization of network flux (Lewis et al., 2010),
consideration of dynamic systems (Mahadevan et al., 2002), including both dynamics and
enzyme usage (Jabarivelisdeh and Waldherr, 2018), and incorporation of regulatory
information (Covert et al., 2001; Shlomi et al., 2007). These methods may be accessed
through open-source platforms such as the Constrain-Based Reconstruction and Analysis
(COBRA) Toolbox (Schellenberger et al., 2011), COBRApy (Ebrahim et al., 2013), Cameo
(Cardoso et al., 2018) and the Portable System for the Analysis of Metabolic Models
(PSAMM) (Steffensen et al., 2016).
4.1.2 Computational Strain Optimization Methods
To achieve a certain network functionality, such as, the increase of metabolic flux through a
non-natural pathway, Computational Strain Optimization Methods (CSOMs) can be used to
identify genetic perturbations such as genetic knockouts, over/under expression of genes,
heterologous insertions and cofactor specificity swapping. Such methods attempt to achieve a
desired network functionality by taking an optimization approach to solutions, as opposed to
complete enumeration of all possible results (Maia et al., 2015). Each of these methods
generally requires the use of a metabolic model as well as the implementation of a phenotype
simulation method. The most important CSOMs are presented in Figure 5.
Several CSOMs have been published, however, this review will focus primarily on
constraint-based methods because they can be built directly from stoichiometric data, which
is, in turn, easily obtained from genome sequencing (Maia et al., 2015). As opposed to the
traditional building of manually curated GSMMs, the open-source platform CarveMe
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
presents a top-down automated methodology of genomic reconstruction that uses a manually
curated universal model which is then combined with annotated genomes to create species-
specific or community models in a fast and scalable manner (Machado et al., 2018).
Growth-coupled pathway design
Synthetic chemical production pathways often impose a metabolic burden on the organism.
Consequently, within a population, the strain that loses the functionality of the synthetic
pathway may outcompete the producing strains. One strain optimization approach that is
frequently used to circumvent this disadvantage is to couple growth with the synthetic
pathway.
This strategy will ensure that product will be consistently produced over time, meaning that
growth, or even the survival of the cell, can only happen if there is flux though the synthetic
pathway. In addition, it is possible to use adaptive laboratory evolution to select for
increasing chemical production capacity while screening for maximum growth (Conrad et al.,
2011).
The principle of this strain optimization approach is to use CSOMs, such as OptKnock, to
find deletion strategies that either force flux through the synthetic pathway to balance energy
and redox cofactors, or, to ensure the synthesis of biomass precursors, for example, by
reducing the cell’s ability to synthetize building block metabolites. As a result, the growth of
the organism itself becomes the driving force of chemical production (Klamt and Mahadevan,
2015).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
The coupling can be either weak, or strong. Weak coupling happens when satisfactory
product yield is only obtained if the cell grows with maximal or close-to-maximal biomass
yield. Strong coupling, otherwise, means that chemical production must happen, even without
growth, to ensure cell’s survival, (Klamt and Mahadevan, 2015). A depiction of the
production envelopes for strong and weak growth coupling is shown in Figure 5A. It has been
shown by Klamt and Mahadevan (2015) that, under certain conditions, any metabolite of the
central metabolism can be strongly coupled in E. coli. Von Kamp and Klamt (2017) further
validated this theory in S. cerevisae, Corynebacterium glutamicum, Aspergillus niger and the
cyanobacterium Synechocystis sp..
Although there are not many examples of this strategy being applied to producing non-natural
compounds, its suitability has been recently demonstrated for coupling the production of a
series of natural compounds in several different microorganisms, both aerobically and
anaerobically.
FBA-based methods
The first CSOM that predicts growth-coupled designs to be developed was OptKnock, which
suggests a set of reaction knockouts that maximize production of a target metabolite (Burgard
et al., 2003). While the solutions proposed by OptKnock are growth-coupled, this coupling is
often weak. Some methods that produce strong growth-coupled strategies were developed to
address this and are discussed in this review. As abovementioned, a representation of
OptKnock, as well as the difference between strong and weak coupling, are shown in Figure
5A.
Methods such as OptKnock produce solutions which are considered to be “optimistic”, as the
optimal solution is used as the operating point in terms of the engineering objective. This
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
issue was addressed through RobustKnock, which considers alternate solutions that could
divert flux from the target compound, as well as by the pair of methods P-OptKnock and P-
ROOM, which assume a “worst-case scenario” for the engineering objective (Tepper and
Shlomi, 2009), (Apaydin et al., 2017). The first method to introduce gene over and under
expression alongside genetic knockouts was OptReg (Pharkya and Maranas, 2006). Later, an
algorithm termed Enhancing Metabolism with Iterative Linear Optimization (EMILiO) was
formulated by Yang et al. (2011), which provided results similar to OptReg but using a
computationally scalable approach. Methods that suggest heterologous reactions as genetic
perturbations include OptStrain and SimOptStrain, which calculate heterologous additions
and knockouts separately and simultaneously, respectively (Pharkya, 2004; Kim et al., 2011).
A final method worth mentioning is OptSwap, which is one of the few CSOMs that predicts
modulations in cofactor specificity alongside genetic knockouts (King and Feist, 2013).
EMA-Based Methods
CSOMs relying on Elementary Mode Analysis (EMA) are powerful methods that compute
sets of elementary flux modes (EMs), which are non-decomposable sets of reactions in a
network (Ruckerbauer et al., 2015). A minimal cut set (MCS) is defined as an irreducible set
of reactions based on elementary flux modes of which when inactivated, results in metabolic
network failure for a particular function (Klamt and Gilles, 2004). In application, these
undesired modes can include any network arrangement in which a minimum yield for some
target product is not achieved. In effect, elimination of the undesired modes leads to strong
growth coupling. One can also include additional constraints, such as to biomass yield, to
select for desired modes in the form of constrained minimal cut sets (cMCSs) (Hädicke and
Klamt, 2011). A depiction of the solution spaces for both MCS and cMCS is shown in Figure
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
5B and C. The results of these methods are given as a list of genetic knockout sets that
produce growth-coupled strategies. Recently, these methods have been adapted to incorporate
two-stage fermentations strategies, termed the Metabolic Valve Enumerator (MoVE)
(Venayak et al, 2018a). MoVE calculates a set of metabolic valves, which are reactions
expressed during a stage of growth then turned off during a stage of production, as well as a
set of reaction knockouts which remain inactive throughout both stages. The result is a strain
that achieves a minimal biomass yield during the growth stage, and a strong growth-coupled
production of a target compound during the production stage, as is shown in Figure 5D.
Methods based on these principles may be used in conjunction with the CellNetAnalyzer
framework (von Kamp and Klamt, 2017).
The scalability of MCS and other EM-based methods on GSMMs is poor due to the
computational nature of mixed integer linear programming (MILP) problems. The
MCSEnumerator was developed to produce MCSs from the smallest number of perturbations
to the largest. This is done by enumerating all elementary modes, mapping the MCSs to the
modes and then determining which EMs are the smallest (von Kamp and Klamt, 2014).
Regulatory information was incorporated to eliminate results containing non-enzymatic or
non-annotated reactions, denoted as regulatory constrained MCS (rcMCS) (Jungreuthmayer
and Zanghellini, 2012), while constrained regulatory MCS (cRegMCS) was developed to
consider both genetic knockouts and over/underexpression of genes, resulting in more
solutions requiring fewer genetic perturbations (Mahadevan et al., 2015). Whereas most
MCS-based methods are approached at the reaction-level, many reactions in GSMM are poor
targets for experimental knockouts, which was an issue addressed by genetic MCS (gMCS), a
method which incorporates gene expression data (Apaolaza et al., 2017). Attempts to sort
through cMCSs by pure strain performance such as genetic algorithm MCS (GA-MCS) have
been published, but their efficacy on GSMMs has not yet been shown (Nair et al., 2015). An
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
EMA-based method that does not focus on MCSs is the modular cell (MODCELL)
framework, which aims to design a strain that is auxotrophic for some exchangeable
production mode that would be coupled to target compound production and growth (Trinh et
al., 2015). In addressing the computational burden of EMA-based methods, it was shown
that alternative integer linear programming (AILP) could be used to significantly reduce the
computational time required in calculating MCSs (Song et al., 2017).
Metaheuristic methods
Metaheuristic CSOMs are advantageous in that they are much less computationally expensive
than FBA- and EMA-based methods. This is due to the decoupling of the biological and
engineering objective functions, allowing for independent calculations for each. This is
beneficial, as computational power is made readily available for more complex phenotype
simulation methods. Despite the benefits of metaheuristic methods, due to their nature these
methods cannot guarantee an optimal solution. A schematic of the typical workflow for
metaheuristic methods is shown in Figure 5E. The first metaheuristic method that was
published was OptGene, which uses a genetic algorithm modeled to evolutionary theory to
predict sets of genetic knockouts (Patil et al., 2005). Other methods incorporate the presence
of noise (Costanza et al., 2012), build iteratively upon previous solutions (Lun et al., 2009;
Rockwell et al., 2013), or consider shadow price analysis (Ohno et al., 2014). Metaheuristic
methods hold promise in the design of organisms that produce non-natural chemicals due to
the facilitated scalability with GSMMs.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
4.2 Kinetic Modeling for Strain Improvement
As previously mentioned, GSMMs are uncomplicated since they incorporate only
stoichiometric data. While this has some advantages, it also makes them limited in terms of
bioengineering applications. While stoichiometric models assume a quasi-steady state and set
the change in metabolite concentrations with time to zero, kinetic models do not make this
assumption. This leads to the requirement to incorporate enzymatic expression and kinetic
data as well as other dynamics that are not captured by stoichiometric models. Kinetic models
may therefore be advantageous in the analysis of transient behaviours of strains that are often
difficult to estimate.
As an alternative to traditional constraint-based models, kinetic models further include
physiologically relevant data, such as time-dependent metabolite concentrations and
enzymatic regulation. This enables the dynamic analysis of biological systems for enhanced
in-silico hypothesis generation. Kinetic models, however, suffer from the burden of
estimating several parameters which are usually not readily available, so the number of
reactions that they explicitly describe is often mostly limited to core metabolism (Srinivasan
et al, 2015). However, it is expected that as metabolomic, proteomic, and transcriptomic data
becomes more accessible under various conditions, kinetic models will become more useful.
Machine learning algorithms have been suggested as a promising solution to the issue of
parameterization. The ORACLE (Optimization and Risk Analysis of Complex Living
Entities) framework effectively integrates kinetics with thermodynamic, omics data (such as
fluxomics and metabolomics), along with network stoichiometry is depicted in Figure 5E
(Miskovic and Hatzimanikatis, 2010). ORACLE was successfully implemented by Andreozzi
et al. (2016) to predict a modular pathway for the biosynthesis of 1,4-BDO in E. coli. Other
machine learning algorithms were also applied to build kinetic models for the production of
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
limonene and isopentenol in E. coli, giving qualitatively accurate results (Costello and
Martin, 2018).
4.3 Dynamic control and pathway balancing
A frequent difficulty in the metabolic engineering of microorganisms is achieving the finest
control possible over the production of the target chemical (natural or non-natural). Ideally, at
any given time, an efficient cell should modulate its level of chemical production in
accordance with its environmental conditions and intracellular metabolic state. Thus, the idea
behind pathway balancing is to optimize the inadequacies that can arise from the introduction
of heterologous pathways.
Traditional methods of pathway balancing often resort to static strategies like pathway
overexpression, gene deletions to deviate fluxes, or promoter engineering for altering
expression levels (Xu et al., 2014). The most widely recognized balancing strategy is
preventing the accumulation of problematic metabolites by increasing its downstream fluxes
while decreasing upstream ones (Brockman and Prather, 2015; Jones et al., 2015). However,
such modifications only scratch the surface, as production relies on cell fitness, which
depends on the organism’s ability to sense and respond to recurrent environmental
perturbations (Venayak et al., 2015; Dahl et al., 2013; Ewald et al., 2017; Leonard et al.,
2010). Overexpression can also have negative implications, such as growth inhibition (Liu et
al., 2015; Xu, 2018). Jones et al. (2015) provide a more detailed review on the existent
approaches for pathway balancing, including ribosomal binding site engineering, post-
translational modifications, and others mentioned previously.
Accordingly, there is an existing consensus that cells with dynamic and automated self-
regulation are better candidates for industrial use and scale-up (Ewald et al., 2017). The most
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
prominent form of dynamic control uses genetic circuits based on inducible and modular
gene expression, usually featuring activation and repression of activity of a particular
transcription factor (Liu et al., 2015; Rantasalo et al., 2018; Xu et al., 2014). An advantage of
this system is the high specificity of transcription factors to recognize an input signal; for
example, the presence of a metabolite or a change in temperature or other condition (Xu,
2018). Gene circuits that modulate pathway overexpression according to the levels of
available product lead to enhanced survival and improved titers (Liu et al., 2015). Other
studies use metabolic switches for the separate modulation of expression of upstream and
downstream reactions to a metabolite according to its abundance (Xu et al., 2014). This better
manages cellular resources but can also respond to the need for optimized fermentations
which typically require multiple steps with different substrates or oxygen uptake rates, and in
which either growth or production are prioritized at different times (Figure 6C). Two-state
(bistable) dynamic control provided by tools such as genetic toggle switches facilitates such
fermentation protocols by quickly turning off and restoring wild type growth phenotypes
when production is not prioritized, thereby minimizing the cellular effects of growth burden
(Anesiadis et al., 2013; Bothfeld et al., 2017; Venayak et al., 2018b). The central idea
underlying the concept of dynamic control is the temporal segregation of cellular growth and
bioproduction pathways, so that both are optimized in such a way that the trade-off that
usually exists between them is minimized (Figure 6C). This approach allows the same strain
to maximize yield from a given substrate and minimize production of undesired by-products,
while still growing unaffected by the detrimental effects a bioproduction pathway would
typically impose (Bothfeld et al., 2017).
Dynamic control has been implemented for the production of some high-value non-natural
chemicals over the years, although its use is still infrequent compared to natural chemical
production. Liu et al. (2015) introduced a 1,4-BDO production pathway into E. coli and
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
demonstrated that a two-part biosynthetic strategy with separate carbon fluxes for production
and dynamic regulation achieved autonomous production. Their system not only increases the
efficiency of 1,4-BDO synthesis but is also inherently trivial to optimize given the low degree
of commonalities between the production and regulatory pathways (Liu et al., 2015). This last
feature is known as orthogonality and is discussed in the next section in more detail, as well
as in the context of the Liu et al. (2015). Self-regulation has also been explored to prevent
toxic intermediate accumulation in the mevalonate pathway, which can be used to make a
variety of terpene-derived synthetic drugs (Dahl et al., 2013). Other technologies such as
CRISPR-based repressible promoter regulation are also used in the design of genetic circuits
for minimized crosstalk and high responsiveness, which is directly applicable to non-natural
chemical production (Didovyk et al., 2016).
Dynamic regulation for the production of non-natural chemicals is still not widespread.
However, given the ineffectiveness of static methods alone and the great advantages
automated control presents for host optimization, it could become useful to achieve more
efficient and competitive bioproduction (Zhang et al., 2015; Wang et al., 2017c; Wang et al.,
2018). Recent innovations in dynamic control include the use of gene circuit toolboxes and
the integration of multiple gene circuits for more robust and even more sophisticated
expression. Despite being promising for scale-up, only a few recent efforts have attempted
such a task (Rantasalo et al., 2018). Solutions to the issue of using chemical inducers have
also been explored, since toxic effects and low responsiveness are relevant concerns. For
example, the use of electromagnetic radiation has been considered for dynamic gene
expression, as it can provide a proven, quick, and highly reversible signal (Salinas et al.,
2017).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
4.4 Orthogonality principles applied to pathway design
Another strategy to decouple production from cellular growth is the use of orthogonality
principles. Metabolic pathways are orthogonal when they operate with minimal interference
between the synthetic pathway and the biomass-producing native pathways. In principle,
those pathways do not share any enzymes and may be connected by a single metabolite
(Pandit et al., 2017). This metabolite can be used as a precursor for either biomass or
chemical production, and it is possible to design strategies that introduce metabolic “valves”
to isolate the growth and the chemical production pathways in a parallel fashion (Pandit et al.,
2017). The more enzymes these pathways share, the more opportunity for cross-talk between
them will exist (Figure 6A). On the other hand, by minimizing the common reactions
between the two network branches, it is possible to create a more orthogonal system (Figure
6B). Orthogonal networks can bypass large portions of native pathways transforming sugars
directly to chemicals, or yet, include junctions or valves through which precursors can be
used for growth, and, when convenient, can be diverged for chemical production. Such
pathways are considered to be growth-independent, meaning that orthogonality is contrary to
growth-coupling.
Accounting for substrate orthogonality is also important. Metabolites like glucose or fatty
acids are highly involved in core metabolism, making more orthogonal substrates such as
ethylene glycol and xylose more desirable (Pandit et al., 2017). Xylose-based orthogonal
pathways could be beneficial for future non-natural chemical production, since it is
extensively found in agricultural residues as part of hemicellulose and does not compete with
the food supply. One example of this concept being applied in the production of non-natural
chemicals is presented by Liu and Lu (2015). In this work, they produced 1,4 BDO from
xylose in E. coli. As this microbe is naturally able to catabolise xylose, they deleted three
native genes encoding common enzymes to make the synthetic pathway orthogonal to central
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
metabolism. Their strategy was to use xylose for chemical production and other carbon
sources for cellular growth. A quorum-sensing mechanism was also exploited for production
control, and the resulting strain was able to produce of 1,4 BDO autonomously, although in
quite low amounts (Liu and Lu, 2015).
A problem that would predictably arise from the construction of highly integrated systems,
such as gene circuits, is the need for greater orthogonality in their design. A lack of
orthogonality can lead to unexpected imbalances in expression modulation due to
interference of certain DNA-binding factors with the successful association of the correct
factor-promoter complexes, thereby defeating the purpose of dynamic circuits. This problem
was handled by Rantasalo et al. (2018) by the creation of a transcription factor-promoter
toolbox library. In that work, they characterized different promoter regions and engineered
transcription regulators for optimized pathway expression in S. cerevisiae, and then used it to
develop a memory-enabled genetic switch able to induce and repress production according to
need. The key feature of their transcription factor-promoter toolbox library is the high
orthogonality between the proteins and their respective binding sites.
Another promising strategy for the implementation of orthogonality in metabolic engineering
is the use of non-natural cofactors to create orthogonal gene circuits for energy transfer.
Redox cofactor balancing is well-studied, and, in most cases, involves changing the
specificity of proteins to use a different electron carrier (e.g. NAD instead of NADP) or
overproducing the required cofactor (Lee et al., 2012). However, natural cofactors, such as
NAD, link multiple energy transfer modules and metabolic circuits, and thus, concerns have
been raised about the use of natural cofactors for production pathways, given their low
orthogonality (Figure 7A). The use of synthetic cofactors for highly selective energy transfer
has hence been proposed as an orthogonal alternative that further segregates production and
central metabolism at the pathway level (Figure 7B). Such synthetic cofactor-linked circuits
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
contain two parts; one which strips the electrons from the energy carrier and another which
replaces them. The system proposed by Wang et al. (2017c) uses phosphite dehydrogenase
and malate dehydrogenase to recycle the synthetic cofactor nicotinamide cytosine
dinucleotide or NCD (Figure 7C). This way, competition for natural cofactors (i.e. NAD)
does not affect non-natural chemical production pathways, facilitating strain optimization.
Orthogonal pathway construction also has a place in the optimization of production of non-
natural metabolites, such as synthetic amino acids, for in vivo reactions mediated by
recombinant proteins. Ma et al. (2018) constructed a heterologous and orthogonal pathway
for azidohomoalanine production in E. coli, an unconventional amino acid with
biotechnological applications. Such strategies could be repurposed to produce completely
synthetic amino acids unheard of in nature, thus helping to reduce bioproduction costs and
meet demands for new non-natural chemical and biochemical products (Völler and Budisa,
2017).
5. Conclusion
In this review, we highlighted the emerging case studies for the direct production of non-
natural chemicals in microorganisms. In fact, this shift to non-natural chemical production
has intensified in the past 5 years, with more than 40 different compounds now demonstrated
in laboratory scale. In addition, at least one of them, 1,4 BDO, is being produced using a
commercial scale bioprocess.
In fact, a variety of new computational and experimental systems biology tools are available
for metabolic engineering and can significantly influence the development of hosts for non-
natural chemical production and feasibility of its commercialization. Such tools include
biosensors, high throughput robotic screening platforms, de novo protein design, directed
evolution and dynamic control methods based on orthogonality. However, not all tools are as
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
impactful on microbial production. For instance, computational strain optimization methods
have the potential to multiply chemical production several times. On the other hand, pathway
balancing sometimes results only in a small increase in the overall production and it is only
practical once the appropriate enzymes have been selected and implemented successfully.
Finally, chemical production is absolutely absent without the precise enzymes to catalyze the
reactions involved in the designed pathway. Hence, finding the required enzymes is critical
for implementing synthetic pathways.
In fact, while in-silico pathway design is sufficiently advanced to be used to find pathways
for virtually any organic chemical, one of the main hindrances for developing new pathways
for non-natural chemicals is the discovery and engineering of novel enzymes encoding the
non-natural activity. Usually there is high prospect for finding promiscuous enzymes or for
adapting a natural enzyme for the desired reaction. However, if no enzyme could be selected
then, having to resort for de novo enzyme design results in an enormous challenge. Due to
this difficulty of engineering non-natural enzymes, the combination of synthetic pathways
with chemical transformations still might be required for some cases (Lee et al., 2019).
However, it is clear that many of the advanced methods for host optimization have not yet
been used in most studies of non-natural chemical bioproduction. Hence, there is an
opportunity to integrate such approaches for host optimization with the emerging automated
robotic workflows to accelerate the development of new microbial cell factories.
Acknowledgements:
This work was supported by the Natural Sciences and Engineering Research Council
(NSERC), the NSERC Industrial Biocatalysis Network (IBN), Biochemicals from Cellulosic
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Biomass (BioCeB) grant from the Ontario Research Fund (Research Excellence) and a grant
from the Genome Canada Genomics Applied Partnership Program (GAPP).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
References
Andreozzi, S., Chakrabarti, A., Soh, K.C., Burgard, A., Yang, T.H., Van Dien, S., Miskovic,
L., Hatzimanikatis, V., 2016. Identification of metabolic engineering targets for the
enhancement of 1,4-butanediol production in recombinant E. coli using large-scale kinetic
models. Metab. Eng. 35, 148–159. doi:10.1016/j.ymben.2016.01.009
Anesiadis, N., Kobayashi, H., Cluett, W.R., Mahadevan, R., 2013. Analysis and Design of a
Genetic Circuit for Dynamic Metabolic Engineering. ACS Syn. Bio. 2, 442–452.
doi:10.1021/sb300129j
Apaolaza, I., San José-Eneriz, E., Tobalina, L., Miranda, E., Garate, L., Agirre, X., Prósper,
F., Planes, F.J., 2017. An in-silico approach to predict and exploit synthetic lethality in cancer
metabolism. Nat. Commun. 8. doi:10.1038/s41467-017-00555-y
Apaydin, M., Xu, L., Zeng, B., Qian, X., 2017. Robust mutant strain design by pessimistic
optimization. BMC Genomics 18. doi:10.1186/s12864-017-4025-7
Arkin, A.P., Cottingham, R.W., Henry, C.S., Harris, N.L., Stevens, R.L., et al., 2018. KBase:
The United States Department of Energy Systems Biology Knowledgebase. Nat. Biotechnol.
36, 566–569. doi:10.1038/nbt.4163
Arnold, F.H., 2017. Directed Evolution: Bringing New Chemistry to Life. Angew. Chem.,
Int. Ed. 57, 4143–4148. doi:10.1002/anie.201708408
Atsumi, S., Hanai, T., Liao, J.C., 2008. Non-fermentative pathways for synthesis of
branched-chain higher alcohols as biofuels. Nature 451, 86–89. doi:10.1038/nature06450
Bajaj, P., Sreenilayam, G., Tyagi, V., Fasan, R., 2016. Gram-Scale Synthesis of Chiral
Cyclopropane-Containing Drugs and Drug Precursors with Engineered Myoglobin Catalysts
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Featuring Complementary Stereoselectivity. Angew. Chem., Int. Ed. 55, 16110–16114.
doi:10.1002/anie.201608680
Becker, J., Kuhl, M., Kohlstedt, M., Starck, S., Wittmann, C., 2018. Metabolic engineering of
Corynebacterium glutamicum for the production of cis, cis-muconic acid from lignin.
Microb. Cell Fact.17. doi:10.1186/s12934-018-0963-2
Bjelic, S., Nivón, L.G., Çelebi-Ölçüm, N., Kiss, G., Rosewall, C.F., Lovick, H.M., Ingalls,
E.L., Gallaher, J.L., Seetharaman, J., Lew, S., Montelione, G.T., Hunt, J.F., Michael, F.E.,
Houk, K.N., Baker, D., 2013. Computational Design of Enone-Binding Proteins with
Catalytic Activity for the Morita–Baylis–Hillman Reaction. ACS Chem. Biol. 8, 749–757.
doi:10.1021/cb3006227
Bothfeld, W., Kapov, G., Tyo, K.E.J., 2017. A Glucose-Sensing Toggle Switch for
Autonomous, High Productivity Genetic Control. ACS Synthetic Biology 6, 1296–1304.
doi:10.1021/acssynbio.6b00257
Brockman, I.M., Prather, K.L.J., 2015. Dynamic metabolic engineering: New strategies for
developing responsive cell factories. Biotechnol. J. 10, 1360–1369.
doi:10.1002/biot.201400422
Burgard, A., Burk, M.J., Osterhout, R., Van Dien, S., Yim, H., 2016. Development of a
commercial scale process for production of 1,4-butanediol from sugar. Curr. Opin.
Biotechnol. 42, 118–125. doi:10.1016/j.copbio.2016.04.016
Burgard, A.P., Pharkya, P., Maranas, C.D., 2003. Optknock: A bilevel programming
framework for identifying gene knockout strategies for microbial strain optimization.
Biotechnol. Bioeng. 84, 647–657. doi:10.1002/bit.10803
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Calhoun, S., Korczynska, M., Wichelecki, D.J., San Francisco, B., Zhao, S., Rodionov, D.A.,
Vetting, M.W., Al-Obaidi, N.F., Lin, H., O’Meara, M.J., Scott, D.A., Morris, J.H., Russel,
D., Almo, S.C., Osterman, A.L., Gerlt, J.A., Jacobson, M.P., Shoichet, B.K., Sali, A., 2018.
Prediction of enzymatic pathways by integrative pathway mapping. eLife 7.
doi:10.7554/elife.31097
Campodonico, M.A., Andrews, B.A., Asenjo, J.A., Palsson, B.O., Feist, A.M., 2014.
Generation of an atlas for commodity chemical production in Escherichia coli and a novel
pathway prediction algorithm, GEM-Path. Metab. Eng. 25, 140–158.
doi:10.1016/j.ymben.2014.07.009
Cann, A.F., Liao, J.C., 2008. Production of 2-methyl-1-butanol in engineered Escherichia
coli. Appl. Microbiol. Biotechnol. 81, 89–98. doi:10.1007/s00253-008-1631-y
Carbonell, P., Faulon, J.-L., 2010. Molecular signatures-based prediction of enzyme
promiscuity. Bioinformatics 26, 2012–2019. doi:10.1093/bioinformatics/btq317
Carbonell, P., Parutto, P., Baudier, C., Junot, C., Faulon, J.-L., 2013. Retropath: Automated
Pipeline for Embedded Metabolic Circuits. ACS Syn. Bio. 3, 565–577.
doi:10.1021/sb4001273
Carbonell, P., Parutto, P., Herisson, J., Pandit, S.B., Faulon, J.-L., 2014. XTMS: pathway
design in an eXTended metabolic space. Nucleic Acids Res. 42, W389–W394.
doi:10.1093/nar/gku362
Carbonell, P., Planson, A.-G., Fichera, D., Faulon, J.-L., 2011. A retrosynthetic biology
approach to metabolic pathway design for therapeutic production. BMC Syst. Biol. 5, 122.
doi:10.1186/1752-0509-5-122
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Cardoso, J.G.R., Jensen, K., Lieven, C., Lærke Hansen, A.S., Galkina, S., Beber, M.,
Özdemir, E., Herrgård, M.J., Redestig, H., Sonnenschein, N., 2018. Cameo: A Python
Library for Computer Aided Metabolic Engineering and Optimization of Cell Factories. ACS
Synthetic Biology 7, 1163–1166. doi:10.1021/acssynbio.7b00423
Caspi, R., Billington, R., Ferrer, L., Foerster, H., Fulcher, C.A., Keseler, I.M., Kothari, A.,
Krummenacker, M., Latendresse, M., Mueller, L.A., Ong, Q., Paley, S., Subhraveti, P.,
Weaver, D.S., Karp, P.D., 2015. The MetaCyc database of metabolic pathways and enzymes
and the BioCyc collection of pathway/genome databases. Nucleic Acids Res.44, D471–D480.
doi:10.1093/nar/gkv1164
Chae, T.U., Ko, Y.-S., Hwang, K.-S., Lee, S.Y., 2017. Metabolic engineering of Escherichia
coli for the production of four-, five- and six-carbon lactams. Metab. Eng. 41, 82–91.
doi:10.1016/j.ymben.2017.04.001
Chen, G.S., Siao, S.W., Shen, C.R., 2017. Saturated mutagenesis of ketoisovalerate
decarboxylase V461 enabled specific synthesis of 1-pentanol via the ketoacid elongation
cycle. Scientific Reports 7. doi:10.1038/s41598-017-11624-z
Chen, X., Liu, L., 2018. Gene Circuits for Dynamically Regulating Metabolism. Trends
Biotechnol. 36, 751–754. doi:10.1016/j.tibtech.2017.12.007
Chen, X.F., Xia, X.X., Lee, S.Y., Qian, Z.G., 2018. Engineering tunable biosensors for
monitoring putrescine in Escherichia coli. Biotechnol. Bioeng. 115, 1014–1027.
doi:10.1002/bit.26521
Cheong, S., Clomburg, J.M., Gonzalez, R., 2016. Energy- and carbon-efficient synthesis of
functionalized small molecules in bacteria using non-decarboxylative Claisen condensation
reactions. Nat Biotechnol 34, 556–561. doi:10.1038/nbt.3505
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Cheong, S., Clomburg, J.M., Gonzalez, R., 2018. A synthetic pathway for the production of
2-hydroxyisovaleric acid in Escherichia coli. J. Ind. Microbiol. Biotechnol. 45, 579–588.
doi:10.1007/s10295-018-2005-9
Cho, A., Yun, H., Park, J., Lee, S., Park, S., 2010. Prediction of novel synthetic pathways for
the production of desired chemicals. BMC Syst. Biol. 4, 35. doi:10.1186/1752-0509-4-35
Choi, Y.J., Lee, S.Y., 2013. Microbial production of short-chain alkanes. Nature 502, 571–
574. doi:10.1038/nature12536
Chu, H.S., Ahn, J.-H., Yun, J., Choi, I.S., Nam, T.-W., Cho, K.M., 2015. Direct fermentation
route for the production of acrylic acid. Metab. Eng. 32, 23–29.
doi:10.1016/j.ymben.2015.08.005
Chubukov, V., Mukhopadhyay, A., Petzold, C.J., Keasling, J.D., Martín, H.G., 2016.
Synthetic and systems biology for microbial production of commodity chemicals. npj
Systems Biology and Applications 2. doi:10.1038/npjsba.2016.9
Cobb, R.E., Chao, R., Zhao, H., 2013. Directed evolution: Past, present, and future. AIChE J.
59, 1432–1440. doi:10.1002/aic.13995
Coelho, P.S., Brustad, E.M., Kannan, A., Arnold, F.H., 2013. Olefin Cyclopropanation via
Carbene Transfer Catalyzed by Engineered Cytochrome P450 Enzymes. Science 339, 307–
310. doi:10.1126/science.1231434
Connor, M.R., Cann, A.F., Liao, J.C., 2010. 3-Methyl-1-butanol production in Escherichia
coli: random mutagenesis and two-phase fermentation. Appl. Microbiol. Biotechnol. 86,
1155–1164. doi:10.1007/s00253-009-2401-1
Conrad, T.M., Lewis, N.E., Palsson, B.O., 2011. Microbial laboratory evolution in the era of
genome-scale science. Molecular Systems Biology 7, 509–509. doi:10.1038/msb.2011.42
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Costanza, J., Carapezza, G., Angione, C., Lió, P., Nicosia, G., 2012. Robust design of
microbial strains. Bioinformatics 28, 3097–3104. doi:10.1093/bioinformatics/bts590
Costello, Z., Martin, H.G., 2018. A machine learning approach to predict metabolic pathway
dynamics from time-series multiomics data. npj Systems Biology and Applications 4.
doi:10.1038/s41540-018-0054-3
Covert, M.W., Schilling, C.H., Palsson, B., 2001. Regulation of Gene Expression in Flux
Balance Models of Metabolism. J. Theor. Biol. 213, 73–88. doi:10.1006/jtbi.2001.2405
Dahl, R.H., Zhang, F., Alonso-Gutierrez, J., Baidoo, E., Batth, T.S., Redding-Johanson,
A.M., Petzold, C.J., Mukhopadhyay, A., Lee, T.S., Adams, P.D., Keasling, J.D., 2013.
Engineering dynamic pathway regulation using stress-response promoters. Nat. Biotechnol.
31, 1039–1046. doi:10.1038/nbt.2689
Dai, L., Tao, F., Tang, H., Guo, Y., Shen, Y., Xu, P., 2017. Directing enzyme devolution for
biosynthesis of alkanols and 1,n-alkanediols from natural polyhydroxy compounds. Metab.
Eng. 44, 70–80. doi:10.1016/j.ymben.2017.09.005
Dai, Z., Nielsen, J., 2015. Advancing metabolic engineering through systems biology of
industrial microorganisms. Curr. Opin. Biotechnol. 36, 8–15.
doi:10.1016/j.copbio.2015.08.006
De Martino, A., De Martino, D., 2018. An introduction to the maximum entropy approach
and its application to inference problems in biology. Heliyon 4, e00596.
doi:10.1016/j.heliyon.2018.e00596
De Paepe, B., Peters, G., Coussement, P., Maertens, J., De Mey, M., 2017. Tailor-made
transcriptional biosensors for optimizing microbial cell factories. J. Ind. Microbiol.
Biotechnol. 44, 623–645. doi:10.1007/s10295-016-1862-3
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Delépine, B., Duigou, T., Carbonell, P., Faulon, J.-L., 2017. RetroPath2.0: A retrosynthesis
workflow for metabolic engineers. Metab. Eng. 45, 158–170.
doi:10.1016/j.ymben.2017.12.002
Didovyk, A., Borek, B., Tsimring, L., Hasty, J., 2016. Transcriptional regulation with
CRISPR-Cas9: principles, advances, and applications. Current Opinion in Biotechnology 40,
177–184. doi:10.1016/j.copbio.2016.06.003
Dietrich, J.A., McKee, A.E., Keasling, J.D., 2010. High-Throughput Metabolic Engineering:
Advances in Small-Molecule Screening and Selection. Annual Review of Biochemistry 79,
563–590. doi:10.1146/annurev-biochem-062608-095938
Dietrich, J.A., Shis, D.L., Alikhani, A., Keasling, J.D., 2013. Transcription factor-based
screens and synthetic selections for microbial small-molecule biosynthesis. ACS Synth. Biol.
2, 47–58. doi:10.1021/sb300091d
Dixon, N., Duncan, J.N., Geerlings, T., Dunstan, M.S., McCarthy, J.E.G., Leys, D.,
Micklefield, J., 2010. Reengineering orthogonally selective riboswitches. Proc. Natl. Acad.
Sci. 107, 2830–2835. doi:10.1073/pnas.0911209107
Dou, J., Vorobieva, A.A., Sheffler, W., Doyle, L.A., Park, H., Bick, M.J., Mao, B., Foight,
G.W., Lee, M.Y., Gagnon, L.A., Carter, L., Sankaran, B., Ovchinnikov, S., Marcos, E.,
Huang, P.-S., Vaughan, J.C., Stoddard, B.L., Baker, D., 2018. De novo design of a
fluorescence-activating β-barrel. Nature 561, 485–491. doi:10.1038/s41586-018-0509-0
Duigou, T., du Lac, M., Carbonell, P., Faulon, J.-L., 2018. RetroRules: a database of reaction
rules for engineering biology. Nucleic Acids Res. doi:10.1093/nar/gky940
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Ebrahim, A., Lerman, J.A., Palsson, B.O., Hyduke, D.R., 2013. COBRApy: COnstraints-
Based Reconstruction and Analysis for Python. BMC Syst. Biol. 7, 74. doi:10.1186/1752-
0509-7-74
Ewald, J., Bartl, M., Dandekar, T., Kaleta, C., 2017. Optimality principles reveal a complex
interplay of intermediate toxicity and kinetic efficiency in the regulation of prokaryotic
metabolism. PLoS Comput. Biol. 13, 1–19. doi:10.1371/journal.pcbi.1005371
Furubayashi, M., Ikezumi, M., Takaichi, S., Maoka, T., Hemmi, H., Ogawa, T., Saito, K.,
Tobias, A.V., Umeno, D., 2015. A highly selective biosynthetic pathway to non-natural C50
carotenoids assembled from moderately selective enzymes. Nat. Commun. 6.
doi:10.1038/ncomms8534
Guo, D., Zhu, J., Deng, Z., Liu, T., 2014. Metabolic engineering of Escherichia coli for
production of fatty acid short-chain esters through combination of the fatty acid and 2-keto
acid pathways. Metab. Eng. 22, 69–75. doi:10.1016/j.ymben.2014.01.003
Gupta, U., Le, T., Hu, W.-S., Bhan, A., Daoutidis, P., 2018. Automated network generation
and analysis of biochemical reaction pathways using RING. Metab. Eng. 49, 84–93.
doi:10.1016/j.ymben.2018.07.009
Hadadi, N., Hafner, J., Shajkofci, A., Zisaki, A., Hatzimanikatis, V., 2016. ATLAS of
Biochemistry: A Repository of All Possible Biochemical Reactions for Synthetic Biology and
Metabolic Engineering Studies. ACS Synth. Biol. 5, 1155–1166.
doi:10.1021/acssynbio.6b00054
Hadadi, N., Hatzimanikatis, V., 2015. Design of computational retrobiosynthesis tools for the
design of de novo synthetic pathways. Curr. Opin. Chem. Biol. 28, 99–104.
doi:10.1016/j.cbpa.2015.06.025
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Hädicke, O., Klamt, S., 2011. Computing complex metabolic intervention strategies using
constrained minimal cut sets. Metab. Eng. 13, 204–213. doi:10.1016/j.ymben.2010.12.004
Hammer, S.C., Kubik, G., Watkins, E., Huang, S., Minges, H., Arnold, F.H., 2017. Anti-
Markovnikov alkene oxidation by metal-oxo–mediated enzyme catalysis. Science 358, 215–
218. doi:10.1126/science.aao1482
Hatzimanikatis, V., Li, C., Ionita, J.A., Henry, C.S., Jankowski, M.D., Broadbelt, L.J., 2005.
Exploring the diversity of complex metabolic networks. Bioinformatics 21, 1603–1609.
doi:10.1093/bioinformatics/bti213
Heath, A.P., Bennett, G.N., Kavraki, L.E., 2011. An Algorithm for Efficient Identification of
Branched Metabolic Pathways. Journal of Computational Biology 18, 1575–1597.
doi:10.1089/cmb.2011.0165
Henry, C.S., DeJongh, M., Best, A.A., Frybarger, P.M., Linsay, B., Stevens, R.L., 2010.
High-throughput generation, optimization and analysis of genome-scale metabolic models.
Nat. Biotechnol. 28, 977–982. doi:10.1038/nbt.1672
Ho, J.C.H., Pawar, S.V., Hallam, S.J., Yadav, V.G., 2017. An Improved Whole-Cell
Biosensor for the Discovery of Lignin-Transforming Enzymes in Functional Metagenomic
Screens. ACS Syn. Bio.7, 392–398. doi:10.1021/acssynbio.7b00412
Hossain, G.S., Yuan, H., Li, J., Shin, H., Wang, M., Du, G., Chen, J., Liu, L., 2016.
Metabolic engineering of Raoultella ornithinolytica BF60 for the production of 2, 5-
furandicarboxylic acid from 5-hydroxymethylfurfural . Appl. Environ. Microbiol.
AEM.02312-16. doi:10.1128/aem.02312-16
doi:10.1016/j.ymben.2015.03.009
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Huang, P.-S., Feldmeier, K., Parmeggiani, F., Fernandez Velasco, D.A., Höcker, B., Baker,
D., 2015. De novo design of a four-fold symmetric TIM-barrel protein with atomic-level
accuracy. Nat. Chem. Biol.12, 29–34. doi:10.1038/nchembio.1966
Jabarivelisdeh, B., Waldherr, S., 2018. Optimization of bioprocess productivity based on
metabolic-genetic network models with bilevel dynamic programming. Biotechnol. Bioeng.
115, 1829–1841. doi:10.1002/bit.26599
Jang, S., Jang, S., Xiu, Y., Kang, T.J., Lee, S.-H., Koffas, M.A.G., Jung, G.Y., 2017.
Development of Artificial Riboswitches for Monitoring of Naringenin In Vivo. ACS Syn.
Bio. 6, 2077–2085. doi:10.1021/acssynbio.7b00128
Jeffryes, J.G., Colastani, R.L., Elbadawi-Sidhu, M., Kind, T., Niehaus, T.D., Broadbelt, L.J.,
Hanson, A.D., Fiehn, O., Tyo, K.E.J., Henry, C.S., 2015. MINEs: open access databases of
computationally predicted enzyme promiscuity products for untargeted metabolomics.
Journal of Cheminformatics 7. doi:10.1186/s13321-015-0087-1
Jensen, M.K., Keasling, J.D. (Eds.), 2018. Synthetic Metabolic Pathways, Methods in
Molecular Biology. Springer New York. doi:10.1007/978-1-4939-7295-1
Jiang, L., Althoff, E.A., Clemente, F.R., Doyle, L., Rothlisberger, D., Zanghellini, A.,
Gallaher, J.L., Betker, J.L., Tanaka, F., Barbas, C.F., Hilvert, D., Houk, K.N., Stoddard, B.L.,
Baker, D., 2008. De Novo Computational Design of Retro-Aldol Enzymes. Science 319,
1387–1391. doi:10.1126/science.1152692
Jones, J.A., Toparlak, T.D., Koffas, M.A.G., 2015. Metabolic pathway balancing and its role
in the production of biofuels and chemicals. Curr. Opin. Biotechnol. 33, 52–59.
doi:10.1016/j.copbio.2014.11.013
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Joo, J.C., Khusnutdinova, A.N., Flick, R., Kim, T., Bornscheuer, U.T., Yakunin, A.F.,
Mahadevan, R., 2017. Alkene hydrogenation activity of enoate reductases for an
environmentally benign biosynthesis of adipic acid. Chemical Science 8, 1406–1413.
doi:10.1039/c6sc02842j
Jung, Y.K., Kim, T.Y., Park, S.J., Lee, S.Y., 2010. Metabolic engineering of Escherichia coli
for the production of polylactic acid and its copolymers. Biotechnol. Bioeng. 105, 161–171.
doi:10.1002/bit.22548
Jungreuthmayer, C., Zanghellini, J., 2012. Designing optimal cell factories: integer
programming couples elementary mode analysis with regulation. BMC Syst. Biol. 6, 103.
doi:10.1186/1752-0509-6-103
Kanehisa, M., Goto S., 2000. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic
Acids Res. 28, 27–30. doi:10.1093/nar/28.1.27
Kaneko, A., Ishii, Y., Kirimura, K., 2011. High-yield Production of cis,cis-Muconic Acid
from Catechol in Aqueous Solution by Biocatalyst. Chemistry Letters 40, 381–383.
doi:10.1246/cl.2011.381
Kang, S.-Y., Choi, O., Lee, J.K., Ahn, J.-O., Ahn, J.S., Hwang, B.Y., Hong, Y.-S., 2015.
Artificial de novo biosynthesis of hydroxystyrene derivatives in a tyrosine overproducing
Escherichia coli strain. Microb. Cell Fact.14. doi:10.1186/s12934-015-0268-7
Kataoka, N., Vangnai, A.S., Pongtharangkul, T., Yakushi, T., Matsushita, K., 2017.
Production of 1,3-diols in Escherichia coli. Bioresour. Technol. 245, 1538–1541.
doi:10.1016/j.biortech.2017.05.082
Keseler, I.M., Mackie, A., Santos-Zavaleta, A., Billington, R., Bonavides-Martínez, C.,
Caspi, R., Fulcher, C., Gama-Castro, S., Kothari, A., Krummenacker, M., Latendresse, M.,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Muñiz-Rascado, L., Ong, Q., Paley, S., Peralta-Gil, M., Subhraveti, P., Velázquez-Ramírez,
D.A., Weaver, D., Collado-Vides, J., Paulsen, I., Karp, P.D., 2016. The EcoCyc database:
reflecting new knowledge about Escherichia coli K-12. Nucleic Acids Res.45, D543–D550.
doi:10.1093/nar/gkw1003
Kim, B., Park, H., Na, D., Lee, S.Y., 2013. Metabolic engineering of Escherichia coli for the
production of phenol from glucose. Biotechnol J 9, 621–629. doi:10.1002/biot.201300263
Kim, J., Reed, J.L., Maravelias, C.T., 2011. Large-Scale Bi-Level Strain Design Approaches
and Mixed-Integer Programming Solution Techniques. PLoS ONE 6, e24162.
doi:10.1371/journal.pone.0024162
King, Z.A., Feist, A.M., 2013. Optimizing Cofactor Specificity of Oxidoreductase Enzymes
for the Generation of Microbial Production Strains—OptSwap. Industrial Biotechnology 9,
236–246. doi:10.1089/ind.2013.0005
King, Z.A., Lu, J., Dräger, A., Miller, P., Federowicz, S., Lerman, J.A., Ebrahim, A., Palsson,
B.O., Lewis, N.E., 2015. BiGG Models: A platform for integrating, standardizing and sharing
genome-scale models. Nucleic Acids Res. 44, D515–D522. doi:10.1093/nar/gkv1049
Kiss, G., Çelebi-Ölçüm, N., Moretti, R., Baker, D., Houk, K.N., 2013. Computational
Enzyme Design. Angew. Chem., Int. Ed. 52, 5700–5725. doi:10.1002/anie.201204077
Kitade, Y., Hashimoto, R., Suda, M., Hiraga, K., Inui, M., 2018. Production of 4-
Hydroxybenzoic Acid by an Aerobic Growth-Arrested Bioprocess Using Metabolically
Engineered Corynebacterium glutamicum. Appl. Environ. Microbiol. 84.
doi:10.1128/aem.02587-17
Klamt, S., Gilles, E.D., 2004. Minimal cut sets in biochemical reaction networks.
Bioinformatics 20, 226–234. doi:10.1093/bioinformatics/btg395
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Klamt, S., Haus, U.-U., Theis, F., 2009. Hypergraphs and Cellular Networks. PLoS Comput.
Biol. 5, e1000385. doi:10.1371/journal.pcbi.1000385
Klamt, S., Mahadevan, R., 2015. On the feasibility of growth-coupled product synthesis in
microbial strains. Metab. Eng. 30, 166–178. doi:10.1016/j.ymben.2015.05.006
Kohlstedt, M., Starck, S., Barton, N., Stolzenberger, J., Selzer, M., Mehlmann, K., Schneider,
R., Pleissner, D., Rinkel, J., Dickschat, J.S., Venus, J., B.J.H. van Duuren, J., Wittmann, C.,
2018. From lignin to nylon: Cascaded chemical and biochemical conversion using
metabolically engineered Pseudomonas putida. Metab. Eng. 47, 279–293.
doi:10.1016/j.ymben.2018.03.003
Koopman, F., Wierckx, N., de Winde, J.H., Ruijssenaars, H.J., 2010. Efficient whole-cell
biotransformation of 5-(hydroxymethyl)furfural into FDCA, 2,5-furandicarboxylic acid.
Bioresour. Technol. 101, 6291–6296. doi:10.1016/j.biortech.2010.03.050
Kumar, A., Suthers, P.F., Maranas, C.D., 2012. MetRxn: a knowledgebase of metabolites and
reactions spanning metabolic models and databases. BMC Bioinformatics 13, 6.
doi:10.1186/1471-2105-13-6
Leavitt, J.M., Wagner, J.M., Tu, C.C., Tong, A., Liu, Y., Alper, H.S., 2017. Biosensor-
Enabled Directed Evolution to Improve Muconic Acid Production in Saccharomyces
cerevisiae. Biotechnol. J. 12, 1600687. doi:10.1002/biot.201600687
Lee, J.W., Kim, T.Y., Jang, Y.-S., Choi, S., Lee, S.Y., 2011. Systems metabolic engineering
for chemicals and materials. Trends Biotechnol. 29, 370–378.
doi:10.1016/j.tibtech.2011.04.001
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Lee, J.W., Na, D., Park, J.M., Lee, J., Choi, S., Lee, S.Y., 2012. Systems metabolic
engineering of microorganisms for natural and non-natural chemicals. Nat. Chem. Biol.8,
536–546. doi:10.1038/nchembio.970
Lee, S.Y., Kim, H.U., 2015. Systems strategies for developing industrial microbial strains.
Nat. Biotechnol. 33, 1061–1072. doi:10.1038/nbt.3365
Lee, S.Y., Kim, H.U., Chae, T.U., Cho, J.S., Kim, J.W., Shin, J.H., Kim, D.I., Ko, Y.-S.,
Jang, W.D., Jang, Y.-S., 2019. A comprehensive metabolic map for production of bio-based
chemicals. Nat. Catal. 2, 18–33. doi:10.1038/s41929-018-0212-4
Leonard, E., Ajikumar, P.K., Thayer, K., Xiao, W.-H., Mo, J.D., Tidor, B., Stephanopoulos,
G., Prather, K.L.J., 2010. Combining metabolic and protein engineering of a terpenoid
biosynthetic pathway for overproduction and selectivity control. Proc. Natl. Acad. Sci. 107,
13654–13659. doi:10.1073/pnas.1006138107
Lewis, N.E., Hixson, K.K., Conrad, T.M., Lerman, J.A., Charusanti, P., Polpitiya, A.D.,
Adkins, J.N., Schramm, G., Purvine, S.O., Lopez-Ferrer, D., Weitz, K.K., Eils, R., König, R.,
Smith, R.D., Palsson, B.Ø., 2010. Omic data from evolved E. coli are consistent with
computed optimal growth from genome-scale models. Mol. Syst. Biol. 6.
doi:10.1038/msb.2010.47
Lewis, N.E., Nagarajan, H., Palsson, B.O., 2012. Constraining the metabolic genotype–
phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10, 291–
305. doi:10.1038/nrmicro2737
Li, C., Henry, C.S., Jankowski, M.D., Ionita, J.A., Hatzimanikatis, V., Broadbelt, L.J., 2004.
Computational discovery of biochemical routes to specialty chemicals. Chemical Engineering
Science 59, 5051–5060. doi:10.1016/j.ces.2004.09.021
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Li, S., Si, T., Wang, M., Zhao, H., 2015. Development of a Synthetic Malonyl-CoA Sensor in
Saccharomyces cerevisiae for Intracellular Metabolite Monitoring and Genetic Screening.
ACS Synth. Biol. 4, 1308–1315. doi:10.1021/acssynbio.5b00069
Lim, H.G., Jang, S., Jang, S., Seo, S.W., Jung, G.Y., 2018. Design and optimization of
genetically encoded biosensors for high-throughput screening of chemicals. Curr. Opin.
Biotechnol. 54, 18–25. doi:10.1016/j.copbio.2018.01.011
Lin, J.-L., Wagner, J.M., Alper, H.S., 2017. Enabling tools for high-throughput detection of
metabolites: Metabolic engineering and directed evolution applications. Biotechnol. Adv. 35,
950–970. doi:10.1016/j.biotechadv.2017.07.005
Lin, P.P., Mi, L., Morioka, A.H., Yoshino, K.M., Konishi, S., Xu, S.C., Papanek, B.A., Riley,
L.A., Guss, A.M., Liao, J.C., 2015. Consolidated bioprocessing of cellulose to isobutanol
using Clostridium thermocellum. Metab. Eng. 31, 44–52. doi:10.1016/j.ymben.2015.07.001
Liu, C., Men, X., Chen, H., Li, M., Ding, Z., Chen, G., Wang, F., Liu, H., Wang, Q., Zhu, Y.,
Zhang, H., Xian, M., 2018. A systematic optimization of styrene biosynthesis in Escherichia
coli BL21(DE3). Biotechnol. Biofuels 11. doi:10.1186/s13068-018-1017-z
Liu, D., Xiao, Y., Evans, B.S., Zhang, F., 2015. Negative feedback regulation of fatty acid
production based on a malonyl-CoA sensor-actuator. ACS Synth. Biol. 4, 132–140.
doi:10.1021/sb400158w
Liu, H., Lu, T., 2015. Autonomous production of 1,4-butanediol via a de novo biosynthesis
pathway in engineered Escherichia coli. Metab. Eng. 29, 135–141.
Liu, Y., Kuhlman, B., 2006. RosettaDesign server for protein design. Nucl. Ac. Res.34,
W235–W238. doi:10.1093/nar/gkl163
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Lun, D.S., Rockwell, G., Guido, N.J., Baym, M., Kelner, J.A., Berger, B., Galagan, J.E.,
Church, G.M., 2009. Large-scale identification of genetic design strategies using local search.
Mol. Syst. Biol. 5. doi:10.1038/msb.2009.7
Ma, Y., Di Salvo, M.L., Budisa, N., 2018. Self-Directed in Cell Production of Methionine
Analogue Azidohomoalanine by Synthetic Metabolism and Its Incorporation into Model
Proteins, in: Methods in Molecular Biology. Springer New York, pp. 127–135.
doi:10.1007/978-1-4939-7574-7_7
Machado, D., Andrejev, S., Tramontano, M., Patil, K.R., 2018. Fast automated reconstruction
of genome-scale metabolic models for microbial species and communities. Nucleic Acids
Res. 46, 7542–7553. doi:10.1093/nar/gky537
Mahadevan, R., Burgard, A.P., Famili, I., Van Dien, S., Schilling, C.H., 2005. Applications
of metabolic modeling to drive bioprocess development for the production of value-added
chemicals. Biotechnol. Bioprocess Eng. 10, 408–417. doi:10.1007/bf02989823
Mahadevan, R., Edwards, J.S., Doyle, F.J., III, 2002. Dynamic Flux Balance Analysis of
Diauxic Growth in Escherichia coli. Biophysical Journal 83, 1331–1340. doi:10.1016/s0006-
3495(02)73903-9
Mahadevan, R., von Kamp, A., Klamt, S., 2015. Genome-scale strain designs based on
regulatory minimal cut sets. Bioinformatics 31, 2844–2851.
doi:10.1093/bioinformatics/btv217
Maia, P., Rocha, M., Rocha, I., 2015. In-Silico Constraint-Based Strain Optimization
Methods: the Quest for Optimal Cell Factories. Microbiology and Molecular Biology
Reviews 80, 45–67. doi:10.1128/mmbr.00014-15
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Mak, W.S., Tran, S., Marcheschi, R., Bertolani, S., Thompson, J., Baker, D., Liao, J.C.,
Siegel, J.B., 2015. Integrative genomic mining for enzyme function to enable engineering of
a non-natural biosynthetic pathway. Nat. Commun. 6. doi:10.1038/ncomms10005
Marcheschi, R.J., Gronenberg, L.S., Liao, J.C., 2013. Protein engineering for metabolic
engineering: Current and next-generation tools. Biotechnol. J. 8, 545–555.
doi:10.1002/biot.201200371
Matsuda, F., Ishii, J., Kondo, T., Ida, K., Tezuka, H., Kondo, A., 2013. Increased isobutanol
production in Saccharomyces cerevisiae by eliminating competing pathways and resolving
cofactor imbalance. Microb. Cell Fact.12, 119. doi:10.1186/1475-2859-12-119
McKenna, R., Nielsen, D.R., 2011. Styrene biosynthesis from glucose by engineered E. coli.
Metab. Eng. 13, 544–554. doi:10.1016/j.ymben.2011.06.005
McKenna, R., Pugh, S., Thompson, B., Nielsen, D.R., 2013. Microbial production of the
aromatic building-blocks (S)-styrene oxide and (R)-1,2-phenylethanediol from renewable
resources. Biotechnol. J. 8, 1465–1475. doi:10.1002/biot.201300035
McKenna, R., Thompson, B., Pugh, S., Nielsen, D.R., 2014. Rational and combinatorial
approaches to engineering styrene production by Saccharomyces cerevisiae. Microb. Cell
Fact.13. doi:10.1186/s12934-014-0123-2
Mehrer, C.R., Incha, M.R., Politz, M.C., Pfleger, B.F., 2018. Anaerobic production of
medium-chain fatty alcohols via a β-reduction pathway. Metab. Eng. 48, 63–71.
doi:10.1016/j.ymben.2018.05.011
Miskovic, L., Hatzimanikatis, V., 2010. Production of biofuels and biochemicals: in need of
an ORACLE. Trends in Biotechnology 28, 391–397. doi:10.1016/j.tibtech.2010.05.003
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Miyamoto, K.T., Komatsu, M., Ikeda, H., 2014. Discovery of Gene Cluster for Mycosporine-
Like Amino Acid Biosynthesis from Actinomycetales Microorganisms and Production of a
Novel Mycosporine-Like Amino Acid by Heterologous Expression. Appl. Environ.
Microbiol. 80, 5028–5036. doi:10.1128/aem.00727-14
Monk, J., Nogales, J., Palsson, B.O., 2014. Optimizing genome-scale network
reconstructions. Nat. Biotechnol. 32, 447–452. doi:10.1038/nbt.2870
Moretti, S., Martin, O., Van Du Tran, T., Bridge, A., Morgat, A., Pagni, M., 2015.
MetaNetX/MNXref – reconciliation of metabolites and biochemical reactions to bring
together genome-scale metabolic networks. Nucleic Acids Res. 44, D523–D526.
doi:10.1093/nar/gkv1117
Mutturi, S., 2017. FOCuS: a metaheuristic algorithm for computing knockouts from genome-
scale models for strain optimization. Molecular BioSystems 13, 1355–1363.
doi:10.1039/c7mb00204a
Nair, G., Jungreuthmayer, C., Hanscho, M., Zanghellini, J., 2015. Designing minimal
microbial strains of desired functionality using a genetic algorithm. Algorithms for Molecular
Biology 10. doi:10.1186/s13015-015-0060-6
Nemr, K., Müller, J.E.N., Joo, J.C., Gawand, P., Choudhary, R., Mendonca, B., Lu, S., Yu,
X., Yakunin, A.F., Mahadevan, R., 2018. Engineering a short, aldolase-based pathway for
(R)-1,3-butanediol production in Escherichia coli. Metab. Eng. 48, 13–24.
doi:10.1016/j.ymben.2018.04.013
Nguyen, A.D., Hwang, I.Y., Lee, O.K., Kim, D., Kalyuzhnaya, M.G., Mariyana, R., Hadiyati,
S., Kim, M.S., Lee, E.Y., 2018. Systematic metabolic engineering of Methylomicrobium
alcaliphilum 20Z for 2,3-butanediol production from methane. Metabolic Engineering 47,
323–333. doi:10.1016/j.ymben.2018.04.010
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Niu, W., Draths, K.M., Frost, J.W., 2002. Benzene-Free Synthesis of Adipic Acid.
Biotechnol. Prog. 18, 201–211. doi:10.1021/bp010179x
Noda, S., Shirai, T., Oyama, S., Kondo, A., 2016. Metabolic design of a platform Escherichia
coli strain producing various chorismate derivatives. Metab. Eng. 33, 119–129.
doi:10.1016/j.ymben.2015.11.007
Noor, E., Eden, Eran., Milo, R., Alon, Uri. 2010. Central Carbon Metabolism as a Minimal
Biochemical Walk between Precursors for Biomass and Energy. Mol Cell. 39(5):809-20.
doi:10.1016/j.molcel.2010.08.031
O’Brien, E.J., Monk, J.M., Palsson, B.O., 2015. Using Genome-scale Models to Predict
Biological Capabilities. Cell 161, 971–987. doi:10.1016/j.cell.2015.05.019
Oberhardt, M.A., Zarecki, R., Reshef, L., Xia, F., Duran-Frigola, M., Schreiber, R., Henry,
C.S., Ben-Tal, N., Dwyer, D.J., Gophna, U., Ruppin, E., 2016. Systems-Wide Prediction of
Enzyme Promiscuity Reveals a New Underground Alternative Route for Pyridoxal 5’-
Phosphate Production in E. coli. PLoS Comput. Biol. 12, e1004705.
doi:10.1371/journal.pcbi.1004705
Ohno, S., Shimizu, H., Furusawa, C., 2013. FastPros: screening of reaction knockout
strategies for Metab. Eng.. Bioinformatics 30, 981–987. doi:10.1093/bioinformatics/btt672
Orth, J.D., Thiele, I., Palsson, B.Ø., 2010. What is flux balance analysis? Nat. Biotechnol. 28,
245–248. doi:10.1038/nbt.1614
Packer, M.S., Liu, D.R., 2015. Methods for the directed evolution of proteins. Nat. Rev.
Genet. 16, 379–394. doi:10.1038/nrg3927
Pandit, A.V., Srinivasan, S., Mahadevan, R., 2017. Redesigning metabolism based on
orthogonality principles. Nat. Commun. 8, 15188. doi:10.1038/ncomms15188
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Patil, K., Rocha, I., Förster, J., Nielsen, J., 2005. Evolutionary programming as a platform for
in silico metabolic engineering. BMC Bioinformatics 6, 308. doi:10.1186/1471-2105-6-308
Pertusi, D.A., Stine, A.E., Broadbelt, L.J., Tyo, K.E.J., 2014. Efficient searching and
annotation of metabolic networks using chemical similarity. Bioinformatics 31, 1016–1024.
doi:10.1093/bioinformatics/btu760
Pharkya, P., 2004. OptStrain: A computational framework for redesign of microbial
production systems. Genome Research 14, 2367–2376. doi:10.1101/gr.2872004
Pharkya, P., Maranas, C.D., 2006. An optimization framework for identifying reaction
activation/inhibition or elimination candidates for overproduction in microbial systems.
Metab. Eng. 8, 1–13. doi:10.1016/j.ymben.2005.08.003
Pitkänen, E., Jouhten, P., Rousu, J., 2009. Inferring branching pathways in genome-scale
metabolic networks. BMC Syst. Biol. 3. doi:10.1186/1752-0509-3-103
Placzek, S., Schomburg, I., Chang, A., Jeske, L., Ulbrich, M., Tillack, J., Schomburg, D.,
2016. BRENDA in 2017: new perspectives and new tools in BRENDA. Nucleic Acids Res.
45, D380–D388. doi:10.1093/nar/gkw952
Prather, K.L.J., Martin, C.H., 2008. De novo biosynthetic pathways: rational design of
microbial chemical factories. Curr. Opin. Biotechnol. 19, 468–474.
doi:10.1016/j.copbio.2008.07.009
Pyne, M.E., Narcross, L., Melgar, M., Kevvai, K., Mookerjee, S., Leite, G.B., Martin, V.J.J.,
2018. An Engineered Aro1 Protein Degradation Approach for Increased cis,cis-Muconic
Acid Biosynthesis in Saccharomyces cerevisiae. Appl. Environ. Microbiol. 84.
doi:10.1128/aem.01095-18
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Qian, S., Cirino, P.C., 2016. Using metabolite-responsive gene regulators to improve
microbial biosynthesis. Curr. Opin. Chem. Eng.14, 93–102. doi:10.1016/j.coche.2016.08.020
Raj, K., Partow, S., Correia, K., Khusnutdinova, A.N., Yakunin, A.F., Mahadevan, R., 2018.
Biocatalytic production of adipic acid from glucose using engineered Saccharomyces
cerevisiae. Metab. Eng. Commun. 6, 28–32. doi:10.1016/j.meteno.2018.02.001
Rantasalo, A., Kuivanen, J., Penttilä, M., Jäntti, J., Mojzita, D., 2018. Synthetic Toolkit for
Complex Genetic Circuit Engineering in Saccharomyces cerevisiae. ACS Synth. Biol. 7,
1573–1587. doi:10.1021/acssynbio.8b00076
Reed, J.L., Vo, T.D., Schilling, C.H., Palsson, B.O., 2003. An expanded genome-scale model
of Escherichia coli K-12 (iJR904 GSM/GPR). Genome Biology 4, R54. doi:10.1186/gb-
2003-4-9-r54
Ren, Y., Yang, S., Yuan, Q., Sun, X., 2015. Microbial production of phenol via salicylate
decarboxylation. RSC Adv. 5, 92685–92689. doi:10.1039/c5ra20104g
Rocha, I., Maia, P., Evangelista, P., Vilaça, P., Soares, S., Pinto, J.P., Nielsen, J., Patil, K.R.,
Ferreira, E.C., Rocha, M., 2010. OptFlux: an open-source software platform for in silico
metabolic engineering. BMC Systems Biology 4, 45. doi:10.1186/1752-0509-4-45
Rockwell, G., Guido, N.J., Church, G.M., 2013. Redirector: Designing Cell Factories by
Reconstructing the Metabolic Objective. PLoS Comput. Biol. 9, e1002882.
doi:10.1371/journal.pcbi.1002882
Rode, A.B., Endoh, T., Sugimoto, N., 2015. Tuning riboswitch-mediated gene regulation by
rational control of aptamer ligand binding properties. Angew. Chemie - Int. Ed. 54, 905–909.
doi:10.1002/anie.201407385
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Rogers, J.K., Church, G.M., 2016. Genetically encoded sensors enable real-time observation
of metabolite production. Proc. Natl. Acad. Sci. 113, 2388–2393.
doi:10.1073/pnas.1600375113
Rogers, J.K., Taylor, N.D., Church, G.M., 2016. Biosensor-based engineering of biosynthetic
pathways. Curr. Opin. Biotechnol. 42, 84–91. doi:10.1016/j.copbio.2016.03.005
Röthlisberger, D., Khersonsky, O., Wollacott, A.M., Jiang, L., DeChancie, J., Betker, J.,
Gallaher, J.L., Althoff, E.A., Zanghellini, A., Dym, O., Albeck, S., Houk, K.N., Tawfik, D.S.,
Baker, D., 2008. Kemp elimination catalysts by computational enzyme design. Nature 453,
190–195. doi:10.1038/nature06879
Ruckerbauer, D.E., Jungreuthmayer, C., Zanghellini, J., 2015. Predicting genetic engineering
targets with Elementary Flux Mode Analysis: a review of four current methods. New
Biotechnology 32, 534–546. doi:10.1016/j.nbt.2015.03.017
Salinas, F., Rojas, V., Delgado, V., Agosin, E., Larrondo, L.F., 2017. Optogenetic switches
for light-controlled gene expression in yeast. Appl. Microbiol. Biotechnol. 101, 2629–2640.
doi:10.1007/s00253-017-8178-8
Salvachúa, D., Johnson, C.W., Singer, C.A., Rohrer, H., Peterson, D.J., Black, B.A., Knapp,
A., Beckham, G.T., 2018. Bioprocess development for muconic acid production from
aromatic compounds and lignin. Green Chem. 20, 5007–5019. doi:10.1039/c8gc02519c
Schellenberger, J., Que, R., Fleming, R.M.T., Thiele, I., Orth, J.D., Feist, A.M., Zielinski,
D.C., Bordbar, A., Lewis, N.E., Rahmanian, S., Kang, J., Hyduke, D.R., Palsson, B.Ø., 2011.
Quantitative prediction of cellular metabolism with constraint-based models: the COBRA
Toolbox v2.0. Nature Protocols 6, 1290–1307. doi:10.1038/nprot.2011.308
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Segler, M.H.S., Preuss, M., Waller, M.P., 2018. Planning chemical syntheses with deep
neural networks and symbolic AI. Nature 555, 604–610. doi:10.1038/nature25978
Segre, D., Vitkup, D., Church, G.M., 2002. Analysis of optimality in natural and perturbed
metabolic networks. Proc. Natl. Acad. Sci. 99, 15112–15117. doi:10.1073/pnas.232349399
Shin, J.H., Park, S.H., Oh, Y.H., Choi, J.W., Lee, M.H., Cho, J.S., Jeong, K.J., Joo, J.C., Yu,
J., Park, S.J., Lee, S.Y., 2016. Metabolic engineering of Corynebacterium glutamicum for
enhanced production of 5-aminovaleric acid. Microb. Cell Fact.15. doi:10.1186/s12934-016-
0566-8
Shlomi, T., Berkman, O., Ruppin, E., 2005. Regulatory on/off minimization of metabolic flux
changes after genetic perturbations. Proc. Natl. Acad. Sci. 102, 7695–7700.
doi:10.1073/pnas.0406346102
Shlomi, T., Eisenberg, Y., Sharan, R., Ruppin, E., 2007. A genome-scale computational study
of the interplay between transcriptional regulation and metabolism. Mol. Syst. Biol. 3.
doi:10.1038/msb4100141
Siegel, J.B., Zanghellini, A., Lovick, H.M., Kiss, G., Lambert, A.R., St.Clair, J.L., Gallaher,
J.L., Hilvert, D., Gelb, M.H., Stoddard, B.L., Houk, K.N., Michael, F.E., Baker, D., 2010.
Computational Design of an Enzyme Catalyst for a Stereoselective Bimolecular Diels-Alder
Reaction. Science 329, 309–313. doi:10.1126/science.1190239
Siripong, W., Wolf, P., Kusumoputri, T.P., Downes, J.J., Kocharin, K., Tanapongpipat, S.,
Runguphan, W., 2018. Metabolic engineering of Pichia pastoris for production of isobutanol
and isobutyl acetate. Biotechnol. Biofuels 11. doi:10.1186/s13068-017-1003-x
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Sivakumar, T.V., Giri, V., Park, J.H., Kim, T.Y., Bhaduri, A., 2016. ReactPRED: a tool to
predict and analyze biochemical reactions. Bioinformatics btw491.
doi:10.1093/bioinformatics/btw491
Skjoedt, M.L., Snoek, T., Kildegaard, K.R., Arsovska, D., Eichenberger, M., Goedecke, T.J.,
Rajkumar, A.S., Zhang, J., Kristensen, M., Lehka, B.J., Siedler, S., Borodina, I., Jensen,
M.K., Keasling, J.D., 2016. Engineering prokaryotic transcriptional activators as metabolite
biosensors in yeast. Nat. Chem. Biol. 12, 951–958. doi:10.1038/nchembio.2177
Snoek, T., Romero-Suarez, D., Zhang, J., Ambri, F., Skjoedt, M.L., Sudarsan, S., Jensen,
M.K., Keasling, J.D., 2018. An Orthogonal and pH-Tunable Sensor-Selector for Muconic
Acid Biosynthesis in Yeast. ACS Synth. Biol. 7, 995–1003. doi:10.1021/acssynbio.7b00439
Song, H.-S., Goldberg, N., Mahajan, A., Ramkrishna, D., 2017. Sequential computation of
elementary modes and minimal cut sets in genome-scale metabolic networks using alternate
integer linear programming. Bioinformatics 33, 2345–2353.
doi:10.1093/bioinformatics/btx171
Srinivasan, S., Cluett, W.R., Mahadevan, R., 2015. Constructing kinetic models of
metabolism at genome-scales: A review. Biotechnol. J. 10, 1345–1359.
doi:10.1002/biot.201400522
Steffensen, J.L., Dufault-Thompson, K., Zhang, Y., 2016. PSAMM: A Portable System for
the Analysis of Metabolic Models. PLoS Comput. Biol. 12, e1004732.
doi:10.1371/journal.pcbi.1004732
Tai, Y.-S., Xiong, M., Jambunathan, P., Wang, J., Wang, J., Stapleton, C., Zhang, K., 2016.
Engineering nonphosphorylative metabolism to generate lignocellulose-derived products.
Nat. Chem. Biol. 12, 247–253. doi:10.1038/nchembio.2020
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Tepper, N., Shlomi, T., 2009. Predicting metabolic engineering knockout strategies for
chemical production: accounting for competing pathways. Bioinformatics 26, 536–543.
doi:10.1093/bioinformatics/btp704
Thompson, B., Pugh, S., Machas, M., Nielsen, D.R., 2018. Muconic Acid Production via
Alternative Pathways and a Synthetic “Metabolic Funnel.” ACS Synth. Biol. 7, 565–575.
doi:10.1021/acssynbio.7b00331
Tiwari, M.K., Singh, R., Singh, R.K., Kim, I.-W., Lee, J.-K., 2012. Computational
approaches for rational design of proteins with novel functionalities. Comput. Struct.
Biotechnol. J. 2, e201204002. doi:10.5936/csbj.201209002
Trinh, C.T., Liu, Y., Conner, D.J., 2015. Rational design of efficient modular cells. Metab.
Eng. 32, 220–231. doi:10.1016/j.ymben.2015.10.005
Tseng, H.-C., Prather, K.L.J., 2012. Controlled biosynthesis of odd-chain fuels and chemicals
via engineered modular metabolic pathways. Proc. Natl. Acad. Sci. 109, 17925–17930.
doi:10.1073/pnas.1209002109
Tsuge, Y., Kawaguchi, H., Yamamoto, S., Nishigami, Y., Sota, M., Ogino, C., Kondo, A.,
2018. Metabolic engineering of Corynebacterium glutamicum for production of sunscreen
shinorine. Biosci., Biotechnol., Biochem.82, 1252–1259.
doi:10.1080/09168451.2018.1452602
Turk, S.C.H.J., Kloosterman, W.P., Ninaber, D.K., Kolen, K.P.A.M., Knutova, J., Suir, E.,
Schürmann, M., Raemakers-Franken, P.C., Müller, M., de Wildeman, S.M.A., Raamsdonk,
L.M., van der Pol, R., Wu, L., Temudo, M.F., van der Hoeven, R.A.M., Akeroyd, M., van der
Stoel, R.E., Noorman, H.J., Bovenberg, R.A.L., Trefzer, A.C., 2015. Metabolic Engineering
toward Sustainable Production of Nylon-6. ACS Synth. Biol. 5, 65–73.
doi:10.1021/acssynbio.5b00129
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Venayak, N., Anesiadis, N., Cluett, W.R., Mahadevan, R., 2015. Engineering metabolism
through dynamic control. Curr. Opin. Biotechnol. 34, 142–152.
doi:10.1016/j.copbio.2014.12.022
Venayak, N., von Kamp, A., Klamt, S., Mahadevan, R., 2018a. MoVE identifies metabolic
valves to switch between phenotypic states. Nat. Commun. 9. doi:10.1038/s41467-018-
07719-4
Venayak, N., Raj, K., Jaydeep, R., Mahadevan, R., 2018b. An Optimized Bistable Metabolic
Switch to Decouple Phenotypic States during Anaerobic Fermentation. ACS Syn. Bio.
doi:10.1021/acssynbio.8b00284
Vogt, M., Brüsseler, C., Ooyen, J. van, Bott, M., Marienhagen, J., 2016. Production of 2-
methyl-1-butanol and 3-methyl-1-butanol in engineered Corynebacterium glutamicum.
Metab. Eng. 38, 436–445. doi:10.1016/j.ymben.2016.10.007
Völler, J.-S., Budisa, N., 2017. Coupling genetic code expansion and metabolic engineering
for synthetic cells. Curr. Opin. Biotechnol. 48, 1–7. doi:10.1016/j.copbio.2017.02.002
von Kamp, A., Klamt, S., 2014. Enumeration of Smallest Intervention Strategies in Genome-
Scale Metabolic Networks. PLoS Comput. Biol. 10, e1003378.
doi:10.1371/journal.pcbi.1003378
von Kamp, A., Klamt, S., 2017. Growth-coupled overproduction is feasible for almost all
metabolites in five major production organisms. Nat. Commun. 8, 15956.
doi:10.1038/ncomms15956
Walther, T., Topham, C.M., Irague, R., Auriol, C., Baylac, A., Cordier, H., Dressaire, C.,
Lozano-Huguet, L., Tarrat, N., Martineau, N., Stodel, M., Malbert, Y., Maestracci, M., Huet,
R., André, I., Remaud-Siméon, M., François, J.M., 2017. Construction of a synthetic
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
metabolic pathway for biosynthesis of the non-natural methionine precursor 2,4-
dihydroxybutyric acid. Nat. Commun. 8, 15828. doi:10.1038/ncomms15828
Wang, J., Jain, R., Shen, X., Sun, X., Cheng, M., Liao, J.C., Yuan, Q., Yan, Y., 2017b.
Rational engineering of diol dehydratase enables 1,4-butanediol biosynthesis from xylose.
Metab. Eng. 40, 148–156. doi:10.1016/j.ymben.2017.02.003
Wang, J., Yang, Y., Zhang, R., Shen, X., Chen, Z., Wang, J., Yuan, Q., Yan, Y., 2018.
Microbial production of branched-chain dicarboxylate 2-methylsuccinic acid via enoate
reductase-mediated bioreduction. Metab. Eng. 45, 1–10. doi:10.1016/j.ymben.2017.11.007
Wang, L., Dash, S., Ng, C.Y., Maranas, C.D., 2017a. A review of computational tools for
design and reconstruction of metabolic pathways. Synth Syst Biotechnol. 2, 243–252.
doi:10.1016/j.synbio.2017.11.002
Wang, L., Ji, D., Liu, Y., Wang, Q., Wang, X., Zhou, Y.J., Zhang, Y., Liu, W., Zhao, Z.K.,
2017c. Synthetic Cofactor-Linked Metabolic Circuits for Selective Energy Transfer. ACS
Catal. 7, 1977–1983. doi:10.1021/acscatal.6b03579
Wang, X., Chen, X., Yang, Y., 2012. Spatiotemporal control of gene expression by a light-
switchable transgene system. Nat. Met. 9, 266–269. doi:10.1038/nmeth.1892
Werpy, T.A. and Petersen, G., Eds. (2004) Top Value Added Chemicals from Biomass.
Volume I: Results of Screening for Potential Candidates from Sugars and Synthesis Gas.
Pacific Northwest National Laboratory (PNNL) and National Renewable Energy Laboratory
(NREL).
Woolfson, D.N., Bartlett, G.J., Burton, A.J., Heal, J.W., Niitsu, A., Thomson, A.R., Wood,
C.W., 2015. De novo protein design: how do we expand into the universe of possible protein
structures? Curr. Opin. Struct. Biol. 33, 16–26. doi:10.1016/j.sbi.2015.05.009
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Xiu, Y., Jang, S., Jones, J.A., Zill, N.A., Linhardt, R.J., Yuan, Q., Jung, G.Y., Koffas,
M.A.G., 2017. Naringenin-responsive riboswitch-based fluorescent biosensor module for
Escherichia coli co-cultures. Biotechnol. Bioeng.114, 2235–2244. doi:10.1002/bit.26340
Xu, P., 2018. Production of chemicals using dynamic control of metabolic fluxes. Curr. Opin.
Biotechnol. 53, 12–19. doi:10.1016/j.copbio.2017.10.009
Xu, P., Li, L., Zhang, F., Stephanopoulos, G., Koffas, M., 2014. Improving fatty acids
production by engineering dynamic pathway regulation and metabolic control. Proc. Natl.
Acad. Sci. 111, 11299–11304. doi:10.1073/pnas.1406401111
Yamamoto, S., Suda, M., Niimi, S., Inui, M., Yukawa, H., 2013. Strain optimization for
efficient isobutanol production using Corynebacterium glutamicum under oxygen
deprivation. Biotechnol. Bioeng. 110, 2938–2948. doi:10.1002/bit.24961
Yang, J.E., Park, S.J., Kim, W.J., Kim, H.J., Kim, B.J., Lee, H., Shin, J., Lee, S.Y., 2018.
One-step fermentative production of aromatic polyesters from glucose by metabolically
engineered Escherichia coli strains. Nat. Commun. 9. doi:10.1038/s41467-017-02498-w
Yang, L., Cluett, W.R., Mahadevan, R., 2011. EMILiO: A fast algorithm for genome-scale
strain design. Metab. Eng. 13, 272–281. doi:10.1016/j.ymben.2011.03.002
Yim, H., Haselbeck, R., Niu, W., Pujol-Baxley, C., Burgard, A., Boldt, J., Khandurina, J.,
Trawick, J.D., Osterhout, R.E., Stephen, R., Estadilla, J., Teisan, S., Schreyer, H.B., Andrae,
S., Yang, T.H., Lee, S.Y., Burk, M.J., Van Dien, S., 2011. Metabolic engineering of
Escherichia coli for direct production of 1,4-butanediol. Nat. Chem. Biol.7, 445–452.
doi:10.1038/nchembio.580
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Yu, T., Zhou, Y.J., Huang, M., Liu, Q., Pereira, R., David, F., Nielsen, J., 2018.
Reprogramming Yeast Metabolism from Alcoholic Fermentation to Lipogenesis. Cell 174,
1549–1558.e14. doi:10.1016/j.cell.2018.07.013
Yuan, H., Liu, Y., Li, J., Shin, H., Du, G., Shi, Z., Chen, J., Liu, L., 2018. Combinatorial
synthetic pathway fine-tuning and comparative transcriptomics for metabolic engineering of
Raoultella ornithinolytica BF60 to efficiently synthesize 2,5-furandicarboxylic acid.
Biotechnol. Bioeng.. doi:10.1002/bit.26725
Zanghellini, A., 2014. De novo computational enzyme design. Curr. Opin. Biotechnol. 29,
132–138. doi:10.1016/j.copbio.2014.03.002
Zanghellini, J., Ruckerbauer, D.E., Hanscho, M., Jungreuthmayer, C., 2013. Elementary flux
modes in a nutshell: Properties, calculation and applications. Biotechnol. J. 8, 1009–1016.
doi:10.1002/biot.201200269
Zastrow, M.L., Pecoraro, V.L., 2014. Designing Hydrolytic Zinc Metalloenzymes.
Biochemistry 53, 957–978. doi:10.1021/bi4016617
Zhang, J., Jensen, M.K., Keasling, J.D., 2015. Development of biosensors and their
application in metabolic engineering. Curr. Opin. Chem. Biol. 28, 1–8.
doi:10.1016/j.cbpa.2015.05.013
Zhao, M., Huang, D., Zhang, X., Koffas, M.A.G., Zhou, J., Deng, Y., 2018. Metabolic
engineering of Escherichia coli for producing adipic acid through the reverse adipate-
degradation pathway. Metab. Eng. 47, 254–262. doi:10.1016/j.ymben.2018.04.002
Zhou, H., Vonk, B., Roubos, J.A., Bovenberg, R.A.L., Voigt, C.A., 2015. Algorithmic co-
optimization of genetic constructs and growth conditions: application to 6-ACA, a potential
nylon-6 precursor. Nucleic Acids Res. gkv1071. doi:10.1093/nar/gkv1071
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Ziemert, N., Alanjary, M., Weber, T., 2016. The evolution of genome mining in microbes – a
review. Nat. Prod. Rep. 33, 988–1005. doi:10.1039/c6np00025h
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Tables and Figure Captions
Table 1. Non-natural chemicals currently being chemically produced from biomass
Chemical Uses Raw material Company
FDCA, PEF Bioplastic glucose (YXY
technology)
Avantium, The
Netherlands
Lactide, PLA Bioplastic bio-lactic acid NatureWorks, MN,
U.S.A.
Corbion, The
Netherlands
Butadiene Biorubber bio 1,3 BDO Eni-Versalis, Italy
Synthetic Paraffinic
Kerosene (ATJ-SPK)
Jetfuel bioethanol LanzaTech, IL, U.S.A.
Methyl Acrylate,
Isosorbide Diacrylate
bio-based acrylate resins glucose PTT MCC Biochem,
Thailand
Adipic acid Nylon fibers glucose Verdezyne*, CA, U.S.A.
Rennovia*, CA, U.S.A.
Isobutylene Fuel additives glucose Global Bioenergies,
France
hydroxymethylamine Nylon fibers glucose Rennovia*, CA, U.S.A.
1,6-Hexanediol polyurethanes glucose Rennovia*, CA, U.S.A.
Levoglucosenone,
Hydroxymethyl
Butyrolactone
Drugs, herbicides,
flavourings and fragrances
cellulose Circa, Australia
Abbreviations: FDCA – 2,5 furan dicarboxylic acid; PEF – polyethylene furanoate; PLA –
polylactic acid; BDO – butanediol; * Currently in finatial recuperation.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Table 2. Laboratory scale bioprocess for non-natural chemicals production
Compound Titer (g/L) Microrganism Substrate Systems Biology
Strategy
Reference
1,4 BDO 18.0 Escherichia coli glucose SimPheny
Biopathway
Predictor;
Promiscuous
enzyme screen;
OptKnock
deletions
Yim et al.,
2011
1,4 BDO 0.5 Escherichia coli D-xylose Promiscuous
enzyme screen;
dynamic control;
orthogonality
Liu and Lu,
2015
1,4 BDO 16.5 Escherichia coli D-xylose Promiscuous
enzyme screen;
enzyme rational
design
Tai et al., 2016
1,4 BDO 0.2 Escherichia coli D-xylose Enzyme rational
design
Wang et al.,
2017b
1,3 BDO 2.4 Escherichia coli glucose Promiscuous
enzyme screen
Nemr et al.,
2018
1,3 PDO 0.416 Escherichia coli glucose,
propionate
- Kataoka et al.,
2017
1,2‐Phenylethanediol 1.23 Escherichia coli glucose
Promiscuous
enzyme screen
Mckeena et al.,
2013
4-Methyl-1,3-
pentanediol
0.072 Escherichia coli glucose,
isobutyrate
- Kataoka et al.,
2017
1,2,4-Butanetriol 0.003 Escherichia coli glucose,
glycolate
- Kataoka et al.,
2017
Isobutanol 22.0 Escherichia coli glucose - Atsumi and
Liao, 2008
Isobutanol 1.62 Saccharomyces
cerevisiae
glucose - Matsuda et al.,
2013
Isobutanol 25.4 Corynebacterium
glutamicum
glucose - Yamamoto et
al., 2013
Isobutanol 5.4 Clostridium
thermocellum
cellulose - Lin et al., 2015
Isobutanol 2.22 Pichia pastoris glucose Promiscuous
enzyme screen
Siripong et al.,
2018
2-Methyl-1-
butanol 1.25 Escherichia coli glucose
Promiscuous
enzyme screen
Cann and Liao,
2008
2-Methyl-1-
butanol 0.37
Corynebacterium
glutamicum glucose
- Vogt et al.,
2016
Isopentanol 9.5 Escherichia coli glucose
- Connor et al.,
2010
Isopentanol 0.086 Escherichia coli glucose
- Guo et al.,
2014
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
1-Pentanol 1.137 Escherichia coli
trans-2-
pentanoate
and formate
Promiscuous
enzyme screen Tseng and
Prather, 2012
1-Pentanol 4.3 Escherichia coli glucose
Enzyme rational
design
Chen et al.,
2017
1-Pentanol 0.137 Escherichia coli
1,2
pentanediol
Enzyme rational
design Dai et al., 2017
1-Pentanol 0.118 Escherichia coli glucose
Enzyme rational
design; genome
mining
Mak et al.,
2015
1-Hexanol 0.341 Escherichia coli glucose Enzyme rational
design; genome
mining
Mak et al.,
2015
1-Heptanol 0.269 Escherichia coli glucose Enzyme rational
design; genome
mining
Mak et al.,
2015
Nonane 0.327 Escherichia coli glucose
- Choi and Lee,
2013
Dodecane 0.136 Escherichia coli glucose
- Choi and Lee,
2013
Tridecane 0.064 Escherichia coli glucose
- Choi and Lee,
2013
2-Methyl-
dodecane 0.042 Escherichia coli glucose
- Choi and Lee,
2013
Tetradecane 0.009 Escherichia coli glucose
- Choi and Lee,
2013
Isobutyl acetate 0.051 Pichia pastoris glucose
Promiscuous
enzyme screen
Siripong et al.,
2018
2,4-
dihydroxybutyric
acid 1.8 Escherichia coli glucose
Thermodynamic
and stoichiometric
analysis;
enzyme
rational design;
pathway balancing
Walther et al.,
2017
Muconic acid 36.6 Escherichia coli glucose
- Niu et al.,
2008
Muconic acid 59 Escherichia coli cathecol
- Kaneko et al.,
2011
Muconic acid 0.141
Saccharomyces
cerevisiae glucose
FBA; Pathway
balancing
Curran et al.,
2013
Muconic acid 2.1
Saccharomyces
cerevisiae glucose
Biosensor Leavitt et al.,
2017
Muconic acid 64.2
Pseudomonas
putida cathecol
Pathway balancing Kohlstedt et
al., 2018
Muconic acid 3.1 Escherichia coli glucose
Thermodynamic;
Elementary Flux
Mode Analysis
Thompson et
al., 2017
Muconic acid 1.8
Corynebacterium
glutamicum lignin
- Becker et al.,
2018
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Muconic acid 85
Corynebacterium
glutamicum cathecol
- Becker et al.,
2018
Muconic acid 5.1
Saccharomyces
cerevisiae glucose
- Pyne et al.,
2018
Muconic acid 52.3
Pseudomonas
putida
benzoate,
coumarate,
corn stover
-
Salvachua et
al., 2018
Adipic acid <0.001 Escherichia coli glucose - Yu et al., 2014
Adipic acid 2.5 Escherichia coli glycerol Cheong et al.,
2016
Adipic acid 0.003 Saccharomyces
cerevisiae
glucose -
Raj et al., 2018
Adipic acid 68 Escherichia coli glucose Pathway balancing Zhao et al.,
2018
Acrylic acid 0.12 Escherichia coli glucose
Promiscuous
enzyme screen
Chu et al.,
2015
4-
Hydroxybenzoic
Acid 1.82 Escherichia coli glucose
-
Noda et al.,
2016
4-
Hydroxybenzoic
Acid 36.6
Corynebacterium
glutamicum glucose
Promiscuous
enzyme screen Kitade et al.,
2018
2-
hydroxyisovaleric
acid 7.8 Escherichia coli glucose
- Cheong et al.,
2018
2-Methylsuccinic
acid 3.61 Escherichia coli glucose
Promiscuous
enzyme screen
Wang et al.,
2018
Polylactic acid 3%CDW Escherichia coli glucose
MOMA; FBA Jung et al.,
2010
Butyrolactam 54.14 Escherichia coli glucose
In-silico flux
response analysis;
thermodynamic
analysis;
promiscuous
enzyme screen
Chae et al.,
2017
Valerolactam 1.18 Escherichia coli glucose
In-silico flux
response analysis;
thermodynamic
analysis;
promiscuous
enzyme screen
Chae et al.,
2017
Caprolactam <0.001 Escherichia coli glucose
In-silico flux
response analysis;
thermodynamic
analysis;
promiscuous
enzyme screen
Chae et al.,
2017
Caprolactam 0.160 Escherichia coli glucose Promiscuous Turk et al.,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
enzyme screen 2016
6-Aminocaproic
acid 0.048 Escherichia coli glucose
Pathway balancing Zhou et al.,
2015
5-Aminovaleric
acid 33.1
Corynebacterium
glutamicum glucose
- Shin et al.,
2016
D-Phenyllactic
acid 1.62 Escherichia coli glucose
Promiscuous
enzyme screen
Yang et al.,
2018
Poly (3HB-co-D-
phenyllactate) 13.9 Escherichia coli glucose
Promiscuous
enzyme screen
Yang et al.,
2018
Shinorine 0.019
Corynebacterium
glutamicum
sodium
gluconate
- Tsuge et al.,
2018
Shinorine 0.154
Streptomyces
avermitilis glucose
Genome mining Miyamoto et
al., 2014
Styrene 0.3 Escherichia coli glucose
Promiscuous
enzyme screen
Mckenna and
Nielsen, 2011
Styrene oxide 1.32 Escherichia coli glucose
Promiscuous
enzyme screen
Mckeena et al.,
2013
Styrene 0.029
Saccharomyces
cerevisiae glucose
- Mckenna et al.,
2014
Styrene 0.35 Escherichia coli glucose
Promiscuous
enzyme screen;
pathway balancing Liu et al., 2018
3,4-
Dihydroxystyrene 0.063 Escherichia coli glucose
- Kang et al.,
2015
4-Hydroxy-3-
methoxy styrene 0.064 Escherichia coli glucose
- Kang et al.,
2015
4-Hydroxy
styrene 0.355 Escherichia coli glucose
- Kang et al.,
2015
FDCA 30.1
Pseudomonas
putida 5-HMF
- Koopman et
al., 2010
FDCA 13.9
Raoultella
ornithinolytica 5-HMF
- Hussain et al.,
2016
FDCA 34.5
Raoultella
ornithinolytica 5-HMF
Pathway balancing Yuan et al.,
2018
Phenol 1.69 Escherichia coli glucose
- Kim et al.,
2013
Phenol 0.472 Escherichia coli glucose
- Ren et al.,
2015
Phenol 0.377 Escherichia coli glucose
Thermodynamic
analysis;
promiscuous
enzyme screen;
elementar flux
modes
Thompson et
al., 2018
C50 astaxanthin
770 μg
gDCW−1 Escherichia coli
glucose
Directed evolution Furubayashi et
al., 2015
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Abbreviations: FBA – Flux Balance Analysis; MOMA – Minimization of Metabolic
Adjustment; FDCA - 2,5-Furandicarboxylic acid; 5-HMF – Hydroxymethylfurfural.
Figure 1. Synthetic pathways for non-natural chemicals. 1,4 BDO can be produced from
xylose by manipulating native xylose catabolism in two different ways or from the central
metabolism, using ketoglutarate and succinyl-CoA as intermediates. 1,3 BDO can be
produced from acetyl-CoA either using acetaldehyde or butyryl-CoA as intermediates.
Acrylic acid can be produced by extrapolating from the propanoate metabolism, using
acryloyl-CoA as a precursor. Adipic acid can be produced either from benzoate or from
central metabolism by using a reverse benzoate degradation pathway, or from shikimate
pathway, using dehydroshikimate (DHS) as intermediate and muconic acid, a non-natural
chemical itself, as precursor. Shikimate pathway can also be extended for producing 4-
hydroxybenzoic acid or phenol. In addition, several amino acids are key intermediates for
non-natural chemical production. Isoleucine and valine biosynthesis pathways can be
manipulated for producing 2-methyl-1-butanol, isobutanol and isobutyl acetate. Styrene and
styrene derivatives can be produced from tyrosine, by extrapolating from phenylpropanoid
biosynthesis. Finally, lysine and glutamate degradation pathways can be manipulated for
producing 5-aminovalerate, valerolactam, 6-aminocaproate, caprolactam and butyrolactam.
Figure 2. Typical workflow for implementation of systems biology based tools in
microorganisms. (A) Genomic, transcriptomic and proteomic data are pulled from databases
and analyzed. (B) Pathway prediction tools are used to compute intermediates from the
target product to the starting compound. (C) Natural enzymes that catalyze pathway
reactions are pulled from databases; if activities are low or not available, rational engineering,
directed evolution or de novo enzyme design may be applied. (D) Strain design algorithms
are applied to genome-scale metabolic models to determine a set of genetic perturbations for
achieving an engineering objective. (E) Engineered enzymes and genetic perturbations are
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
applied to the host strain in vivo. (F) Several strains are screened for optimal activity using
high-throughput methods. (G) Scale-up engineering at the bioreactor level is completed for
high-performing strains.
Figure 3. Key steps of in-silico non-native pathway design. (A) Databases for known
biochemical reactions and molecules and databases for hypothetical extended reaction
network. (B) Network representation used in different pathway computation tools. In a simple
graph network, only one major substrate and one product are kept for each reaction. In a
bipartite graph, the two types of nodes represent reaction operators and intermediate
compounds, respectively. In a (directed) hypergraph, each hyper arc represents a different
reaction connecting multiple reactants and products. (C) Three types of molecular and
reaction representation incorporated in different tools, 1. BEM or bond-electron-matrix
representation of reactants and products uses a matrix to include all the bond information
between each pair of non-hydrogen atoms, 2. SMILES representation of molecular structure
and SMARTS/SMIRKS way of writing reaction rule, 3. Atom mapping tracks each atom in
the reactants. (D) Different search algorithms include breadth-first search, depth first search
(from source compounds), retrosynthesis (search starting from target compound) and double-
direction search. (E) Various pathway ranking/prioritization methods include estimating
chemical structural similarity, pathway length, enzyme docking, thermodynamic feasibility
and number of non-native reaction steps.
Figure 4. Enzyme engineering approaches. (A) Directed evolution process starts with mining
genomic metadata for suitable starting enzymes or identifying key residues for catalytic
activity in an existing starting enzyme. The next step involves creating a combinatorial
library by random mutagenesis of the gene of interest (error-prone PCR, DNA-shuffling) or
targeted mutagenesis of the residues of interest (site-directed mutagenesis, site-saturated
mutagenesis). The variants are then expressed in vivo and those with adequately robust
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
growth (assayed by fluorescence-based methods) (ie. FACS), culturing in liquid medium or
on LB Agar plates, or phage display are selected for production screens using a variety of
methods of measurement, including gas and liquid chromatography, biosensors and mass
spectrometry. For combinatorial libraries generated by random mutagenesis, the variants that
pass the screen for production are iteratively mutated, selected for, and screened until those
that remain exhibit a satisfactory level of performance. For libraries obtained by targeted
mutagenesis, the enzyme structure is updated to include the most successful mutation and
may undergo targeted mutagenesis for another key residues one-by-one or simultaneously
(MAGE). (B) Biosensors take the production of the target compound as an input signal, and
output either an increased expression of a reporter gene such as GFP or an increased
expression of an antibiotic resistance gene. (C) In the “inside out” enzyme design, the
catalytic mechanism is first chosen by defining the reactants and products, after which the
transition state of the reaction is established. Then, complex calculations are performed
computationally to obtain the 3-D structure of the amino acids around the transition state to
maximize stability (the ‘theozyme’). Finally, the RosettaDesign algorithm is used to predict
the 3-D structure of amino acids around the theozyme to stabilize it by matching a protein
scaffold.
Figure 5. (A) Mixed-Integer Linear Programming methods such as OptKnock can suggest
knockout strategies that result in strong or weak growth coupling. (B) Minimal cut sets
(MCS) eliminate undesired phenotypes by incorporating a minimum product yield as a
constraint. (C) Constrained minimal cut sets (cMCS) build on minimal cut sets by
incorporating an additional minimum biomass yield as a constraint. (D) MoVE suggests
metabolic valves to switch between a growth state and a production state. (E) Metaheuristic
methods have decoupled cellular and engineering objectives. (F) Kinetic models are
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
burdened by parameterization requirements which may be addressed using computational
alternatives.
Figure 6. Orthogonal production and dynamic control as alternatives to growth-coupled
production. (A) Pathways featuring a greater number of reactions shared between the biomass
or growth (native metabolism) and production branches will suffer from interference more
characteristic of growth-coupled systems. (B) Minimizing the number of common enzymatic
steps leads to a reduction in the cross-talk and an increase in orthogonality can be attained at
the organism level. Such orthogonal networks are characterized by minimal cross-talk
between growth and biosynthetic pathways. (C) Dynamic control tools such as genetic toggle
switches can reversibly divert flux through the biomass and production branches at any given
time. This is achieved by an external signal such as the presence of chemical inducer, absence
of substrate, or a change in temperature. The bistable switch allows for the minimization of
the trade-off between growth and production that usually acts as a barrier for both to be
optimal. In state 1, the chemical inducer is represented as pink molecules and it leads to the
induction of expression of the pink repressor protein as well as the pink gene. The pink
repressor blocks expression of the yellow gene and the yellow repressor protein. In state 2,
the chemical inducer is represented as yellow molecules and it leads to the induction of
expression of the yellow repressor protein and the yellow gene instead. Consequently, the
yellow repressor blocks the expression of the pink gene and the pink repressor protein.
Hence, this bistable switch alternates between states during which one of the genes is
expressed, and the other is repressed. CP refers to a common precursor shared between the
biomass and production branches of the network.
Figure 7. Non-natural redox cofactors may be used to increase the orthogonality of
biosynthetic pathways. (A) When the redox states in a biosynthetic pathway of interest are
mediated by natural redox cofactors, cross-talk will exist with the cell’s natural metabolism.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
(B) The use of non-natural redox cofactors for a biosynthetic pathway while the cellular
metabolism continues recognizing natural cofactors can reduce interference and increase the
orthogonality at the pathway level. The most popular non-natural redox cofactors used for
this purpose include nicotinamide cytosine dinucleotide (NCD) and nicotinamide flucytosine
dinucleotide (NFCD), both based on the natural backbone of nicotinamide adenine
dinucleotide (NAD), represented in (C) as nicotinamide nucleobase dinucleotide (NND).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Highlights
Pathways to more than 60 Non-natural chemicals described
Computational tools for pathway prediction and host optimization for non-natural chemical
summarized
Enzyme discovery and engineering methods using directed evolution, biosensors and de novo
design presented
Emerging concepts in orthogonal pathway design and dynamic control discussed
ACCEPTED MANUSCRIPT