systems biology based metabolic engineering for non-natural … · 2019. 4. 17. · mcs minimal cut...

TSpace Research Repository tspace.library.utoronto.ca

Systems biology based metabolic engineering for non-natural chemicals

Alessandra Biz, Scott Proulx, Zhiqing Xu, Kavya Siddartha, Alex

Mulet Indrayanti, Radhakrishnan Mahadevan

Version Post-print/Accepted Manuscript

Citation (published version)

Publisher’s Statement

Biz, A., Proulx, S., Xu, Z., Siddartha, K., Indrayanti, A.M. and Mahadevan, R., 2019. Systems biology based metabolic engineering for non-natural chemicals. Biotechnology Advances. The article has been published in final form at [10.1016/j.biotechadv.2019.04.001]

Copyright / License This work is licensed under the Creative Commons

Attribution-NonCommercial-NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

How to cite TSpace items

Always cite the published version, so the author(s) will receive recognition through services that track

citation counts, e.g. Scopus. If you need to cite the page number of the author manuscript from TSpace because you cannot access the published version, then cite the TSpace version in addition to the published

version using the permanent URI (handle) found on the record page.

This article was made openly accessible by U of T Faculty. Please tell us how this access benefits you. Your story matters.

http://creativecommons.org/licenses/by-nc-nd/4.0/

https://tspace.library.utoronto.ca/feedback

Accepted Manuscript

Systems biology based metabolic engineering for non-naturalchemicals

Alessandra Biz, Scott Proulx, Zhiqing Xu, Kavya Siddartha, AlexMulet Indrayanti, Radhakrishnan Mahadevan

PII: S0734-9750(19)30060-6DOI: https://doi.org/10.1016/j.biotechadv.2019.04.001Reference: JBA 7379

To appear in: Biotechnology Advances

Received date: 4 December 2018Revised date: 23 February 2019Accepted date: 1 April 2019

Please cite this article as: A. Biz, S. Proulx, Z. Xu, et al., Systems biology based metabolicengineering for non-natural chemicals, Biotechnology Advances, https://doi.org/10.1016/j.biotechadv.2019.04.001

This is a PDF file of an unedited manuscript that has been accepted for publication. Asa service to our customers we are providing this early version of the manuscript. Themanuscript will undergo copyediting, typesetting, and review of the resulting proof beforeit is published in its final form. Please note that during the production process errors maybe discovered which could affect the content, and all legal disclaimers that apply to thejournal pertain.

https://doi.org/10.1016/j.biotechadv.2019.04.001



ACC

EPTE

D M

ANU

SCR

IPT

Systems Biology Based Metabolic Engineering for Non-natural Chemicals

Alessandra Biz1, Scott Proulx

1, Zhiqing Xu

1, Kavya Siddartha

1, Alex Mulet Indrayanti

1,

Radhakrishnan Mahadevan1,*

[email protected]

1Department of Chemical Engineering and Applied Chemistry, University of Toronto,

Toronto, ON, Canada

*Corresponding Author.

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Abstract

Production of chemicals in microorganisms is no longer restricted to products arising from

native metabolic potential. In this review, we highlight the evolution of metabolic

engineering studies, from the production of natural chemicals fermented from biomass

hydrolysates, to the engineering of microorganisms for the production of non-natural

chemicals. Advances in synthetic biology are accelerating the successful development of

microbial cell factories to directly produce value-added chemicals. Here we outline the

emergence of novel computational tools for the creation of synthetic pathways, for designing

artificial enzymes for non-natural reactions and for re-wiring host metabolism to increase the

metabolic flux to products. We also highlight exciting opportunities for applying directed

evolution of enzymes, dynamic control of growth and production, growth-coupling strategies

as well as decoupled strategies based on orthogonal pathways in the context of non-natural

chemicals.

Keywords: systems biology; non-natural chemicals; in-silico pathway design; directed

evolution; de novo protein engineering; genome-scale models; flux balance analysis; dynamic

control; growth-coupled production; orthogonality

Abbreviations

FBA Flux Balance Analysis

MOMA Minimization of Metabolic Adjustment

FDCA 2,5-Furandicarboxylic acid

5-HMF Hydroxymethylfurfural

PEF polyethylene furanoate

PLA polylactic acid

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

BDO butanediol

GSMMs Genome-Scale Metabolic Models

ER enoate reductases

1,3 PDO 1,3 pentanediol

CBM Constraint-Based Modeling

COBRA Constraint-Based Reconstruction and Analysis

CSOMs Computational Strain Optimization Methods

MCS minimal cut sets

EMs elementary flux modes

NCD nicotinamide cytosine dinucleotide

EMA Elementary Mode Analysis

MOVE Metabolic Valve Enumerator

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

1. Introduction

In 2004, when the U. S. Department of Energy released the report “Top Value-Added

Chemicals from Biomass” (Werpy and Petersen, 2004) the idea was to screen platform

chemicals that were naturally produced by microorganisms from sugars, which, in turn, could

be chemically transformed into molecules of industrial relevance. In fact, currently there are

several companies producing bulk and fine chemicals from renewable sources, although, in

most cases, via chemical routes (Table 1).

Now, instead of producing these platform chemicals, more studies are being driven by the

idea that any chemical, including non-natural chemicals, could be potentially made directly in

microbial cell factories (Lee et al., 2012). The technology that is accelerating this shift is the

integration of systems biology with metabolic engineering (Jensen and Keasling, 2018).

Non-natural chemicals are molecules, such as, fuels, materials and pharmaceuticals, that

rarely occur in nature and are primarily being made chemically by using petroleum as starting

material (Lee et al., 2012). As they are not made naturally by any organism, there are no

known natural pathways that lead to the production of these compounds. However, recent

advances in systems biology based metabolic engineering are being used to uncover synthetic

pathways and rewire host metabolism for their biochemical production from sugars (Jensen

and Keasling, 2018). These studies, and the insights learned from them, are the focus of this

review.

Table 2 shows that the production of non-natural chemicals is already a reality, with more

than 40 different compounds being produced in different hosts, in particular, Escherichia

coli, Pseudomonas putida, Corynebacterium glutamicum and Saccharomyces cerevisiae. The

major synthetic pathways for producing non-natural compounds are highlighted in Figure 1.

Recently, a few of those compounds have reached the commercial scale. One noteworthy

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

example is the development of a process for the production of 1,4 butanediol (1,4 BDO)

directly in E. coli by the company Genomatica (San Diego, CA), which reached the

production scale of 30000 tons/year.

Systems biology is generally defined by the integration of computational techniques and

biological Big Data, acquired by using high-throughput technologies, to unravel the

principles of complex biological systems (Dai and Nielsen, 2015). Recently, there has been a

focus on synthetic biology, which involves the use of engineering fundamentals, such as

systems abstraction, part standardization, characterization and modeling, for the design,

construction and optimization of genetic circuits and synthetic pathways. Thus, systems

biology based metabolic engineering is the field of study that combines systems and synthetic

biology to develop processes for making non-natural chemicals biologically (Lee et al.,

2011). Such strategies are based on, 1) identifying de novo pathways that could provide a

route to produce the proposed chemical, 2) identifying the enzyme candidates that could be

used for the proposed reactions of the pathway, and 3) determining and implementing the

genetic interventions that can be made in the host for increasing flux to products (Lee et al.,

2011). The general outline of how systems biology based metabolic engineering is used to

design microbes for non-natural chemicals production is shown in Figure 2.

In this review, we will briefly present each of these concepts, including the latest

achievements in systems biology based strategies to design microbial cell factories and how

these technologies are being used for producing non-natural chemicals. Important aspects that

will be covered in this review are: de novo pathway design, engineering of enzymes through

directed evolution, model-based designs for both growth-coupled synthetic pathways and

decoupled growth and production, as well as dynamic control and orthogonality concepts

based on metabolic valves.

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

2. In-silico pathway design

In-silico pathway design is the preliminary step in the development of novel biosynthetic

pathways. In the past decade, a large number of pathway design tools were developed,

making use of divergent approaches (i.e., different molecule encodings, pathway network

topologies, pruning methods, search algorithms, pathway prioritization metrics etc.) (Wang et

al., 2017; Hadadi and Hatzimanikatis, 2015). However, only a small portion of those

algorithms allow prediction of de novo pathways with non-natural intermediates and

chemicals, while most tools have focused on searching the reaction networks of metabolites

in natural biochemical processes.

A generalized workflow for de novo pathway design involves the following critical steps: (a)

the identification, from a list of starting materials, of all chemically plausible pathways to the

desired target compound on the basis of chemical structure transformation, (b) the pathway

prioritization based on various criteria, including thermodynamic feasibility, number of non-

natural reaction steps, enzyme docking and cellular resources cost etc., and (c) the selection

of the most suitable host organism and designing the synthetic expression constructs

(pathway integration).

In this section, we will briefly discuss how such steps are linked together to build a pipeline

for the discovery of new bio-pathways, as well as the recently developed computational tools

designed for that task, and the main challenges that need to be overcome regarding pathway

prediction.

2.1 Pathway identification

Recent efforts for de novo pathway discovery involve two closely-related approaches to

pathway identification, one of which is the construction of a putative reaction network using

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

generalized reaction transformations of promiscuous enzymes, which is often referred to as

“graph-based” pathway finding methods (BNICE, SimPheny, ReactPRED, GEM-path,

RetroPath 2.0) (Wang et al., 2017a). The other approach is the creation of an extended

pathway database (XTMS, MINE, ATLAS of Biochemistry) which allows a fast search for

pathways with non-natural intermediates and reactions (Carbonell et al., 2014). Both

strategies explore generalized reaction transformations of putative promiscuous enzymes that

can act upon structurally similar molecules (Figure 3A).

Different molecular and reaction encoding systems (Figure 3C) have been proposed to solve

the pathway identification in a retrosynthesis context. SimPheny (Yim et al., 2011) and

BNICE (Hatzimanikatis et al., 2005) use the third level EC number to guide their reaction

rules encoding where molecules and reaction operators are represented in the form of BEM

(bond-electron matrix), while GEM-path (Campodonico et al., 2014) and RetroPath2.0

(Delépine et al., 2017) utilize the simpler SMILES/SMARTS and SMIRKS system to encode

the compounds and transformations (Figure 3C). Most of the pathfinding algorithms use in-

house sets of transformation rules, for example, the latest version of BNICE has 361

generalized reaction rules, covering ~90% of KEGG 2014 (with 9000+ reactions).

Retropath2.0 and ReactPRED (Sivakumar et al., 2016) take a more data-driven approach.

ReactPRED automates the generation of transformation rules from user-defined reactions and

RetroPath2.0 systematically extracts thousands of transformation rules (expressed in

SMARTS/SMIRKS format) from 31527 reactions in the MetaNetX database, automatically

identifying the reaction centre and generating generalized reaction operator for each reaction

in the database. These automatically generated rules have specified more neighboring atoms

to the reaction centre and therefore are relatively less “general” as compared to reaction rules

in other methods (i.e., BNICE).

An update of the reaction rules database used in RetroPath2.0, RetroRules (Duigou et

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

al.,2018) extract over 400,000 rules from multiple public databases including MetaNetX

(Moretti et al., 2016), Brenda (Placzek et al., 2017), BiGG (King et al., 2015), KEGG

(Kanehisa and Goto, 2000), MetaCyc (Caspi et al., 2016), MetRxn (Kumar et al., 2012) etc.

While most currently used reaction rules include only substructures of the substrates and

products, the recently developed RING (Gupta et al., 2018), sets more constraints in the

transformation rules, for example, upper and lower bounds of size of each substrate, what

substructure must not be included in each reactant, etc.

Various network pruning methods have also been developed to reduce the redundant branches

(reactions) and nodes (substrates) in the pathway network. SimIndex and SimZyme (Pertusi

et al., 2014) screen intermediate compounds based on their structural similarities to the target

compound and the enzyme’s native substrates, respectively. GEM-Path uses estimated

reaction thermodynamics to guide the network search. SimPheny (BioPathway Predictor) sets

a bound for molecule size. Other approaches include calculating a penalty score based on

enzyme sequence consistency in Retropath2.0.

The network search algorithm (Figure 3D) most commonly used by different de novo

pathway prediction tools is breadth-first search (some with pathway finding heuristics)

starting from the target compound (as used in RetroPath 2.0 and Delepine et al., 2017). This

retrosynthetic approach works well for relatively large molecular structures. However, since

transformation rules cannot be used to predict the reactants and products at the same time,

retrosynthesis does not work as well for pathways that consist of decomposition reactions. In

addition to this retrosynthetic approach, breadth-first search starting from the source

compounds, depth-first search (as used in BNICE, Hatzimanikatis et al., 2005) and double

direction search from the both the source and target chemicals (used in Uri Alon

Biopathfinder, Noor et al., 2010) are also used in different pathway identification algorithms.

Breadth-first search starting from the source compounds (as used in ReTrace, Pitkanen et al.,

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

2009) is applicable to identify pathways with complex pathway topology. The bottleneck of

this approach is the fast growth of the number of compounds (i.e., combinatorial explosion in

the hypergraph case) and relatively heavier computational burden. Likewise, double-direction

search from both the source and target chemicals (as used in Uri Alon Biopathfinder, Noor et

al., 2010) is a combination of substrates-side breadth-first-search and product-side

retrosynthesis and allows identification of pathways with complicated structures while saving

runtime to a great extent. Finally, depth-first search (as used in BNICE, Hatzimanikatis et al.,

2005) is an alternative method used by some pathway identification algorithms, yet it is

limited to simple graph pathway networks. To our knowledge, there is no existing pathway

finding algorithm that uses depth-first search to explore a hypergraph reaction network.

Several pathfinding webservers extended the search space of existing databases by applying

the graph-based pathway finding techniques, such as XTMS, ATLAS and MINE. XTMS

(Carbonell et al., 2014), for example, collects metabolites and reactions from MetaCyc and

EcoCyc (Caspi et al., 2016, Keseler et al., 2017) and then uses Retropath (Carbonell et al.,

2011; Carbonell et al., 2013) to identify intermediate heterologous reaction steps. On the

other hand, ATLAS (Hadadi et al., 2016) and MINE (Jeffryes et al., 2015) are hypothetical

databases built based on KEGG and BNICE computational frameworks.

One of the main challenges that still remain concerning pathway identification is to enhance

the prediction of linear pathways and to recognize branched pathways (using a hypergraph or

a bipartite graph network representation) (Figure 3B). Earlier pathway finding algorithms,

i.e., that use simple edges to represent reactions from one compound to another, are limited in

identifying linear pathways and are not capable of predicting branched pathways with more

complex topologies, such as, condensation or synthetic reaction steps. Retropath2.0 is already

an advance in this sense as it uses hypergraph network representation of the pathway

network, where the details of a reaction can be stored perfectly by a directed hyper-edge

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

connecting multiple substrates and products (Klamt et al., 2009). However, the limitation is

that it allows each promiscuous enzymatic reaction to take only one non-native substrate.

There are also other algorithms that try to identify branched pathways through merging linear

pathways, such as LPAT (Heath et al., 2011). However, to our knowledge, there is no current

computational method that uses hyper-edge reaction rules which would allow multiple non-

native substrates in the reaction. Though using hyper-edge rules will heavily increase the

computational burden, hyper-edge reaction rules will allow identification of literally all

potentially reachable compounds and broaden the chemical repertoire in the putative network.

2.2 Pathway Ranking and Selection

Pathway ranking, as shown in Figure 3E, is another important step in the biosynthetic

pathway design workflow. Resulting pathways from the identification step are usually very

numerous and cannot be analysed manually. These pathways are, therefore, ranked based on

pathway length, number of non-native reaction steps, and thermodynamic feasibility, in

which a more negative delta G is more favorable (Li et al., 2004). GSMM (Genome-scale

metabolic models) and FBA (flux balance analysis), which will be discussed in section 4.1,

are also performed in some methods to estimate the maximum product yield.

Some recently developed pathway design methods combine different pathway restraints and

factors and sum them into an overall score for pathway prioritization. For example, the

method proposed by Calhoun et al. (2018) includes molecular docking, cheminformatics,

similarity ensemble approach (SEA) and chemical transformation feasibility in the scoring

function they have developed for identifying pathways. Other possible approaches for

pathway ranking include assessing the toxicity of intermediate compounds and binding site

covalence (i.e., Cho et al., 2010).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Finally, artificial intelligence may bring new solutions to pathway discovery and ranking. A

recent computer-aided tool for chemical synthesis used a deep neural network trained on

millions of chemical reactions to rank transformation patterns and determine the feasible

synthesis method (Segler et al., 2018). Such tool uses Monte Carlo tree search and symbolic

artificial intelligence to learn from the database and rank the most promising retrosynthetic

steps, simulating the decision-making process of an expert. Although it was used in the

context of designing retrosynthesis steps for organic chemistry, it could also be applied for de

novo biochemical pathway design.

3. Enzyme discovery and optimization

A subsequent step following in-silico pathway design is to find suitable enzymes that will

make the consecutive chemical transformations predicted by the computational tools.

However, even considering that most of the prediction tools apply methods based on

experimentally identified enzymatic reaction rules, it is very likely that most of the pathways

will contain at least one unknown enzyme, which is especially the case concerning pathways

for non-natural chemicals (Campodonico et al., 2014).

In some scenarios, researchers will have an idea of which enzyme, or set of enzymes, is the

best candidate for making the target chemical (Lee et al., 2012). However, besides this

rational design, recent advances in enzyme engineering expanded the possibilities of

implementation of the in-silico pathway designs in microbial hosts. For example, directed

evolution is used to further optimize natural enzymes for increased activity and substrate

specificity (Arnold, 2017). Additionally, even if no enzyme is found for the proposed

reaction, related enzymes can be modified via rational design (Tiwari et al., 2012), or could,

potentially, be created de novo by using protein simulation algorithms (Zanghellini, 2014).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Nevertheless, it is worth to mention that while directed evolution is at an advanced stage of

development, there are not many successful cases of enzymes being created de novo despite

the recent progress in the field.

In this section, we will discuss genome mining for enzyme selection, de novo protein

engineering and directed evolution, focusing on the cutting-edge techniques that are allowing

the production of non-natural chemicals in microorganisms.

3.1 Genome mining for enzyme selection and optimization

A starting point for the construction of the pathway is to evaluate the subclass of the enzyme

predicted by the prediction algorithms and search in the available databases for enzymes that

would perform the proposed reaction. If a suitable enzyme for a determined substrate is not

found, there is still the possibility for the discovery of enzymes that may react,

promiscuously, with the desired substrate, since many enzymes act on multiple substrates

(Prather and Martin, 2008). Searching for enzymes in genome databases using only genetic

information and without a structure at hand is often called “genome mining” (Ziemert et al.,

2016) and precedes the process of expression and screening for performance. The best

candidates are then selected for combinatorial expression in a heterologous host.

One example of this concept of enzyme promiscuity screening being applied to pathways for

producing non-natural chemicals is the production of adipic acid via the muconic acid

pathway. Until recently, the enzyme needed to reduce both double bonds of muconic acid to

make adipic acid was unknown. To this end, Joo et al. (2017) cloned and screened about 25

genes from putative old yellow enzymes and 6 genes from putative enoate reductases (ER)

from GenBank database for activity against muconic acid. They found that at least 2 of such

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

ERs, from Bacillus coagulans and Clostidium acetobutylicum had excellent activity reducing

muconic acid, discovering the missing link of that pathway.

Another example is the 1,4 BDO production in E. coli using the synthetic pathway predicted

in-silico by SimPheny (Yim et al., 2011), already mentioned in section 2.1. The 7 step

pathway required 5 heterologous enzymes, that had to be screened from different

microorganisms, including cat2, sucD and 4hbd, from Porphyromonas gingivalis, sucA from

Mycobacterium bovis, and adhE2 from Clostridium acetobutylicum (Yim et al., 2011).

In-silico pathway prediction models can aid in this “screen and select” approach by

identifying and ranking candidate enzymes. Carbonell and Faulon (2010) created a method

for predicting catalytic and substrate promiscuity using gene sequence similarities and

supervised machine learning algorithms. In the same manner, the genome-wide method

PROPER (enzyme PROmiscuity PrEdictoR) predicts promiscuous activities of metabolic

genes (Oberhardt et al., 2016). For that, it uses a permissive PSI-BLAST approach to search

for distant gene sequence similarities and analyze phylogenetic trees to assign secondary

activities to enzymes. This method was used in combination with genome-scale metabolic

modeling (GEM-PROPER) to predict the effect of exchanging enzymes with promiscuous

analogues from alternative metabolic pathways. A next step to further advance those models

would be to validate the promiscuity score based on gene sequence correspondence by using

similarities in binding site topography.

3.2 Directed evolution and rational design of enzymes

If a suitable promiscuous enzyme cannot be found to complete the metabolic pathway, it is

necessary to design novel enzymes for such reactions that have not been identified to occur in

nature, for example, using directed evolution and rational design on enzymes (Figure 4A).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Directed evolution is an experimental technique used for protein engineering that imitates the

natural process of evolution, but in a considerably faster manner. For that, a large library

made from a starting enzyme mutant undergoes several rounds of iterative mutagenesis under

a selection pressure to elect the ones capable of best performing a desired function (Cobb et

al., 2013).

Directed evolution is a powerful tool in metabolic engineering as it allows researchers to

select for the enzyme with the right features, such as, improved activity in non-native

substrates, changing enantioselectivity, or even to create a new non-natural enzyme activity,

without having to devote resources into elucidating why those features make the enzyme

behave as desired (Packer and Liu, 2015; Arnold, 2017).

The first stage of directed evolution is obtaining the library of mutants. Several mutagenesis

methods have been developed for this purpose, including, random mutagenesis, site-directed

mutagenesis and site-saturated mutagenesis (Marcheschi et al., 2013). DNA shuffling, or

recombination, can also be used to make random combinatorial libraries. It involves

randomly fragmenting and rejoining DNA sequences and is often used in place of or in

addition to error-prone PCR to generate a diverse combinatorial library (Cobb et al., 2013).

Other random mutagenesis methods include transcription factor engineering, multiplex

genome engineering (MAGE), and are covered in the review by Marcheschi et al. (2013).

Directed evolution was used to adapt a series of enzymes to perform totally novel non-natural

reactions, including the enantioselective cyclopropanation of styrenes (Coelho et al., 2013), a

metal-oxo–mediated anti-Markovnikov oxidation of styrenes (Hammer et al., 2017) and an

asymmetric cyclopropanation using evolved myoglobin (Bajaj et al., 2016). The use of

directed evolution has also been used for the production of non-natural chemicals, such as the

non-natural carotenoid C50 astaxanthin (Furubayashi et al., 2015).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

The primary obstacle to any directed evolution work is the development of an adequately

sensitive and robust screen for the enzyme activity that is being selected for (Cobb et al.,

2013). The current gold standards for such measurements, gas and liquid chromatography

methods (GC and LC) and mass spectrometry (MS), are not scalable for high-throughput

operations due to factors such as reagent cost and signal to noise ratio (Dietrich et. al, 2010).

As the libraries generated for directed evolution are large (102 – 10

6 variants) (Dietrich et. al,

2010), these methods of measurement become the bottleneck of the process. While this

limitation can be overcome through the reduction of libraries by using rational design prior to

directed evolution (Cobb et al, 2013), such libraries may still be considered large for LC-MS

based methods.

Rational protein design, instead of creating random mutants straightaway, proposes a

preliminary in-silico screening, in which many combinations of amino acid exchanges would

be tested beforehand by computational simulation to identify the changes that are most likely

to succeed, thus creating a much smaller library of mutants to screen compared to random

mutagenesis. This method allows solutions with the mutation of dozens of amino acids at a

time. Several Molecular Dynamics (MD) simulations tools are available to analyse the

tridimensional enzyme structures and guide rational designs.

Recently, Walther et al. (2017) demonstrated the usefulness of rational design of enzymes in

the context of non-natural chemicals. They used molecular modeling and structural analysis

for engineering the enzymes needed for the production of the non-natural chemical 2,4

dehydroxybutyric acid (2,4 DHB): malate kinase, malate semialdehyde dehydrogenase and

malate semialdehyde reductase. Similarly, Tai et al. (2016) and Wang et al. (2017b) also used

structural analysis to redesign native 2-ketoacid decarboxylase and diol dehydratase,

respectively, for heterologous production of 1,4 BDO.

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

3.3 High-throughput library screening with biosensors

As the systems biology-based techniques rely on the acquisition of large amounts of data, a

significant bottleneck is the screening and analysis. While the generation of mutant libraries

can be very high throughput, their characterization is limited by the use of lower throughput

techniques such as chromatography (e.g. HPLC, GC) and mass spectrometry (Rogers et al.,

2016; Skjoedt et al., 2016). An alternative method to overcome this limitation in screening is

the development of biosensors to enable the screening or selection of massive libraries, for

example, in directed evolution (Rogers and Church, 2016; Rogers et al., 2016).

Biosensors are based on the use of biological mechanism of molecular recognition for a target

chemical. In principle, a signal molecule, that can be detected by color, fluorescence, or

which ensures cell fitness, is coupled with the presence of the target chemical (Rogers et al.,

2016).

Thus, biosensors can be used to screen libraries of enzymes when searching for novel

biocatalysts for the purposes of host development and optimization (Figure 4B). For example,

biosensors expressed in vivo can help identify the highest producers of a certain non-natural

product in a set of clones obtained from metabolic engineering strategies, such as, the

introduction of a heterologous production pathway (Snoek et al., 2018). Additionally,

biosensors can be applied to the identification of the best-performing enzymes to react on a

given substrate (Ho et al., 2018).

Most biosensors are based on either a transcription factor or a sensor-actuator system. For

example, when applied to the production of non-natural chemicals, the chemical produced

serves as the input signal, which upon transcription factor recognition will result in the

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

expression of an actuator component such as a reporter gene (e.g. GFP) or an antibiotic

resistance gene that will enable survival (Dietrich et al., 2013; Li et al., 2015; Leavitt et al.,

2017; Snoek et al., 2018).

Ligand-recognition orthogonality is a key feature of any biosensor. When a biosensor is

designed to respond to a non-natural chemical that is being produced, the system has an

inherent element of orthogonality since, in theory, no other chemical should be detected.

However, cross-reactivity with highly similar molecules is possible and needs to be addressed

through methods such as negative selection. Dixon et al. (2010) strategically mutated a

purine-binding riboswitch native to E. coli by targeting the most important base pairs

involved in ligand binding. They were able to construct two mutant switches with high

selectivity for ammeline and azacytosine (two non-natural ligands in said organism) and

virtually none towards the original purine ligands (Dixon et al., 2010).

Another biosensor feature that is promising for future high-throughput screens is the addition

of levels of control by changing media condition at a given time point. Snoek et al. (2018)

developed a biosensor-selector system which responds to cis, cis-muconic acid concentration

by inducing a G418 resistance-conferring gene, with the false positive rate being altered by

changing the media conditions. Analogously, Chen et al. (2018) showed in their study that

response time of their putrescine-detecting biosensor was dependent on the nutrients

available in the media, particularly with regards to nitrogen source. De Paepe et al. (2017)

and Zhang et al. (2015) provide an in-depth review of the different biosensor strategies and

engineering techniques that can be used to alter biosensor binding specificity as well as the

operational and dynamic ranges.

Higher biosensor efficiency and response times are also possible by using cis-acting RNA

biosensors, also known as riboswitches. These systems work on a faster time scale, since only

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

transcription of the encoding RNA is necessary and there are no regulatory proteins to be

translated. Essentially, riboswitches consist of three main components: an aptamer (sensor), a

reporter or selector gene (actuator), and a bridge that links the two and promotes structural

stability (Qian and Cirino, 2016). Following a binding event, a conformational change occurs

in the three-dimensional structure of the aptamer component, leading to an exposed

ribosomal binding site (RBS) upstream of the actuator locus (Qian and Cirino, 2016). Gene

expression is hence activated by the production of the target chemical, which could be

cofactors, metal ions or amino-acids.

Recently, efforts from Jang et al. (2017) resulted in the development of a bacterial riboswitch

which activates the expression of a GFP reporter upon detection of naringenin, a flavonoid

molecule of plant origin. This riboswitch features a tetracycline resistance marker upstream

of the reporter, thus adding another layer of control (Jang et al., 2017). By adding tetracycline

to the media, the system is thus able to report which strains are the highest producers while

enriching the resultant library of clones for those which contain the encoding construct (Jang

et al., 2017).

Encoding the biosensor in a separate construct can be exploited to achieve higher biosensor

modularity, making the strain optimization faster. Building on the naringenin riboswitch, Xiu

et al. (2017) developed a co-culture version of the biosensor with the biosynthetic pathway

and biosensor components divided into two separate modules. Thus, each of the two bacterial

strains in the community carries one of the two components, minimizing the negative impact

on one of the components when the other is altered or optimized.

3.4 De novo protein engineering

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Considering that the average size of a protein is 200 amino acid residues, and that there are

20 different natural amino acids, the number of diverse amino acid sequences that could

possibly occur is in the order of 20200

. However, the number of distinct proteins that are

produced by existing organisms is only about 1012

. From this, one can deduce that evolution

has explored and selected just a small fraction of the array of all possible protein structures

(Huang et al., 2015).

In this context, de novo protein engineering is useful to generate and explore proteins that do

not exist in nature, including entirely novel enzymes to catalyse non-natural reactions. This

promising approach is an ultimate tool to fill the biochemical gaps in the in-silico pathway

designs that could not be completed by standard enzyme searching or re-engineering existing

enzymes.

De novo protein engineering attempts to create artificial proteins using computational tools

that predict chemical reactions, interactions and tridimensional structures using the principles

of molecular dynamics, quantum mechanics and protein biophysics. Several reviews take a

deeper look at this topic, including Kiss et al. (2013), Zanghellini (2014), Wolfson et al.

(2015) and Huang et al. (2015).

One of the most relevant approach for de novo enzyme design is the “inside out” method, as

revised by Kiss et al. (2013) and Zanghellini (2014). This method consists of four steps,

illustrated in Figure 4C. The first step is to elect an appropriate catalytic mechanism, establish

the transition state of the reaction, and then use quantum mechanics calculations to choose

the tridimensional arrangements of amino acids residues around the transition state that

maximizes its stability. This arrangement is, thus, an idealized active site, also named

“theozyme”, or theoretical enzyme (Kiss et al., 2013). Next, RosettaMatch algorithm is used

to match the putative active site with different protein backbone scaffolds, stored in its

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

database, which can support the geometry of the theoretical structural pocket. Subsequently,

RosettaDesign (Liu and Kuhlman, 2006) algorithm is used to optimize the identity and

position of the amino acid residues surrounding the transition state to both stabilize its

conformation and to maximize the affinities of the residues with the transition state. Finally,

as several possible designs are created for each transition state structure, the designs are

ranked based on geometrical compatibility with substrates and products, as well as on

transition-state binding energy.

The pioneering studies of Jiang et al., (2008) and Röthlisberger et al. (2008) used this

approach to successfully expand the range of possible enzymatic reactions by creating totally

novel enzymes. Röthlisberger et al. (2008) computationally designed a protein catalyst for

Kemp elimination. Their designed enzyme performs the Kemp elimination of 5-

nitrobenzisoxazole, where a proton (H) transfer from carbon, simultaneously with a cut of the

nitrogen–oxygen (N-O) bond, resulted in the product cyanophenol. Jiang et al., (2008)

designed retro-aldolases that use four different catalytic motifs to catalyze the breaking of a

carbon-carbon bond in a non-natural substrate, in this case, 4-hydroxy-4-(6-methoxy-2-

naphthyl)-2-butanone. Since then, the same “inside out” approach was used to create a vast

range of enzymes for non-natural reactions, including the Diels-Alder reaction (Siegel et al.,

2010) and Morita−Baylis−Hillman reactions (Bjelic et al., 2013). Zastrow and Pecoraro

(2014) further extended the range of novel catalytic proteins to the more complex

metalloenzymes, by creating an enzyme for p-nitrophenyl acetate hydrolysis that uses two

different metal ions: a Zn(II) ion, which is important for catalytic activity, and a Hg(II) ion,

which provides structural stability.

Two of the latest challenges for de novo protein design, the creation of novel β-barrel proteins

and the introduction of binding sites for small molecules, were also recently addressed. Dou

et al (2018) remarkably created for the first time a novel and functional β-barrel protein, as

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

well as the first full de novo design of a small-molecule-binding protein. Their protein

construct binds a chromophore DFHBI (3,5-difluoro-4-hydroxy-benzylidene imidazolinone)

and was successfully expressed in E. coli, yeast and mammalian cells. Besides the proven

usefulness of the designed protein for fluorescence-based imaging and sensing applications,

their work was a proof of concept that the field of de novo protein design is accelerating

towards increasingly advanced ligand-binding proteins, sensors and catalysts.

In conclusion, while rational design and de novo protein engineering are still the most

difficult ways to obtain a new enzyme due to the requirement of sophisticated computational

methods, recent examples of successfully applied rational and de novo protein designs are

bringing new prospects for the field. We also expect that further advances in the automated

workflows based on biosensors and robotics in recently established biofoundries will aid the

analysis required for the discovery and engineering of novel catalysts by these different

protein engineering techniques.

4. Host optimization

Host optimization is critical for transferring the production concept from laboratory scale to

commercialization. Even after prediction, ranking and selection of a suitable synthetic

pathway, with the most favorable thermodynamic and minimal reaction steps, and, despite

properly expressing the pathway enzymes and optimizing enzyme activity, frequently,

synthetic pathways for non-natural chemicals are not economically feasible. Recurrent

problems that challenge those pathways are related to energy and cofactors imbalance,

carbons diverted to by-products, cell burden caused by overexpression of the pathway,

toxicity of products and intermediates and so forth. A more comprehensive view of the cell’s

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

interactions and behaviours is thus warranted and hence, there is need to leverage genomic

scale metabolic models (Mahadevan et al., 2005).

Ideally, the host strain would divert as much flux as possible to the desired production

pathway. However, the excess demand for carbon, cofactors and energy for the pathway may

impose a burden for the cell, decreasing growth and cell fitness (Chubukov et al., 2016).

Thus, cofactor usage and generation, substrate consumption, strain oxygenation conditions

and energy balances are parameters that must be considered.

Aside from computational methods such as metabolic modeling, there are various concepts

within systems biology which can be experimentally implemented. Several strategies can be

used in the framework of GSMMs, including finding growth-coupled production (section

4.1.2; Klamt and Mahadevan, 2015) or orthogonality strategies (section 4.4; Pandit et al.,

2017). On the other hand, to address the toxicity of products and intermediates, a good

strategy is to engineer the regulation machinery of the cell by applying the concept of

dynamic control (section 4.3; Chen and Liu, 2018), while also balancing the pathway

expression.

One example of the importance of host optimization is demonstrated by the work of Yim et

al. (2011). They were the first to demonstrate the production of the non-natural chemical 1,4

BDO in vivo, with the initial strain producing 1.2 g/L in 50 h. However, their research group

raised the productivity to about 80-90 g/L by applying genome-scale metabolic models

(GSMMs) and flux balance analysis (FBA) to identify targets for gene deletion, increasing

the flux to products (Burgard et al., 2016; Andreozzi et al., 2016). In this section, we review

how these systems biology concepts can be applied for host optimization.

4.1 Genome-scale models and analysis

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

With the onset of whole-genome sequencing arose the development of genome-scale

metabolic models (GSMMs), which are powerful tools that facilitate the in-silico study of

metabolism. The models are mathematically represented by the stoichiometry of all the

reactions in the metabolic network of an organism (O’Brien et al., 2015; Lee and Kim, 2015).

Developments in next-generation sequencing have allowed faster sequencing than ever

before, and combined with other technologies, has greatly increased the availability and

speed at which GSMMs are constructed (Monk et al., 2014; Henry et al., 2010). Due to the

relative simplicity of GSMMs, exogenous metabolites, reactions and genes may be added

relatively easily using model manipulation platforms such as COBRA Toolbox and may

include gene-protein-reactions (GPRs) (Reed et al., 2003). In addition, large libraries of

GSMMs, such as BiGG Models (King et al., 2015), and open source platforms such as KBase

(Arkin et al., 2018), have made accessing and manipulating metabolic models a simple task.

4.1.1 Phenotype Simulation Methods

An important step in the pipeline for synthetic biology systems is phenotype simulation.

Here, GSMMs are used along with constraint-based modeling (CBM) methods to predict

phenotypic traits such as growth rate and reaction fluxes under various conditions. At this

point, all CBM methods rely on the assumption of an intracellular steady state in terms of

metabolite concentrations. The most commonly used CBM method for phenotype simulations

is Flux Balance Analysis (FBA) (Orth et al. 2010; O’Brien et al., 2015; Lewis et al., 2012).

This method is often used under the assumption that organisms will naturally operate with the

objective of maximizing growth. Other methods that assume different objective functions,

including minimization of metabolic adjustment (MOMA) (Segre et al., 2002), regulatory

on/off minimization (ROOM) (Shlomi et al., 2005), relative change (RELATCH) (Kim et al.,

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

2011), and maximum entropy (MAXENT) (Martino and Martino, 2018) have also been

shown to be effective. There are additionally several variants of FBA that help to achieve

different objectives, including minimization of network flux (Lewis et al., 2010),

consideration of dynamic systems (Mahadevan et al., 2002), including both dynamics and

enzyme usage (Jabarivelisdeh and Waldherr, 2018), and incorporation of regulatory

information (Covert et al., 2001; Shlomi et al., 2007). These methods may be accessed

through open-source platforms such as the Constrain-Based Reconstruction and Analysis

(COBRA) Toolbox (Schellenberger et al., 2011), COBRApy (Ebrahim et al., 2013), Cameo

(Cardoso et al., 2018) and the Portable System for the Analysis of Metabolic Models

(PSAMM) (Steffensen et al., 2016).

4.1.2 Computational Strain Optimization Methods

To achieve a certain network functionality, such as, the increase of metabolic flux through a

non-natural pathway, Computational Strain Optimization Methods (CSOMs) can be used to

identify genetic perturbations such as genetic knockouts, over/under expression of genes,

heterologous insertions and cofactor specificity swapping. Such methods attempt to achieve a

desired network functionality by taking an optimization approach to solutions, as opposed to

complete enumeration of all possible results (Maia et al., 2015). Each of these methods

generally requires the use of a metabolic model as well as the implementation of a phenotype

simulation method. The most important CSOMs are presented in Figure 5.

Several CSOMs have been published, however, this review will focus primarily on

constraint-based methods because they can be built directly from stoichiometric data, which

is, in turn, easily obtained from genome sequencing (Maia et al., 2015). As opposed to the

traditional building of manually curated GSMMs, the open-source platform CarveMe

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

presents a top-down automated methodology of genomic reconstruction that uses a manually

curated universal model which is then combined with annotated genomes to create species-

specific or community models in a fast and scalable manner (Machado et al., 2018).

Growth-coupled pathway design

Synthetic chemical production pathways often impose a metabolic burden on the organism.

Consequently, within a population, the strain that loses the functionality of the synthetic

pathway may outcompete the producing strains. One strain optimization approach that is

frequently used to circumvent this disadvantage is to couple growth with the synthetic

pathway.

This strategy will ensure that product will be consistently produced over time, meaning that

growth, or even the survival of the cell, can only happen if there is flux though the synthetic

pathway. In addition, it is possible to use adaptive laboratory evolution to select for

increasing chemical production capacity while screening for maximum growth (Conrad et al.,

2011).

The principle of this strain optimization approach is to use CSOMs, such as OptKnock, to

find deletion strategies that either force flux through the synthetic pathway to balance energy

and redox cofactors, or, to ensure the synthesis of biomass precursors, for example, by

reducing the cell’s ability to synthetize building block metabolites. As a result, the growth of

the organism itself becomes the driving force of chemical production (Klamt and Mahadevan,

2015).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

The coupling can be either weak, or strong. Weak coupling happens when satisfactory

product yield is only obtained if the cell grows with maximal or close-to-maximal biomass

yield. Strong coupling, otherwise, means that chemical production must happen, even without

growth, to ensure cell’s survival, (Klamt and Mahadevan, 2015). A depiction of the

production envelopes for strong and weak growth coupling is shown in Figure 5A. It has been

shown by Klamt and Mahadevan (2015) that, under certain conditions, any metabolite of the

central metabolism can be strongly coupled in E. coli. Von Kamp and Klamt (2017) further

validated this theory in S. cerevisae, Corynebacterium glutamicum, Aspergillus niger and the

cyanobacterium Synechocystis sp..

Although there are not many examples of this strategy being applied to producing non-natural

compounds, its suitability has been recently demonstrated for coupling the production of a

series of natural compounds in several different microorganisms, both aerobically and

anaerobically.

FBA-based methods

The first CSOM that predicts growth-coupled designs to be developed was OptKnock, which

suggests a set of reaction knockouts that maximize production of a target metabolite (Burgard

et al., 2003). While the solutions proposed by OptKnock are growth-coupled, this coupling is

often weak. Some methods that produce strong growth-coupled strategies were developed to

address this and are discussed in this review. As abovementioned, a representation of

OptKnock, as well as the difference between strong and weak coupling, are shown in Figure

5A.

Methods such as OptKnock produce solutions which are considered to be “optimistic”, as the

optimal solution is used as the operating point in terms of the engineering objective. This

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

issue was addressed through RobustKnock, which considers alternate solutions that could

divert flux from the target compound, as well as by the pair of methods P-OptKnock and P-

ROOM, which assume a “worst-case scenario” for the engineering objective (Tepper and

Shlomi, 2009), (Apaydin et al., 2017). The first method to introduce gene over and under

expression alongside genetic knockouts was OptReg (Pharkya and Maranas, 2006). Later, an

algorithm termed Enhancing Metabolism with Iterative Linear Optimization (EMILiO) was

formulated by Yang et al. (2011), which provided results similar to OptReg but using a

computationally scalable approach. Methods that suggest heterologous reactions as genetic

perturbations include OptStrain and SimOptStrain, which calculate heterologous additions

and knockouts separately and simultaneously, respectively (Pharkya, 2004; Kim et al., 2011).

A final method worth mentioning is OptSwap, which is one of the few CSOMs that predicts

modulations in cofactor specificity alongside genetic knockouts (King and Feist, 2013).

EMA-Based Methods

CSOMs relying on Elementary Mode Analysis (EMA) are powerful methods that compute

sets of elementary flux modes (EMs), which are non-decomposable sets of reactions in a

network (Ruckerbauer et al., 2015). A minimal cut set (MCS) is defined as an irreducible set

of reactions based on elementary flux modes of which when inactivated, results in metabolic

network failure for a particular function (Klamt and Gilles, 2004). In application, these

undesired modes can include any network arrangement in which a minimum yield for some

target product is not achieved. In effect, elimination of the undesired modes leads to strong

growth coupling. One can also include additional constraints, such as to biomass yield, to

select for desired modes in the form of constrained minimal cut sets (cMCSs) (Hädicke and

Klamt, 2011). A depiction of the solution spaces for both MCS and cMCS is shown in Figure

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

5B and C. The results of these methods are given as a list of genetic knockout sets that

produce growth-coupled strategies. Recently, these methods have been adapted to incorporate

two-stage fermentations strategies, termed the Metabolic Valve Enumerator (MoVE)

(Venayak et al, 2018a). MoVE calculates a set of metabolic valves, which are reactions

expressed during a stage of growth then turned off during a stage of production, as well as a

set of reaction knockouts which remain inactive throughout both stages. The result is a strain

that achieves a minimal biomass yield during the growth stage, and a strong growth-coupled

production of a target compound during the production stage, as is shown in Figure 5D.

Methods based on these principles may be used in conjunction with the CellNetAnalyzer

framework (von Kamp and Klamt, 2017).

The scalability of MCS and other EM-based methods on GSMMs is poor due to the

computational nature of mixed integer linear programming (MILP) problems. The

MCSEnumerator was developed to produce MCSs from the smallest number of perturbations

to the largest. This is done by enumerating all elementary modes, mapping the MCSs to the

modes and then determining which EMs are the smallest (von Kamp and Klamt, 2014).

Regulatory information was incorporated to eliminate results containing non-enzymatic or

non-annotated reactions, denoted as regulatory constrained MCS (rcMCS) (Jungreuthmayer

and Zanghellini, 2012), while constrained regulatory MCS (cRegMCS) was developed to

consider both genetic knockouts and over/underexpression of genes, resulting in more

solutions requiring fewer genetic perturbations (Mahadevan et al., 2015). Whereas most

MCS-based methods are approached at the reaction-level, many reactions in GSMM are poor

targets for experimental knockouts, which was an issue addressed by genetic MCS (gMCS), a

method which incorporates gene expression data (Apaolaza et al., 2017). Attempts to sort

through cMCSs by pure strain performance such as genetic algorithm MCS (GA-MCS) have

been published, but their efficacy on GSMMs has not yet been shown (Nair et al., 2015). An

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

EMA-based method that does not focus on MCSs is the modular cell (MODCELL)

framework, which aims to design a strain that is auxotrophic for some exchangeable

production mode that would be coupled to target compound production and growth (Trinh et

al., 2015). In addressing the computational burden of EMA-based methods, it was shown

that alternative integer linear programming (AILP) could be used to significantly reduce the

computational time required in calculating MCSs (Song et al., 2017).

Metaheuristic methods

Metaheuristic CSOMs are advantageous in that they are much less computationally expensive

than FBA- and EMA-based methods. This is due to the decoupling of the biological and

engineering objective functions, allowing for independent calculations for each. This is

beneficial, as computational power is made readily available for more complex phenotype

simulation methods. Despite the benefits of metaheuristic methods, due to their nature these

methods cannot guarantee an optimal solution. A schematic of the typical workflow for

metaheuristic methods is shown in Figure 5E. The first metaheuristic method that was

published was OptGene, which uses a genetic algorithm modeled to evolutionary theory to

predict sets of genetic knockouts (Patil et al., 2005). Other methods incorporate the presence

of noise (Costanza et al., 2012), build iteratively upon previous solutions (Lun et al., 2009;

Rockwell et al., 2013), or consider shadow price analysis (Ohno et al., 2014). Metaheuristic

methods hold promise in the design of organisms that produce non-natural chemicals due to

the facilitated scalability with GSMMs.

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

4.2 Kinetic Modeling for Strain Improvement

As previously mentioned, GSMMs are uncomplicated since they incorporate only

stoichiometric data. While this has some advantages, it also makes them limited in terms of

bioengineering applications. While stoichiometric models assume a quasi-steady state and set

the change in metabolite concentrations with time to zero, kinetic models do not make this

assumption. This leads to the requirement to incorporate enzymatic expression and kinetic

data as well as other dynamics that are not captured by stoichiometric models. Kinetic models

may therefore be advantageous in the analysis of transient behaviours of strains that are often

difficult to estimate.

As an alternative to traditional constraint-based models, kinetic models further include

physiologically relevant data, such as time-dependent metabolite concentrations and

enzymatic regulation. This enables the dynamic analysis of biological systems for enhanced

in-silico hypothesis generation. Kinetic models, however, suffer from the burden of

estimating several parameters which are usually not readily available, so the number of

reactions that they explicitly describe is often mostly limited to core metabolism (Srinivasan

et al, 2015). However, it is expected that as metabolomic, proteomic, and transcriptomic data

becomes more accessible under various conditions, kinetic models will become more useful.

Machine learning algorithms have been suggested as a promising solution to the issue of

parameterization. The ORACLE (Optimization and Risk Analysis of Complex Living

Entities) framework effectively integrates kinetics with thermodynamic, omics data (such as

fluxomics and metabolomics), along with network stoichiometry is depicted in Figure 5E

(Miskovic and Hatzimanikatis, 2010). ORACLE was successfully implemented by Andreozzi

et al. (2016) to predict a modular pathway for the biosynthesis of 1,4-BDO in E. coli. Other

machine learning algorithms were also applied to build kinetic models for the production of

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

limonene and isopentenol in E. coli, giving qualitatively accurate results (Costello and

Martin, 2018).

4.3 Dynamic control and pathway balancing

A frequent difficulty in the metabolic engineering of microorganisms is achieving the finest

control possible over the production of the target chemical (natural or non-natural). Ideally, at

any given time, an efficient cell should modulate its level of chemical production in

accordance with its environmental conditions and intracellular metabolic state. Thus, the idea

behind pathway balancing is to optimize the inadequacies that can arise from the introduction

of heterologous pathways.

Traditional methods of pathway balancing often resort to static strategies like pathway

overexpression, gene deletions to deviate fluxes, or promoter engineering for altering

expression levels (Xu et al., 2014). The most widely recognized balancing strategy is

preventing the accumulation of problematic metabolites by increasing its downstream fluxes

while decreasing upstream ones (Brockman and Prather, 2015; Jones et al., 2015). However,

such modifications only scratch the surface, as production relies on cell fitness, which

depends on the organism’s ability to sense and respond to recurrent environmental

perturbations (Venayak et al., 2015; Dahl et al., 2013; Ewald et al., 2017; Leonard et al.,

2010). Overexpression can also have negative implications, such as growth inhibition (Liu et

al., 2015; Xu, 2018). Jones et al. (2015) provide a more detailed review on the existent

approaches for pathway balancing, including ribosomal binding site engineering, post-

translational modifications, and others mentioned previously.

Accordingly, there is an existing consensus that cells with dynamic and automated self-

regulation are better candidates for industrial use and scale-up (Ewald et al., 2017). The most

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

prominent form of dynamic control uses genetic circuits based on inducible and modular

gene expression, usually featuring activation and repression of activity of a particular

transcription factor (Liu et al., 2015; Rantasalo et al., 2018; Xu et al., 2014). An advantage of

this system is the high specificity of transcription factors to recognize an input signal; for

example, the presence of a metabolite or a change in temperature or other condition (Xu,

2018). Gene circuits that modulate pathway overexpression according to the levels of

available product lead to enhanced survival and improved titers (Liu et al., 2015). Other

studies use metabolic switches for the separate modulation of expression of upstream and

downstream reactions to a metabolite according to its abundance (Xu et al., 2014). This better

manages cellular resources but can also respond to the need for optimized fermentations

which typically require multiple steps with different substrates or oxygen uptake rates, and in

which either growth or production are prioritized at different times (Figure 6C). Two-state

(bistable) dynamic control provided by tools such as genetic toggle switches facilitates such

fermentation protocols by quickly turning off and restoring wild type growth phenotypes

when production is not prioritized, thereby minimizing the cellular effects of growth burden

(Anesiadis et al., 2013; Bothfeld et al., 2017; Venayak et al., 2018b). The central idea

underlying the concept of dynamic control is the temporal segregation of cellular growth and

bioproduction pathways, so that both are optimized in such a way that the trade-off that

usually exists between them is minimized (Figure 6C). This approach allows the same strain

to maximize yield from a given substrate and minimize production of undesired by-products,

while still growing unaffected by the detrimental effects a bioproduction pathway would

typically impose (Bothfeld et al., 2017).

Dynamic control has been implemented for the production of some high-value non-natural

chemicals over the years, although its use is still infrequent compared to natural chemical

production. Liu et al. (2015) introduced a 1,4-BDO production pathway into E. coli and

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

demonstrated that a two-part biosynthetic strategy with separate carbon fluxes for production

and dynamic regulation achieved autonomous production. Their system not only increases the

efficiency of 1,4-BDO synthesis but is also inherently trivial to optimize given the low degree

of commonalities between the production and regulatory pathways (Liu et al., 2015). This last

feature is known as orthogonality and is discussed in the next section in more detail, as well

as in the context of the Liu et al. (2015). Self-regulation has also been explored to prevent

toxic intermediate accumulation in the mevalonate pathway, which can be used to make a

variety of terpene-derived synthetic drugs (Dahl et al., 2013). Other technologies such as

CRISPR-based repressible promoter regulation are also used in the design of genetic circuits

for minimized crosstalk and high responsiveness, which is directly applicable to non-natural

chemical production (Didovyk et al., 2016).

Dynamic regulation for the production of non-natural chemicals is still not widespread.

However, given the ineffectiveness of static methods alone and the great advantages

automated control presents for host optimization, it could become useful to achieve more

efficient and competitive bioproduction (Zhang et al., 2015; Wang et al., 2017c; Wang et al.,

2018). Recent innovations in dynamic control include the use of gene circuit toolboxes and

the integration of multiple gene circuits for more robust and even more sophisticated

expression. Despite being promising for scale-up, only a few recent efforts have attempted

such a task (Rantasalo et al., 2018). Solutions to the issue of using chemical inducers have

also been explored, since toxic effects and low responsiveness are relevant concerns. For

example, the use of electromagnetic radiation has been considered for dynamic gene

expression, as it can provide a proven, quick, and highly reversible signal (Salinas et al.,

2017).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

4.4 Orthogonality principles applied to pathway design

Another strategy to decouple production from cellular growth is the use of orthogonality

principles. Metabolic pathways are orthogonal when they operate with minimal interference

between the synthetic pathway and the biomass-producing native pathways. In principle,

those pathways do not share any enzymes and may be connected by a single metabolite

(Pandit et al., 2017). This metabolite can be used as a precursor for either biomass or

chemical production, and it is possible to design strategies that introduce metabolic “valves”

to isolate the growth and the chemical production pathways in a parallel fashion (Pandit et al.,

2017). The more enzymes these pathways share, the more opportunity for cross-talk between

them will exist (Figure 6A). On the other hand, by minimizing the common reactions

between the two network branches, it is possible to create a more orthogonal system (Figure

6B). Orthogonal networks can bypass large portions of native pathways transforming sugars

directly to chemicals, or yet, include junctions or valves through which precursors can be

used for growth, and, when convenient, can be diverged for chemical production. Such

pathways are considered to be growth-independent, meaning that orthogonality is contrary to

growth-coupling.

Accounting for substrate orthogonality is also important. Metabolites like glucose or fatty

acids are highly involved in core metabolism, making more orthogonal substrates such as

ethylene glycol and xylose more desirable (Pandit et al., 2017). Xylose-based orthogonal

pathways could be beneficial for future non-natural chemical production, since it is

extensively found in agricultural residues as part of hemicellulose and does not compete with

the food supply. One example of this concept being applied in the production of non-natural

chemicals is presented by Liu and Lu (2015). In this work, they produced 1,4 BDO from

xylose in E. coli. As this microbe is naturally able to catabolise xylose, they deleted three

native genes encoding common enzymes to make the synthetic pathway orthogonal to central

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

metabolism. Their strategy was to use xylose for chemical production and other carbon

sources for cellular growth. A quorum-sensing mechanism was also exploited for production

control, and the resulting strain was able to produce of 1,4 BDO autonomously, although in

quite low amounts (Liu and Lu, 2015).

A problem that would predictably arise from the construction of highly integrated systems,

such as gene circuits, is the need for greater orthogonality in their design. A lack of

orthogonality can lead to unexpected imbalances in expression modulation due to

interference of certain DNA-binding factors with the successful association of the correct

factor-promoter complexes, thereby defeating the purpose of dynamic circuits. This problem

was handled by Rantasalo et al. (2018) by the creation of a transcription factor-promoter

toolbox library. In that work, they characterized different promoter regions and engineered

transcription regulators for optimized pathway expression in S. cerevisiae, and then used it to

develop a memory-enabled genetic switch able to induce and repress production according to

need. The key feature of their transcription factor-promoter toolbox library is the high

orthogonality between the proteins and their respective binding sites.

Another promising strategy for the implementation of orthogonality in metabolic engineering

is the use of non-natural cofactors to create orthogonal gene circuits for energy transfer.

Redox cofactor balancing is well-studied, and, in most cases, involves changing the

specificity of proteins to use a different electron carrier (e.g. NAD instead of NADP) or

overproducing the required cofactor (Lee et al., 2012). However, natural cofactors, such as

NAD, link multiple energy transfer modules and metabolic circuits, and thus, concerns have

been raised about the use of natural cofactors for production pathways, given their low

orthogonality (Figure 7A). The use of synthetic cofactors for highly selective energy transfer

has hence been proposed as an orthogonal alternative that further segregates production and

central metabolism at the pathway level (Figure 7B). Such synthetic cofactor-linked circuits

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

contain two parts; one which strips the electrons from the energy carrier and another which

replaces them. The system proposed by Wang et al. (2017c) uses phosphite dehydrogenase

and malate dehydrogenase to recycle the synthetic cofactor nicotinamide cytosine

dinucleotide or NCD (Figure 7C). This way, competition for natural cofactors (i.e. NAD)

does not affect non-natural chemical production pathways, facilitating strain optimization.

Orthogonal pathway construction also has a place in the optimization of production of non-

natural metabolites, such as synthetic amino acids, for in vivo reactions mediated by

recombinant proteins. Ma et al. (2018) constructed a heterologous and orthogonal pathway

for azidohomoalanine production in E. coli, an unconventional amino acid with

biotechnological applications. Such strategies could be repurposed to produce completely

synthetic amino acids unheard of in nature, thus helping to reduce bioproduction costs and

meet demands for new non-natural chemical and biochemical products (Völler and Budisa,

2017).

5. Conclusion

In this review, we highlighted the emerging case studies for the direct production of non-

natural chemicals in microorganisms. In fact, this shift to non-natural chemical production

has intensified in the past 5 years, with more than 40 different compounds now demonstrated

in laboratory scale. In addition, at least one of them, 1,4 BDO, is being produced using a

commercial scale bioprocess.

In fact, a variety of new computational and experimental systems biology tools are available

for metabolic engineering and can significantly influence the development of hosts for non-

natural chemical production and feasibility of its commercialization. Such tools include

biosensors, high throughput robotic screening platforms, de novo protein design, directed

evolution and dynamic control methods based on orthogonality. However, not all tools are as

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

impactful on microbial production. For instance, computational strain optimization methods

have the potential to multiply chemical production several times. On the other hand, pathway

balancing sometimes results only in a small increase in the overall production and it is only

practical once the appropriate enzymes have been selected and implemented successfully.

Finally, chemical production is absolutely absent without the precise enzymes to catalyze the

reactions involved in the designed pathway. Hence, finding the required enzymes is critical

for implementing synthetic pathways.

In fact, while in-silico pathway design is sufficiently advanced to be used to find pathways

for virtually any organic chemical, one of the main hindrances for developing new pathways

for non-natural chemicals is the discovery and engineering of novel enzymes encoding the

non-natural activity. Usually there is high prospect for finding promiscuous enzymes or for

adapting a natural enzyme for the desired reaction. However, if no enzyme could be selected

then, having to resort for de novo enzyme design results in an enormous challenge. Due to

this difficulty of engineering non-natural enzymes, the combination of synthetic pathways

with chemical transformations still might be required for some cases (Lee et al., 2019).

However, it is clear that many of the advanced methods for host optimization have not yet

been used in most studies of non-natural chemical bioproduction. Hence, there is an

opportunity to integrate such approaches for host optimization with the emerging automated

robotic workflows to accelerate the development of new microbial cell factories.

Acknowledgements:

This work was supported by the Natural Sciences and Engineering Research Council

(NSERC), the NSERC Industrial Biocatalysis Network (IBN), Biochemicals from Cellulosic

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Biomass (BioCeB) grant from the Ontario Research Fund (Research Excellence) and a grant

from the Genome Canada Genomics Applied Partnership Program (GAPP).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

References

Andreozzi, S., Chakrabarti, A., Soh, K.C., Burgard, A., Yang, T.H., Van Dien, S., Miskovic,

L., Hatzimanikatis, V., 2016. Identification of metabolic engineering targets for the

enhancement of 1,4-butanediol production in recombinant E. coli using large-scale kinetic

models. Metab. Eng. 35, 148–159. doi:10.1016/j.ymben.2016.01.009

Anesiadis, N., Kobayashi, H., Cluett, W.R., Mahadevan, R., 2013. Analysis and Design of a

Genetic Circuit for Dynamic Metabolic Engineering. ACS Syn. Bio. 2, 442–452.

doi:10.1021/sb300129j

Apaolaza, I., San José-Eneriz, E., Tobalina, L., Miranda, E., Garate, L., Agirre, X., Prósper,

F., Planes, F.J., 2017. An in-silico approach to predict and exploit synthetic lethality in cancer

metabolism. Nat. Commun. 8. doi:10.1038/s41467-017-00555-y

Apaydin, M., Xu, L., Zeng, B., Qian, X., 2017. Robust mutant strain design by pessimistic

optimization. BMC Genomics 18. doi:10.1186/s12864-017-4025-7

Arkin, A.P., Cottingham, R.W., Henry, C.S., Harris, N.L., Stevens, R.L., et al., 2018. KBase:

The United States Department of Energy Systems Biology Knowledgebase. Nat. Biotechnol.

36, 566–569. doi:10.1038/nbt.4163

Arnold, F.H., 2017. Directed Evolution: Bringing New Chemistry to Life. Angew. Chem.,

Int. Ed. 57, 4143–4148. doi:10.1002/anie.201708408

Atsumi, S., Hanai, T., Liao, J.C., 2008. Non-fermentative pathways for synthesis of

branched-chain higher alcohols as biofuels. Nature 451, 86–89. doi:10.1038/nature06450

Bajaj, P., Sreenilayam, G., Tyagi, V., Fasan, R., 2016. Gram-Scale Synthesis of Chiral

Cyclopropane-Containing Drugs and Drug Precursors with Engineered Myoglobin Catalysts

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Featuring Complementary Stereoselectivity. Angew. Chem., Int. Ed. 55, 16110–16114.

doi:10.1002/anie.201608680

Becker, J., Kuhl, M., Kohlstedt, M., Starck, S., Wittmann, C., 2018. Metabolic engineering of

Corynebacterium glutamicum for the production of cis, cis-muconic acid from lignin.

Microb. Cell Fact.17. doi:10.1186/s12934-018-0963-2

Bjelic, S., Nivón, L.G., Çelebi-Ölçüm, N., Kiss, G., Rosewall, C.F., Lovick, H.M., Ingalls,

E.L., Gallaher, J.L., Seetharaman, J., Lew, S., Montelione, G.T., Hunt, J.F., Michael, F.E.,

Houk, K.N., Baker, D., 2013. Computational Design of Enone-Binding Proteins with

Catalytic Activity for the Morita–Baylis–Hillman Reaction. ACS Chem. Biol. 8, 749–757.

doi:10.1021/cb3006227

Bothfeld, W., Kapov, G., Tyo, K.E.J., 2017. A Glucose-Sensing Toggle Switch for

Autonomous, High Productivity Genetic Control. ACS Synthetic Biology 6, 1296–1304.

doi:10.1021/acssynbio.6b00257

Brockman, I.M., Prather, K.L.J., 2015. Dynamic metabolic engineering: New strategies for

developing responsive cell factories. Biotechnol. J. 10, 1360–1369.

doi:10.1002/biot.201400422

Burgard, A., Burk, M.J., Osterhout, R., Van Dien, S., Yim, H., 2016. Development of a

commercial scale process for production of 1,4-butanediol from sugar. Curr. Opin.

Biotechnol. 42, 118–125. doi:10.1016/j.copbio.2016.04.016

Burgard, A.P., Pharkya, P., Maranas, C.D., 2003. Optknock: A bilevel programming

framework for identifying gene knockout strategies for microbial strain optimization.

Biotechnol. Bioeng. 84, 647–657. doi:10.1002/bit.10803

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Calhoun, S., Korczynska, M., Wichelecki, D.J., San Francisco, B., Zhao, S., Rodionov, D.A.,

Vetting, M.W., Al-Obaidi, N.F., Lin, H., O’Meara, M.J., Scott, D.A., Morris, J.H., Russel,

D., Almo, S.C., Osterman, A.L., Gerlt, J.A., Jacobson, M.P., Shoichet, B.K., Sali, A., 2018.

Prediction of enzymatic pathways by integrative pathway mapping. eLife 7.

doi:10.7554/elife.31097

Campodonico, M.A., Andrews, B.A., Asenjo, J.A., Palsson, B.O., Feist, A.M., 2014.

Generation of an atlas for commodity chemical production in Escherichia coli and a novel

pathway prediction algorithm, GEM-Path. Metab. Eng. 25, 140–158.

doi:10.1016/j.ymben.2014.07.009

Cann, A.F., Liao, J.C., 2008. Production of 2-methyl-1-butanol in engineered Escherichia

coli. Appl. Microbiol. Biotechnol. 81, 89–98. doi:10.1007/s00253-008-1631-y

Carbonell, P., Faulon, J.-L., 2010. Molecular signatures-based prediction of enzyme

promiscuity. Bioinformatics 26, 2012–2019. doi:10.1093/bioinformatics/btq317

Carbonell, P., Parutto, P., Baudier, C., Junot, C., Faulon, J.-L., 2013. Retropath: Automated

Pipeline for Embedded Metabolic Circuits. ACS Syn. Bio. 3, 565–577.

doi:10.1021/sb4001273

Carbonell, P., Parutto, P., Herisson, J., Pandit, S.B., Faulon, J.-L., 2014. XTMS: pathway

design in an eXTended metabolic space. Nucleic Acids Res. 42, W389–W394.

doi:10.1093/nar/gku362

Carbonell, P., Planson, A.-G., Fichera, D., Faulon, J.-L., 2011. A retrosynthetic biology

approach to metabolic pathway design for therapeutic production. BMC Syst. Biol. 5, 122.

doi:10.1186/1752-0509-5-122

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Cardoso, J.G.R., Jensen, K., Lieven, C., Lærke Hansen, A.S., Galkina, S., Beber, M.,

Özdemir, E., Herrgård, M.J., Redestig, H., Sonnenschein, N., 2018. Cameo: A Python

Library for Computer Aided Metabolic Engineering and Optimization of Cell Factories. ACS

Synthetic Biology 7, 1163–1166. doi:10.1021/acssynbio.7b00423

Caspi, R., Billington, R., Ferrer, L., Foerster, H., Fulcher, C.A., Keseler, I.M., Kothari, A.,

Krummenacker, M., Latendresse, M., Mueller, L.A., Ong, Q., Paley, S., Subhraveti, P.,

Weaver, D.S., Karp, P.D., 2015. The MetaCyc database of metabolic pathways and enzymes

and the BioCyc collection of pathway/genome databases. Nucleic Acids Res.44, D471–D480.

doi:10.1093/nar/gkv1164

Chae, T.U., Ko, Y.-S., Hwang, K.-S., Lee, S.Y., 2017. Metabolic engineering of Escherichia

coli for the production of four-, five- and six-carbon lactams. Metab. Eng. 41, 82–91.

doi:10.1016/j.ymben.2017.04.001

Chen, G.S., Siao, S.W., Shen, C.R., 2017. Saturated mutagenesis of ketoisovalerate

decarboxylase V461 enabled specific synthesis of 1-pentanol via the ketoacid elongation

cycle. Scientific Reports 7. doi:10.1038/s41598-017-11624-z

Chen, X., Liu, L., 2018. Gene Circuits for Dynamically Regulating Metabolism. Trends

Biotechnol. 36, 751–754. doi:10.1016/j.tibtech.2017.12.007

Chen, X.F., Xia, X.X., Lee, S.Y., Qian, Z.G., 2018. Engineering tunable biosensors for

monitoring putrescine in Escherichia coli. Biotechnol. Bioeng. 115, 1014–1027.

doi:10.1002/bit.26521

Cheong, S., Clomburg, J.M., Gonzalez, R., 2016. Energy- and carbon-efficient synthesis of

functionalized small molecules in bacteria using non-decarboxylative Claisen condensation

reactions. Nat Biotechnol 34, 556–561. doi:10.1038/nbt.3505

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Cheong, S., Clomburg, J.M., Gonzalez, R., 2018. A synthetic pathway for the production of

2-hydroxyisovaleric acid in Escherichia coli. J. Ind. Microbiol. Biotechnol. 45, 579–588.

doi:10.1007/s10295-018-2005-9

Cho, A., Yun, H., Park, J., Lee, S., Park, S., 2010. Prediction of novel synthetic pathways for

the production of desired chemicals. BMC Syst. Biol. 4, 35. doi:10.1186/1752-0509-4-35

Choi, Y.J., Lee, S.Y., 2013. Microbial production of short-chain alkanes. Nature 502, 571–

574. doi:10.1038/nature12536

Chu, H.S., Ahn, J.-H., Yun, J., Choi, I.S., Nam, T.-W., Cho, K.M., 2015. Direct fermentation

route for the production of acrylic acid. Metab. Eng. 32, 23–29.

doi:10.1016/j.ymben.2015.08.005

Chubukov, V., Mukhopadhyay, A., Petzold, C.J., Keasling, J.D., Martín, H.G., 2016.

Synthetic and systems biology for microbial production of commodity chemicals. npj

Systems Biology and Applications 2. doi:10.1038/npjsba.2016.9

Cobb, R.E., Chao, R., Zhao, H., 2013. Directed evolution: Past, present, and future. AIChE J.

59, 1432–1440. doi:10.1002/aic.13995

Coelho, P.S., Brustad, E.M., Kannan, A., Arnold, F.H., 2013. Olefin Cyclopropanation via

Carbene Transfer Catalyzed by Engineered Cytochrome P450 Enzymes. Science 339, 307–

310. doi:10.1126/science.1231434

Connor, M.R., Cann, A.F., Liao, J.C., 2010. 3-Methyl-1-butanol production in Escherichia

coli: random mutagenesis and two-phase fermentation. Appl. Microbiol. Biotechnol. 86,

1155–1164. doi:10.1007/s00253-009-2401-1

Conrad, T.M., Lewis, N.E., Palsson, B.O., 2011. Microbial laboratory evolution in the era of

genome-scale science. Molecular Systems Biology 7, 509–509. doi:10.1038/msb.2011.42

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Costanza, J., Carapezza, G., Angione, C., Lió, P., Nicosia, G., 2012. Robust design of

microbial strains. Bioinformatics 28, 3097–3104. doi:10.1093/bioinformatics/bts590

Costello, Z., Martin, H.G., 2018. A machine learning approach to predict metabolic pathway

dynamics from time-series multiomics data. npj Systems Biology and Applications 4.

doi:10.1038/s41540-018-0054-3

Covert, M.W., Schilling, C.H., Palsson, B., 2001. Regulation of Gene Expression in Flux

Balance Models of Metabolism. J. Theor. Biol. 213, 73–88. doi:10.1006/jtbi.2001.2405

Dahl, R.H., Zhang, F., Alonso-Gutierrez, J., Baidoo, E., Batth, T.S., Redding-Johanson,

A.M., Petzold, C.J., Mukhopadhyay, A., Lee, T.S., Adams, P.D., Keasling, J.D., 2013.

Engineering dynamic pathway regulation using stress-response promoters. Nat. Biotechnol.

31, 1039–1046. doi:10.1038/nbt.2689

Dai, L., Tao, F., Tang, H., Guo, Y., Shen, Y., Xu, P., 2017. Directing enzyme devolution for

biosynthesis of alkanols and 1,n-alkanediols from natural polyhydroxy compounds. Metab.

Eng. 44, 70–80. doi:10.1016/j.ymben.2017.09.005

Dai, Z., Nielsen, J., 2015. Advancing metabolic engineering through systems biology of

industrial microorganisms. Curr. Opin. Biotechnol. 36, 8–15.

doi:10.1016/j.copbio.2015.08.006

De Martino, A., De Martino, D., 2018. An introduction to the maximum entropy approach

and its application to inference problems in biology. Heliyon 4, e00596.

doi:10.1016/j.heliyon.2018.e00596

De Paepe, B., Peters, G., Coussement, P., Maertens, J., De Mey, M., 2017. Tailor-made

transcriptional biosensors for optimizing microbial cell factories. J. Ind. Microbiol.

Biotechnol. 44, 623–645. doi:10.1007/s10295-016-1862-3

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Delépine, B., Duigou, T., Carbonell, P., Faulon, J.-L., 2017. RetroPath2.0: A retrosynthesis

workflow for metabolic engineers. Metab. Eng. 45, 158–170.

doi:10.1016/j.ymben.2017.12.002

Didovyk, A., Borek, B., Tsimring, L., Hasty, J., 2016. Transcriptional regulation with

CRISPR-Cas9: principles, advances, and applications. Current Opinion in Biotechnology 40,

177–184. doi:10.1016/j.copbio.2016.06.003

Dietrich, J.A., McKee, A.E., Keasling, J.D., 2010. High-Throughput Metabolic Engineering:

Advances in Small-Molecule Screening and Selection. Annual Review of Biochemistry 79,

563–590. doi:10.1146/annurev-biochem-062608-095938

Dietrich, J.A., Shis, D.L., Alikhani, A., Keasling, J.D., 2013. Transcription factor-based

screens and synthetic selections for microbial small-molecule biosynthesis. ACS Synth. Biol.

2, 47–58. doi:10.1021/sb300091d

Dixon, N., Duncan, J.N., Geerlings, T., Dunstan, M.S., McCarthy, J.E.G., Leys, D.,

Micklefield, J., 2010. Reengineering orthogonally selective riboswitches. Proc. Natl. Acad.

Sci. 107, 2830–2835. doi:10.1073/pnas.0911209107

Dou, J., Vorobieva, A.A., Sheffler, W., Doyle, L.A., Park, H., Bick, M.J., Mao, B., Foight,

G.W., Lee, M.Y., Gagnon, L.A., Carter, L., Sankaran, B., Ovchinnikov, S., Marcos, E.,

Huang, P.-S., Vaughan, J.C., Stoddard, B.L., Baker, D., 2018. De novo design of a

fluorescence-activating β-barrel. Nature 561, 485–491. doi:10.1038/s41586-018-0509-0

Duigou, T., du Lac, M., Carbonell, P., Faulon, J.-L., 2018. RetroRules: a database of reaction

rules for engineering biology. Nucleic Acids Res. doi:10.1093/nar/gky940

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Ebrahim, A., Lerman, J.A., Palsson, B.O., Hyduke, D.R., 2013. COBRApy: COnstraints-

Based Reconstruction and Analysis for Python. BMC Syst. Biol. 7, 74. doi:10.1186/1752-

0509-7-74

Ewald, J., Bartl, M., Dandekar, T., Kaleta, C., 2017. Optimality principles reveal a complex

interplay of intermediate toxicity and kinetic efficiency in the regulation of prokaryotic

metabolism. PLoS Comput. Biol. 13, 1–19. doi:10.1371/journal.pcbi.1005371

Furubayashi, M., Ikezumi, M., Takaichi, S., Maoka, T., Hemmi, H., Ogawa, T., Saito, K.,

Tobias, A.V., Umeno, D., 2015. A highly selective biosynthetic pathway to non-natural C50

carotenoids assembled from moderately selective enzymes. Nat. Commun. 6.

doi:10.1038/ncomms8534

Guo, D., Zhu, J., Deng, Z., Liu, T., 2014. Metabolic engineering of Escherichia coli for

production of fatty acid short-chain esters through combination of the fatty acid and 2-keto

acid pathways. Metab. Eng. 22, 69–75. doi:10.1016/j.ymben.2014.01.003

Gupta, U., Le, T., Hu, W.-S., Bhan, A., Daoutidis, P., 2018. Automated network generation

and analysis of biochemical reaction pathways using RING. Metab. Eng. 49, 84–93.

doi:10.1016/j.ymben.2018.07.009

Hadadi, N., Hafner, J., Shajkofci, A., Zisaki, A., Hatzimanikatis, V., 2016. ATLAS of

Biochemistry: A Repository of All Possible Biochemical Reactions for Synthetic Biology and

Metabolic Engineering Studies. ACS Synth. Biol. 5, 1155–1166.


Hadadi, N., Hatzimanikatis, V., 2015. Design of computational retrobiosynthesis tools for the

design of de novo synthetic pathways. Curr. Opin. Chem. Biol. 28, 99–104.

doi:10.1016/j.cbpa.2015.06.025

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Hädicke, O., Klamt, S., 2011. Computing complex metabolic intervention strategies using

constrained minimal cut sets. Metab. Eng. 13, 204–213. doi:10.1016/j.ymben.2010.12.004

Hammer, S.C., Kubik, G., Watkins, E., Huang, S., Minges, H., Arnold, F.H., 2017. Anti-

Markovnikov alkene oxidation by metal-oxo–mediated enzyme catalysis. Science 358, 215–

218. doi:10.1126/science.aao1482

Hatzimanikatis, V., Li, C., Ionita, J.A., Henry, C.S., Jankowski, M.D., Broadbelt, L.J., 2005.

Exploring the diversity of complex metabolic networks. Bioinformatics 21, 1603–1609.

doi:10.1093/bioinformatics/bti213

Heath, A.P., Bennett, G.N., Kavraki, L.E., 2011. An Algorithm for Efficient Identification of

Branched Metabolic Pathways. Journal of Computational Biology 18, 1575–1597.

doi:10.1089/cmb.2011.0165

Henry, C.S., DeJongh, M., Best, A.A., Frybarger, P.M., Linsay, B., Stevens, R.L., 2010.

High-throughput generation, optimization and analysis of genome-scale metabolic models.

Nat. Biotechnol. 28, 977–982. doi:10.1038/nbt.1672

Ho, J.C.H., Pawar, S.V., Hallam, S.J., Yadav, V.G., 2017. An Improved Whole-Cell

Biosensor for the Discovery of Lignin-Transforming Enzymes in Functional Metagenomic

Screens. ACS Syn. Bio.7, 392–398. doi:10.1021/acssynbio.7b00412

Hossain, G.S., Yuan, H., Li, J., Shin, H., Wang, M., Du, G., Chen, J., Liu, L., 2016.

Metabolic engineering of Raoultella ornithinolytica BF60 for the production of 2, 5-

furandicarboxylic acid from 5-hydroxymethylfurfural . Appl. Environ. Microbiol.

AEM.02312-16. doi:10.1128/aem.02312-16

doi:10.1016/j.ymben.2015.03.009

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Huang, P.-S., Feldmeier, K., Parmeggiani, F., Fernandez Velasco, D.A., Höcker, B., Baker,

D., 2015. De novo design of a four-fold symmetric TIM-barrel protein with atomic-level

accuracy. Nat. Chem. Biol.12, 29–34. doi:10.1038/nchembio.1966

Jabarivelisdeh, B., Waldherr, S., 2018. Optimization of bioprocess productivity based on

metabolic-genetic network models with bilevel dynamic programming. Biotechnol. Bioeng.

115, 1829–1841. doi:10.1002/bit.26599

Jang, S., Jang, S., Xiu, Y., Kang, T.J., Lee, S.-H., Koffas, M.A.G., Jung, G.Y., 2017.

Development of Artificial Riboswitches for Monitoring of Naringenin In Vivo. ACS Syn.

Bio. 6, 2077–2085. doi:10.1021/acssynbio.7b00128

Jeffryes, J.G., Colastani, R.L., Elbadawi-Sidhu, M., Kind, T., Niehaus, T.D., Broadbelt, L.J.,

Hanson, A.D., Fiehn, O., Tyo, K.E.J., Henry, C.S., 2015. MINEs: open access databases of

computationally predicted enzyme promiscuity products for untargeted metabolomics.

Journal of Cheminformatics 7. doi:10.1186/s13321-015-0087-1

Jensen, M.K., Keasling, J.D. (Eds.), 2018. Synthetic Metabolic Pathways, Methods in

Molecular Biology. Springer New York. doi:10.1007/978-1-4939-7295-1

Jiang, L., Althoff, E.A., Clemente, F.R., Doyle, L., Rothlisberger, D., Zanghellini, A.,

Gallaher, J.L., Betker, J.L., Tanaka, F., Barbas, C.F., Hilvert, D., Houk, K.N., Stoddard, B.L.,

Baker, D., 2008. De Novo Computational Design of Retro-Aldol Enzymes. Science 319,

1387–1391. doi:10.1126/science.1152692

Jones, J.A., Toparlak, T.D., Koffas, M.A.G., 2015. Metabolic pathway balancing and its role

in the production of biofuels and chemicals. Curr. Opin. Biotechnol. 33, 52–59.

doi:10.1016/j.copbio.2014.11.013

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Joo, J.C., Khusnutdinova, A.N., Flick, R., Kim, T., Bornscheuer, U.T., Yakunin, A.F.,

Mahadevan, R., 2017. Alkene hydrogenation activity of enoate reductases for an

environmentally benign biosynthesis of adipic acid. Chemical Science 8, 1406–1413.

doi:10.1039/c6sc02842j

Jung, Y.K., Kim, T.Y., Park, S.J., Lee, S.Y., 2010. Metabolic engineering of Escherichia coli

for the production of polylactic acid and its copolymers. Biotechnol. Bioeng. 105, 161–171.

doi:10.1002/bit.22548

Jungreuthmayer, C., Zanghellini, J., 2012. Designing optimal cell factories: integer

programming couples elementary mode analysis with regulation. BMC Syst. Biol. 6, 103.

doi:10.1186/1752-0509-6-103

Kanehisa, M., Goto S., 2000. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic

Acids Res. 28, 27–30. doi:10.1093/nar/28.1.27

Kaneko, A., Ishii, Y., Kirimura, K., 2011. High-yield Production of cis,cis-Muconic Acid

from Catechol in Aqueous Solution by Biocatalyst. Chemistry Letters 40, 381–383.

doi:10.1246/cl.2011.381

Kang, S.-Y., Choi, O., Lee, J.K., Ahn, J.-O., Ahn, J.S., Hwang, B.Y., Hong, Y.-S., 2015.

Artificial de novo biosynthesis of hydroxystyrene derivatives in a tyrosine overproducing

Escherichia coli strain. Microb. Cell Fact.14. doi:10.1186/s12934-015-0268-7

Kataoka, N., Vangnai, A.S., Pongtharangkul, T., Yakushi, T., Matsushita, K., 2017.

Production of 1,3-diols in Escherichia coli. Bioresour. Technol. 245, 1538–1541.

doi:10.1016/j.biortech.2017.05.082

Keseler, I.M., Mackie, A., Santos-Zavaleta, A., Billington, R., Bonavides-Martínez, C.,

Caspi, R., Fulcher, C., Gama-Castro, S., Kothari, A., Krummenacker, M., Latendresse, M.,

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Muñiz-Rascado, L., Ong, Q., Paley, S., Peralta-Gil, M., Subhraveti, P., Velázquez-Ramírez,

D.A., Weaver, D., Collado-Vides, J., Paulsen, I., Karp, P.D., 2016. The EcoCyc database:

reflecting new knowledge about Escherichia coli K-12. Nucleic Acids Res.45, D543–D550.

doi:10.1093/nar/gkw1003

Kim, B., Park, H., Na, D., Lee, S.Y., 2013. Metabolic engineering of Escherichia coli for the

production of phenol from glucose. Biotechnol J 9, 621–629. doi:10.1002/biot.201300263

Kim, J., Reed, J.L., Maravelias, C.T., 2011. Large-Scale Bi-Level Strain Design Approaches

and Mixed-Integer Programming Solution Techniques. PLoS ONE 6, e24162.

doi:10.1371/journal.pone.0024162

King, Z.A., Feist, A.M., 2013. Optimizing Cofactor Specificity of Oxidoreductase Enzymes

for the Generation of Microbial Production Strains—OptSwap. Industrial Biotechnology 9,

236–246. doi:10.1089/ind.2013.0005

King, Z.A., Lu, J., Dräger, A., Miller, P., Federowicz, S., Lerman, J.A., Ebrahim, A., Palsson,

B.O., Lewis, N.E., 2015. BiGG Models: A platform for integrating, standardizing and sharing

genome-scale models. Nucleic Acids Res. 44, D515–D522. doi:10.1093/nar/gkv1049

Kiss, G., Çelebi-Ölçüm, N., Moretti, R., Baker, D., Houk, K.N., 2013. Computational

Enzyme Design. Angew. Chem., Int. Ed. 52, 5700–5725. doi:10.1002/anie.201204077

Kitade, Y., Hashimoto, R., Suda, M., Hiraga, K., Inui, M., 2018. Production of 4-

Hydroxybenzoic Acid by an Aerobic Growth-Arrested Bioprocess Using Metabolically

Engineered Corynebacterium glutamicum. Appl. Environ. Microbiol. 84.

doi:10.1128/aem.02587-17

Klamt, S., Gilles, E.D., 2004. Minimal cut sets in biochemical reaction networks.

Bioinformatics 20, 226–234. doi:10.1093/bioinformatics/btg395

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Klamt, S., Haus, U.-U., Theis, F., 2009. Hypergraphs and Cellular Networks. PLoS Comput.

Biol. 5, e1000385. doi:10.1371/journal.pcbi.1000385

Klamt, S., Mahadevan, R., 2015. On the feasibility of growth-coupled product synthesis in

microbial strains. Metab. Eng. 30, 166–178. doi:10.1016/j.ymben.2015.05.006

Kohlstedt, M., Starck, S., Barton, N., Stolzenberger, J., Selzer, M., Mehlmann, K., Schneider,

R., Pleissner, D., Rinkel, J., Dickschat, J.S., Venus, J., B.J.H. van Duuren, J., Wittmann, C.,

2018. From lignin to nylon: Cascaded chemical and biochemical conversion using

metabolically engineered Pseudomonas putida. Metab. Eng. 47, 279–293.

doi:10.1016/j.ymben.2018.03.003

Koopman, F., Wierckx, N., de Winde, J.H., Ruijssenaars, H.J., 2010. Efficient whole-cell

biotransformation of 5-(hydroxymethyl)furfural into FDCA, 2,5-furandicarboxylic acid.

Bioresour. Technol. 101, 6291–6296. doi:10.1016/j.biortech.2010.03.050

Kumar, A., Suthers, P.F., Maranas, C.D., 2012. MetRxn: a knowledgebase of metabolites and

reactions spanning metabolic models and databases. BMC Bioinformatics 13, 6.

doi:10.1186/1471-2105-13-6

Leavitt, J.M., Wagner, J.M., Tu, C.C., Tong, A., Liu, Y., Alper, H.S., 2017. Biosensor-

Enabled Directed Evolution to Improve Muconic Acid Production in Saccharomyces

cerevisiae. Biotechnol. J. 12, 1600687. doi:10.1002/biot.201600687

Lee, J.W., Kim, T.Y., Jang, Y.-S., Choi, S., Lee, S.Y., 2011. Systems metabolic engineering

for chemicals and materials. Trends Biotechnol. 29, 370–378.

doi:10.1016/j.tibtech.2011.04.001

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Lee, J.W., Na, D., Park, J.M., Lee, J., Choi, S., Lee, S.Y., 2012. Systems metabolic

engineering of microorganisms for natural and non-natural chemicals. Nat. Chem. Biol.8,

536–546. doi:10.1038/nchembio.970

Lee, S.Y., Kim, H.U., 2015. Systems strategies for developing industrial microbial strains.

Nat. Biotechnol. 33, 1061–1072. doi:10.1038/nbt.3365

Lee, S.Y., Kim, H.U., Chae, T.U., Cho, J.S., Kim, J.W., Shin, J.H., Kim, D.I., Ko, Y.-S.,

Jang, W.D., Jang, Y.-S., 2019. A comprehensive metabolic map for production of bio-based

chemicals. Nat. Catal. 2, 18–33. doi:10.1038/s41929-018-0212-4

Leonard, E., Ajikumar, P.K., Thayer, K., Xiao, W.-H., Mo, J.D., Tidor, B., Stephanopoulos,

G., Prather, K.L.J., 2010. Combining metabolic and protein engineering of a terpenoid

biosynthetic pathway for overproduction and selectivity control. Proc. Natl. Acad. Sci. 107,

13654–13659. doi:10.1073/pnas.1006138107

Lewis, N.E., Hixson, K.K., Conrad, T.M., Lerman, J.A., Charusanti, P., Polpitiya, A.D.,

Adkins, J.N., Schramm, G., Purvine, S.O., Lopez-Ferrer, D., Weitz, K.K., Eils, R., König, R.,

Smith, R.D., Palsson, B.Ø., 2010. Omic data from evolved E. coli are consistent with

computed optimal growth from genome-scale models. Mol. Syst. Biol. 6.

doi:10.1038/msb.2010.47

Lewis, N.E., Nagarajan, H., Palsson, B.O., 2012. Constraining the metabolic genotype–

phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10, 291–

305. doi:10.1038/nrmicro2737

Li, C., Henry, C.S., Jankowski, M.D., Ionita, J.A., Hatzimanikatis, V., Broadbelt, L.J., 2004.

Computational discovery of biochemical routes to specialty chemicals. Chemical Engineering

Science 59, 5051–5060. doi:10.1016/j.ces.2004.09.021

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Li, S., Si, T., Wang, M., Zhao, H., 2015. Development of a Synthetic Malonyl-CoA Sensor in

Saccharomyces cerevisiae for Intracellular Metabolite Monitoring and Genetic Screening.

ACS Synth. Biol. 4, 1308–1315. doi:10.1021/acssynbio.5b00069

Lim, H.G., Jang, S., Jang, S., Seo, S.W., Jung, G.Y., 2018. Design and optimization of

genetically encoded biosensors for high-throughput screening of chemicals. Curr. Opin.


Lin, J.-L., Wagner, J.M., Alper, H.S., 2017. Enabling tools for high-throughput detection of

metabolites: Metabolic engineering and directed evolution applications. Biotechnol. Adv. 35,

950–970. doi:10.1016/j.biotechadv.2017.07.005

Lin, P.P., Mi, L., Morioka, A.H., Yoshino, K.M., Konishi, S., Xu, S.C., Papanek, B.A., Riley,

L.A., Guss, A.M., Liao, J.C., 2015. Consolidated bioprocessing of cellulose to isobutanol

using Clostridium thermocellum. Metab. Eng. 31, 44–52. doi:10.1016/j.ymben.2015.07.001

Liu, C., Men, X., Chen, H., Li, M., Ding, Z., Chen, G., Wang, F., Liu, H., Wang, Q., Zhu, Y.,

Zhang, H., Xian, M., 2018. A systematic optimization of styrene biosynthesis in Escherichia

coli BL21(DE3). Biotechnol. Biofuels 11. doi:10.1186/s13068-018-1017-z

Liu, D., Xiao, Y., Evans, B.S., Zhang, F., 2015. Negative feedback regulation of fatty acid

production based on a malonyl-CoA sensor-actuator. ACS Synth. Biol. 4, 132–140.

doi:10.1021/sb400158w

Liu, H., Lu, T., 2015. Autonomous production of 1,4-butanediol via a de novo biosynthesis

pathway in engineered Escherichia coli. Metab. Eng. 29, 135–141.

Liu, Y., Kuhlman, B., 2006. RosettaDesign server for protein design. Nucl. Ac. Res.34,

W235–W238. doi:10.1093/nar/gkl163

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Lun, D.S., Rockwell, G., Guido, N.J., Baym, M., Kelner, J.A., Berger, B., Galagan, J.E.,

Church, G.M., 2009. Large-scale identification of genetic design strategies using local search.

Mol. Syst. Biol. 5. doi:10.1038/msb.2009.7

Ma, Y., Di Salvo, M.L., Budisa, N., 2018. Self-Directed in Cell Production of Methionine

Analogue Azidohomoalanine by Synthetic Metabolism and Its Incorporation into Model

Proteins, in: Methods in Molecular Biology. Springer New York, pp. 127–135.

doi:10.1007/978-1-4939-7574-7_7

Machado, D., Andrejev, S., Tramontano, M., Patil, K.R., 2018. Fast automated reconstruction

of genome-scale metabolic models for microbial species and communities. Nucleic Acids

Res. 46, 7542–7553. doi:10.1093/nar/gky537

Mahadevan, R., Burgard, A.P., Famili, I., Van Dien, S., Schilling, C.H., 2005. Applications

of metabolic modeling to drive bioprocess development for the production of value-added

chemicals. Biotechnol. Bioprocess Eng. 10, 408–417. doi:10.1007/bf02989823

Mahadevan, R., Edwards, J.S., Doyle, F.J., III, 2002. Dynamic Flux Balance Analysis of

Diauxic Growth in Escherichia coli. Biophysical Journal 83, 1331–1340. doi:10.1016/s0006-

3495(02)73903-9

Mahadevan, R., von Kamp, A., Klamt, S., 2015. Genome-scale strain designs based on

regulatory minimal cut sets. Bioinformatics 31, 2844–2851.

doi:10.1093/bioinformatics/btv217

Maia, P., Rocha, M., Rocha, I., 2015. In-Silico Constraint-Based Strain Optimization

Methods: the Quest for Optimal Cell Factories. Microbiology and Molecular Biology

Reviews 80, 45–67. doi:10.1128/mmbr.00014-15

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Mak, W.S., Tran, S., Marcheschi, R., Bertolani, S., Thompson, J., Baker, D., Liao, J.C.,

Siegel, J.B., 2015. Integrative genomic mining for enzyme function to enable engineering of

a non-natural biosynthetic pathway. Nat. Commun. 6. doi:10.1038/ncomms10005

Marcheschi, R.J., Gronenberg, L.S., Liao, J.C., 2013. Protein engineering for metabolic

engineering: Current and next-generation tools. Biotechnol. J. 8, 545–555.

doi:10.1002/biot.201200371

Matsuda, F., Ishii, J., Kondo, T., Ida, K., Tezuka, H., Kondo, A., 2013. Increased isobutanol

production in Saccharomyces cerevisiae by eliminating competing pathways and resolving

cofactor imbalance. Microb. Cell Fact.12, 119. doi:10.1186/1475-2859-12-119

McKenna, R., Nielsen, D.R., 2011. Styrene biosynthesis from glucose by engineered E. coli.

Metab. Eng. 13, 544–554. doi:10.1016/j.ymben.2011.06.005

McKenna, R., Pugh, S., Thompson, B., Nielsen, D.R., 2013. Microbial production of the

aromatic building-blocks (S)-styrene oxide and (R)-1,2-phenylethanediol from renewable

resources. Biotechnol. J. 8, 1465–1475. doi:10.1002/biot.201300035

McKenna, R., Thompson, B., Pugh, S., Nielsen, D.R., 2014. Rational and combinatorial

approaches to engineering styrene production by Saccharomyces cerevisiae. Microb. Cell

Fact.13. doi:10.1186/s12934-014-0123-2

Mehrer, C.R., Incha, M.R., Politz, M.C., Pfleger, B.F., 2018. Anaerobic production of

medium-chain fatty alcohols via a β-reduction pathway. Metab. Eng. 48, 63–71.

doi:10.1016/j.ymben.2018.05.011

Miskovic, L., Hatzimanikatis, V., 2010. Production of biofuels and biochemicals: in need of

an ORACLE. Trends in Biotechnology 28, 391–397. doi:10.1016/j.tibtech.2010.05.003

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Miyamoto, K.T., Komatsu, M., Ikeda, H., 2014. Discovery of Gene Cluster for Mycosporine-

Like Amino Acid Biosynthesis from Actinomycetales Microorganisms and Production of a

Novel Mycosporine-Like Amino Acid by Heterologous Expression. Appl. Environ.

Microbiol. 80, 5028–5036. doi:10.1128/aem.00727-14

Monk, J., Nogales, J., Palsson, B.O., 2014. Optimizing genome-scale network

reconstructions. Nat. Biotechnol. 32, 447–452. doi:10.1038/nbt.2870

Moretti, S., Martin, O., Van Du Tran, T., Bridge, A., Morgat, A., Pagni, M., 2015.

MetaNetX/MNXref – reconciliation of metabolites and biochemical reactions to bring

together genome-scale metabolic networks. Nucleic Acids Res. 44, D523–D526.

doi:10.1093/nar/gkv1117

Mutturi, S., 2017. FOCuS: a metaheuristic algorithm for computing knockouts from genome-

scale models for strain optimization. Molecular BioSystems 13, 1355–1363.

doi:10.1039/c7mb00204a

Nair, G., Jungreuthmayer, C., Hanscho, M., Zanghellini, J., 2015. Designing minimal

microbial strains of desired functionality using a genetic algorithm. Algorithms for Molecular

Biology 10. doi:10.1186/s13015-015-0060-6

Nemr, K., Müller, J.E.N., Joo, J.C., Gawand, P., Choudhary, R., Mendonca, B., Lu, S., Yu,

X., Yakunin, A.F., Mahadevan, R., 2018. Engineering a short, aldolase-based pathway for

(R)-1,3-butanediol production in Escherichia coli. Metab. Eng. 48, 13–24.

doi:10.1016/j.ymben.2018.04.013

Nguyen, A.D., Hwang, I.Y., Lee, O.K., Kim, D., Kalyuzhnaya, M.G., Mariyana, R., Hadiyati,

S., Kim, M.S., Lee, E.Y., 2018. Systematic metabolic engineering of Methylomicrobium

alcaliphilum 20Z for 2,3-butanediol production from methane. Metabolic Engineering 47,

323–333. doi:10.1016/j.ymben.2018.04.010

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Niu, W., Draths, K.M., Frost, J.W., 2002. Benzene-Free Synthesis of Adipic Acid.

Biotechnol. Prog. 18, 201–211. doi:10.1021/bp010179x

Noda, S., Shirai, T., Oyama, S., Kondo, A., 2016. Metabolic design of a platform Escherichia

coli strain producing various chorismate derivatives. Metab. Eng. 33, 119–129.

doi:10.1016/j.ymben.2015.11.007

Noor, E., Eden, Eran., Milo, R., Alon, Uri. 2010. Central Carbon Metabolism as a Minimal

Biochemical Walk between Precursors for Biomass and Energy. Mol Cell. 39(5):809-20.

doi:10.1016/j.molcel.2010.08.031

O’Brien, E.J., Monk, J.M., Palsson, B.O., 2015. Using Genome-scale Models to Predict

Biological Capabilities. Cell 161, 971–987. doi:10.1016/j.cell.2015.05.019

Oberhardt, M.A., Zarecki, R., Reshef, L., Xia, F., Duran-Frigola, M., Schreiber, R., Henry,

C.S., Ben-Tal, N., Dwyer, D.J., Gophna, U., Ruppin, E., 2016. Systems-Wide Prediction of

Enzyme Promiscuity Reveals a New Underground Alternative Route for Pyridoxal 5’-

Phosphate Production in E. coli. PLoS Comput. Biol. 12, e1004705.

doi:10.1371/journal.pcbi.1004705

Ohno, S., Shimizu, H., Furusawa, C., 2013. FastPros: screening of reaction knockout

strategies for Metab. Eng.. Bioinformatics 30, 981–987. doi:10.1093/bioinformatics/btt672

Orth, J.D., Thiele, I., Palsson, B.Ø., 2010. What is flux balance analysis? Nat. Biotechnol. 28,

245–248. doi:10.1038/nbt.1614

Packer, M.S., Liu, D.R., 2015. Methods for the directed evolution of proteins. Nat. Rev.

Genet. 16, 379–394. doi:10.1038/nrg3927

Pandit, A.V., Srinivasan, S., Mahadevan, R., 2017. Redesigning metabolism based on

orthogonality principles. Nat. Commun. 8, 15188. doi:10.1038/ncomms15188

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Patil, K., Rocha, I., Förster, J., Nielsen, J., 2005. Evolutionary programming as a platform for

in silico metabolic engineering. BMC Bioinformatics 6, 308. doi:10.1186/1471-2105-6-308

Pertusi, D.A., Stine, A.E., Broadbelt, L.J., Tyo, K.E.J., 2014. Efficient searching and

annotation of metabolic networks using chemical similarity. Bioinformatics 31, 1016–1024.

doi:10.1093/bioinformatics/btu760

Pharkya, P., 2004. OptStrain: A computational framework for redesign of microbial

production systems. Genome Research 14, 2367–2376. doi:10.1101/gr.2872004

Pharkya, P., Maranas, C.D., 2006. An optimization framework for identifying reaction

activation/inhibition or elimination candidates for overproduction in microbial systems.


Pitkänen, E., Jouhten, P., Rousu, J., 2009. Inferring branching pathways in genome-scale

metabolic networks. BMC Syst. Biol. 3. doi:10.1186/1752-0509-3-103

Placzek, S., Schomburg, I., Chang, A., Jeske, L., Ulbrich, M., Tillack, J., Schomburg, D.,

2016. BRENDA in 2017: new perspectives and new tools in BRENDA. Nucleic Acids Res.

45, D380–D388. doi:10.1093/nar/gkw952

Prather, K.L.J., Martin, C.H., 2008. De novo biosynthetic pathways: rational design of

microbial chemical factories. Curr. Opin. Biotechnol. 19, 468–474.

doi:10.1016/j.copbio.2008.07.009

Pyne, M.E., Narcross, L., Melgar, M., Kevvai, K., Mookerjee, S., Leite, G.B., Martin, V.J.J.,

2018. An Engineered Aro1 Protein Degradation Approach for Increased cis,cis-Muconic

Acid Biosynthesis in Saccharomyces cerevisiae. Appl. Environ. Microbiol. 84.

doi:10.1128/aem.01095-18

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Qian, S., Cirino, P.C., 2016. Using metabolite-responsive gene regulators to improve

microbial biosynthesis. Curr. Opin. Chem. Eng.14, 93–102. doi:10.1016/j.coche.2016.08.020

Raj, K., Partow, S., Correia, K., Khusnutdinova, A.N., Yakunin, A.F., Mahadevan, R., 2018.

Biocatalytic production of adipic acid from glucose using engineered Saccharomyces

cerevisiae. Metab. Eng. Commun. 6, 28–32. doi:10.1016/j.meteno.2018.02.001

Rantasalo, A., Kuivanen, J., Penttilä, M., Jäntti, J., Mojzita, D., 2018. Synthetic Toolkit for

Complex Genetic Circuit Engineering in Saccharomyces cerevisiae. ACS Synth. Biol. 7,

1573–1587. doi:10.1021/acssynbio.8b00076

Reed, J.L., Vo, T.D., Schilling, C.H., Palsson, B.O., 2003. An expanded genome-scale model

of Escherichia coli K-12 (iJR904 GSM/GPR). Genome Biology 4, R54. doi:10.1186/gb-

2003-4-9-r54

Ren, Y., Yang, S., Yuan, Q., Sun, X., 2015. Microbial production of phenol via salicylate

decarboxylation. RSC Adv. 5, 92685–92689. doi:10.1039/c5ra20104g

Rocha, I., Maia, P., Evangelista, P., Vilaça, P., Soares, S., Pinto, J.P., Nielsen, J., Patil, K.R.,

Ferreira, E.C., Rocha, M., 2010. OptFlux: an open-source software platform for in silico

metabolic engineering. BMC Systems Biology 4, 45. doi:10.1186/1752-0509-4-45

Rockwell, G., Guido, N.J., Church, G.M., 2013. Redirector: Designing Cell Factories by

Reconstructing the Metabolic Objective. PLoS Comput. Biol. 9, e1002882.


Rode, A.B., Endoh, T., Sugimoto, N., 2015. Tuning riboswitch-mediated gene regulation by

rational control of aptamer ligand binding properties. Angew. Chemie - Int. Ed. 54, 905–909.

doi:10.1002/anie.201407385

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Rogers, J.K., Church, G.M., 2016. Genetically encoded sensors enable real-time observation

of metabolite production. Proc. Natl. Acad. Sci. 113, 2388–2393.

doi:10.1073/pnas.1600375113

Rogers, J.K., Taylor, N.D., Church, G.M., 2016. Biosensor-based engineering of biosynthetic

pathways. Curr. Opin. Biotechnol. 42, 84–91. doi:10.1016/j.copbio.2016.03.005

Röthlisberger, D., Khersonsky, O., Wollacott, A.M., Jiang, L., DeChancie, J., Betker, J.,

Gallaher, J.L., Althoff, E.A., Zanghellini, A., Dym, O., Albeck, S., Houk, K.N., Tawfik, D.S.,

Baker, D., 2008. Kemp elimination catalysts by computational enzyme design. Nature 453,

190–195. doi:10.1038/nature06879

Ruckerbauer, D.E., Jungreuthmayer, C., Zanghellini, J., 2015. Predicting genetic engineering

targets with Elementary Flux Mode Analysis: a review of four current methods. New

Biotechnology 32, 534–546. doi:10.1016/j.nbt.2015.03.017

Salinas, F., Rojas, V., Delgado, V., Agosin, E., Larrondo, L.F., 2017. Optogenetic switches

for light-controlled gene expression in yeast. Appl. Microbiol. Biotechnol. 101, 2629–2640.

doi:10.1007/s00253-017-8178-8

Salvachúa, D., Johnson, C.W., Singer, C.A., Rohrer, H., Peterson, D.J., Black, B.A., Knapp,

A., Beckham, G.T., 2018. Bioprocess development for muconic acid production from

aromatic compounds and lignin. Green Chem. 20, 5007–5019. doi:10.1039/c8gc02519c

Schellenberger, J., Que, R., Fleming, R.M.T., Thiele, I., Orth, J.D., Feist, A.M., Zielinski,

D.C., Bordbar, A., Lewis, N.E., Rahmanian, S., Kang, J., Hyduke, D.R., Palsson, B.Ø., 2011.

Quantitative prediction of cellular metabolism with constraint-based models: the COBRA

Toolbox v2.0. Nature Protocols 6, 1290–1307. doi:10.1038/nprot.2011.308

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Segler, M.H.S., Preuss, M., Waller, M.P., 2018. Planning chemical syntheses with deep

neural networks and symbolic AI. Nature 555, 604–610. doi:10.1038/nature25978

Segre, D., Vitkup, D., Church, G.M., 2002. Analysis of optimality in natural and perturbed

metabolic networks. Proc. Natl. Acad. Sci. 99, 15112–15117. doi:10.1073/pnas.232349399

Shin, J.H., Park, S.H., Oh, Y.H., Choi, J.W., Lee, M.H., Cho, J.S., Jeong, K.J., Joo, J.C., Yu,

J., Park, S.J., Lee, S.Y., 2016. Metabolic engineering of Corynebacterium glutamicum for

enhanced production of 5-aminovaleric acid. Microb. Cell Fact.15. doi:10.1186/s12934-016-

0566-8

Shlomi, T., Berkman, O., Ruppin, E., 2005. Regulatory on/off minimization of metabolic flux

changes after genetic perturbations. Proc. Natl. Acad. Sci. 102, 7695–7700.

doi:10.1073/pnas.0406346102

Shlomi, T., Eisenberg, Y., Sharan, R., Ruppin, E., 2007. A genome-scale computational study

of the interplay between transcriptional regulation and metabolism. Mol. Syst. Biol. 3.

doi:10.1038/msb4100141

Siegel, J.B., Zanghellini, A., Lovick, H.M., Kiss, G., Lambert, A.R., St.Clair, J.L., Gallaher,

J.L., Hilvert, D., Gelb, M.H., Stoddard, B.L., Houk, K.N., Michael, F.E., Baker, D., 2010.

Computational Design of an Enzyme Catalyst for a Stereoselective Bimolecular Diels-Alder

Reaction. Science 329, 309–313. doi:10.1126/science.1190239

Siripong, W., Wolf, P., Kusumoputri, T.P., Downes, J.J., Kocharin, K., Tanapongpipat, S.,

Runguphan, W., 2018. Metabolic engineering of Pichia pastoris for production of isobutanol

and isobutyl acetate. Biotechnol. Biofuels 11. doi:10.1186/s13068-017-1003-x

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Sivakumar, T.V., Giri, V., Park, J.H., Kim, T.Y., Bhaduri, A., 2016. ReactPRED: a tool to

predict and analyze biochemical reactions. Bioinformatics btw491.

doi:10.1093/bioinformatics/btw491

Skjoedt, M.L., Snoek, T., Kildegaard, K.R., Arsovska, D., Eichenberger, M., Goedecke, T.J.,

Rajkumar, A.S., Zhang, J., Kristensen, M., Lehka, B.J., Siedler, S., Borodina, I., Jensen,

M.K., Keasling, J.D., 2016. Engineering prokaryotic transcriptional activators as metabolite

biosensors in yeast. Nat. Chem. Biol. 12, 951–958. doi:10.1038/nchembio.2177

Snoek, T., Romero-Suarez, D., Zhang, J., Ambri, F., Skjoedt, M.L., Sudarsan, S., Jensen,

M.K., Keasling, J.D., 2018. An Orthogonal and pH-Tunable Sensor-Selector for Muconic

Acid Biosynthesis in Yeast. ACS Synth. Biol. 7, 995–1003. doi:10.1021/acssynbio.7b00439

Song, H.-S., Goldberg, N., Mahajan, A., Ramkrishna, D., 2017. Sequential computation of

elementary modes and minimal cut sets in genome-scale metabolic networks using alternate

integer linear programming. Bioinformatics 33, 2345–2353.

doi:10.1093/bioinformatics/btx171

Srinivasan, S., Cluett, W.R., Mahadevan, R., 2015. Constructing kinetic models of

metabolism at genome-scales: A review. Biotechnol. J. 10, 1345–1359.

doi:10.1002/biot.201400522

Steffensen, J.L., Dufault-Thompson, K., Zhang, Y., 2016. PSAMM: A Portable System for

the Analysis of Metabolic Models. PLoS Comput. Biol. 12, e1004732.


Tai, Y.-S., Xiong, M., Jambunathan, P., Wang, J., Wang, J., Stapleton, C., Zhang, K., 2016.

Engineering nonphosphorylative metabolism to generate lignocellulose-derived products.

Nat. Chem. Biol. 12, 247–253. doi:10.1038/nchembio.2020

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Tepper, N., Shlomi, T., 2009. Predicting metabolic engineering knockout strategies for

chemical production: accounting for competing pathways. Bioinformatics 26, 536–543.

doi:10.1093/bioinformatics/btp704

Thompson, B., Pugh, S., Machas, M., Nielsen, D.R., 2018. Muconic Acid Production via

Alternative Pathways and a Synthetic “Metabolic Funnel.” ACS Synth. Biol. 7, 565–575.


Tiwari, M.K., Singh, R., Singh, R.K., Kim, I.-W., Lee, J.-K., 2012. Computational

approaches for rational design of proteins with novel functionalities. Comput. Struct.

Biotechnol. J. 2, e201204002. doi:10.5936/csbj.201209002

Trinh, C.T., Liu, Y., Conner, D.J., 2015. Rational design of efficient modular cells. Metab.

Eng. 32, 220–231. doi:10.1016/j.ymben.2015.10.005

Tseng, H.-C., Prather, K.L.J., 2012. Controlled biosynthesis of odd-chain fuels and chemicals

via engineered modular metabolic pathways. Proc. Natl. Acad. Sci. 109, 17925–17930.

doi:10.1073/pnas.1209002109

Tsuge, Y., Kawaguchi, H., Yamamoto, S., Nishigami, Y., Sota, M., Ogino, C., Kondo, A.,

2018. Metabolic engineering of Corynebacterium glutamicum for production of sunscreen

shinorine. Biosci., Biotechnol., Biochem.82, 1252–1259.

doi:10.1080/09168451.2018.1452602

Turk, S.C.H.J., Kloosterman, W.P., Ninaber, D.K., Kolen, K.P.A.M., Knutova, J., Suir, E.,

Schürmann, M., Raemakers-Franken, P.C., Müller, M., de Wildeman, S.M.A., Raamsdonk,

L.M., van der Pol, R., Wu, L., Temudo, M.F., van der Hoeven, R.A.M., Akeroyd, M., van der

Stoel, R.E., Noorman, H.J., Bovenberg, R.A.L., Trefzer, A.C., 2015. Metabolic Engineering

toward Sustainable Production of Nylon-6. ACS Synth. Biol. 5, 65–73.


ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Venayak, N., Anesiadis, N., Cluett, W.R., Mahadevan, R., 2015. Engineering metabolism

through dynamic control. Curr. Opin. Biotechnol. 34, 142–152.

doi:10.1016/j.copbio.2014.12.022

Venayak, N., von Kamp, A., Klamt, S., Mahadevan, R., 2018a. MoVE identifies metabolic

valves to switch between phenotypic states. Nat. Commun. 9. doi:10.1038/s41467-018-

07719-4

Venayak, N., Raj, K., Jaydeep, R., Mahadevan, R., 2018b. An Optimized Bistable Metabolic

Switch to Decouple Phenotypic States during Anaerobic Fermentation. ACS Syn. Bio.


Vogt, M., Brüsseler, C., Ooyen, J. van, Bott, M., Marienhagen, J., 2016. Production of 2-

methyl-1-butanol and 3-methyl-1-butanol in engineered Corynebacterium glutamicum.


Völler, J.-S., Budisa, N., 2017. Coupling genetic code expansion and metabolic engineering

for synthetic cells. Curr. Opin. Biotechnol. 48, 1–7. doi:10.1016/j.copbio.2017.02.002

von Kamp, A., Klamt, S., 2014. Enumeration of Smallest Intervention Strategies in Genome-

Scale Metabolic Networks. PLoS Comput. Biol. 10, e1003378.


von Kamp, A., Klamt, S., 2017. Growth-coupled overproduction is feasible for almost all

metabolites in five major production organisms. Nat. Commun. 8, 15956.

doi:10.1038/ncomms15956

Walther, T., Topham, C.M., Irague, R., Auriol, C., Baylac, A., Cordier, H., Dressaire, C.,

Lozano-Huguet, L., Tarrat, N., Martineau, N., Stodel, M., Malbert, Y., Maestracci, M., Huet,

R., André, I., Remaud-Siméon, M., François, J.M., 2017. Construction of a synthetic

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

metabolic pathway for biosynthesis of the non-natural methionine precursor 2,4-

dihydroxybutyric acid. Nat. Commun. 8, 15828. doi:10.1038/ncomms15828

Wang, J., Jain, R., Shen, X., Sun, X., Cheng, M., Liao, J.C., Yuan, Q., Yan, Y., 2017b.

Rational engineering of diol dehydratase enables 1,4-butanediol biosynthesis from xylose.


Wang, J., Yang, Y., Zhang, R., Shen, X., Chen, Z., Wang, J., Yuan, Q., Yan, Y., 2018.

Microbial production of branched-chain dicarboxylate 2-methylsuccinic acid via enoate

reductase-mediated bioreduction. Metab. Eng. 45, 1–10. doi:10.1016/j.ymben.2017.11.007

Wang, L., Dash, S., Ng, C.Y., Maranas, C.D., 2017a. A review of computational tools for

design and reconstruction of metabolic pathways. Synth Syst Biotechnol. 2, 243–252.

doi:10.1016/j.synbio.2017.11.002

Wang, L., Ji, D., Liu, Y., Wang, Q., Wang, X., Zhou, Y.J., Zhang, Y., Liu, W., Zhao, Z.K.,

2017c. Synthetic Cofactor-Linked Metabolic Circuits for Selective Energy Transfer. ACS

Catal. 7, 1977–1983. doi:10.1021/acscatal.6b03579

Wang, X., Chen, X., Yang, Y., 2012. Spatiotemporal control of gene expression by a light-

switchable transgene system. Nat. Met. 9, 266–269. doi:10.1038/nmeth.1892

Werpy, T.A. and Petersen, G., Eds. (2004) Top Value Added Chemicals from Biomass.

Volume I: Results of Screening for Potential Candidates from Sugars and Synthesis Gas.

Pacific Northwest National Laboratory (PNNL) and National Renewable Energy Laboratory

(NREL).

Woolfson, D.N., Bartlett, G.J., Burton, A.J., Heal, J.W., Niitsu, A., Thomson, A.R., Wood,

C.W., 2015. De novo protein design: how do we expand into the universe of possible protein

structures? Curr. Opin. Struct. Biol. 33, 16–26. doi:10.1016/j.sbi.2015.05.009

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Xiu, Y., Jang, S., Jones, J.A., Zill, N.A., Linhardt, R.J., Yuan, Q., Jung, G.Y., Koffas,

M.A.G., 2017. Naringenin-responsive riboswitch-based fluorescent biosensor module for

Escherichia coli co-cultures. Biotechnol. Bioeng.114, 2235–2244. doi:10.1002/bit.26340

Xu, P., 2018. Production of chemicals using dynamic control of metabolic fluxes. Curr. Opin.


Xu, P., Li, L., Zhang, F., Stephanopoulos, G., Koffas, M., 2014. Improving fatty acids

production by engineering dynamic pathway regulation and metabolic control. Proc. Natl.

Acad. Sci. 111, 11299–11304. doi:10.1073/pnas.1406401111

Yamamoto, S., Suda, M., Niimi, S., Inui, M., Yukawa, H., 2013. Strain optimization for

efficient isobutanol production using Corynebacterium glutamicum under oxygen

deprivation. Biotechnol. Bioeng. 110, 2938–2948. doi:10.1002/bit.24961

Yang, J.E., Park, S.J., Kim, W.J., Kim, H.J., Kim, B.J., Lee, H., Shin, J., Lee, S.Y., 2018.

One-step fermentative production of aromatic polyesters from glucose by metabolically

engineered Escherichia coli strains. Nat. Commun. 9. doi:10.1038/s41467-017-02498-w

Yang, L., Cluett, W.R., Mahadevan, R., 2011. EMILiO: A fast algorithm for genome-scale

strain design. Metab. Eng. 13, 272–281. doi:10.1016/j.ymben.2011.03.002

Yim, H., Haselbeck, R., Niu, W., Pujol-Baxley, C., Burgard, A., Boldt, J., Khandurina, J.,

Trawick, J.D., Osterhout, R.E., Stephen, R., Estadilla, J., Teisan, S., Schreyer, H.B., Andrae,

S., Yang, T.H., Lee, S.Y., Burk, M.J., Van Dien, S., 2011. Metabolic engineering of

Escherichia coli for direct production of 1,4-butanediol. Nat. Chem. Biol.7, 445–452.

doi:10.1038/nchembio.580

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Yu, T., Zhou, Y.J., Huang, M., Liu, Q., Pereira, R., David, F., Nielsen, J., 2018.

Reprogramming Yeast Metabolism from Alcoholic Fermentation to Lipogenesis. Cell 174,

1549–1558.e14. doi:10.1016/j.cell.2018.07.013

Yuan, H., Liu, Y., Li, J., Shin, H., Du, G., Shi, Z., Chen, J., Liu, L., 2018. Combinatorial

synthetic pathway fine-tuning and comparative transcriptomics for metabolic engineering of

Raoultella ornithinolytica BF60 to efficiently synthesize 2,5-furandicarboxylic acid.

Biotechnol. Bioeng.. doi:10.1002/bit.26725

Zanghellini, A., 2014. De novo computational enzyme design. Curr. Opin. Biotechnol. 29,

132–138. doi:10.1016/j.copbio.2014.03.002

Zanghellini, J., Ruckerbauer, D.E., Hanscho, M., Jungreuthmayer, C., 2013. Elementary flux

modes in a nutshell: Properties, calculation and applications. Biotechnol. J. 8, 1009–1016.

doi:10.1002/biot.201200269

Zastrow, M.L., Pecoraro, V.L., 2014. Designing Hydrolytic Zinc Metalloenzymes.

Biochemistry 53, 957–978. doi:10.1021/bi4016617

Zhang, J., Jensen, M.K., Keasling, J.D., 2015. Development of biosensors and their

application in metabolic engineering. Curr. Opin. Chem. Biol. 28, 1–8.

doi:10.1016/j.cbpa.2015.05.013

Zhao, M., Huang, D., Zhang, X., Koffas, M.A.G., Zhou, J., Deng, Y., 2018. Metabolic

engineering of Escherichia coli for producing adipic acid through the reverse adipate-

degradation pathway. Metab. Eng. 47, 254–262. doi:10.1016/j.ymben.2018.04.002

Zhou, H., Vonk, B., Roubos, J.A., Bovenberg, R.A.L., Voigt, C.A., 2015. Algorithmic co-

optimization of genetic constructs and growth conditions: application to 6-ACA, a potential

nylon-6 precursor. Nucleic Acids Res. gkv1071. doi:10.1093/nar/gkv1071

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Ziemert, N., Alanjary, M., Weber, T., 2016. The evolution of genome mining in microbes – a

review. Nat. Prod. Rep. 33, 988–1005. doi:10.1039/c6np00025h

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Tables and Figure Captions

Table 1. Non-natural chemicals currently being chemically produced from biomass

Chemical Uses Raw material Company

FDCA, PEF Bioplastic glucose (YXY

technology)

Avantium, The

Netherlands

Lactide, PLA Bioplastic bio-lactic acid NatureWorks, MN,

U.S.A.

Corbion, The

Netherlands

Butadiene Biorubber bio 1,3 BDO Eni-Versalis, Italy

Synthetic Paraffinic

Kerosene (ATJ-SPK)

Jetfuel bioethanol LanzaTech, IL, U.S.A.

Methyl Acrylate,

Isosorbide Diacrylate

bio-based acrylate resins glucose PTT MCC Biochem,

Thailand

Adipic acid Nylon fibers glucose Verdezyne*, CA, U.S.A.

Rennovia*, CA, U.S.A.

Isobutylene Fuel additives glucose Global Bioenergies,

France

hydroxymethylamine Nylon fibers glucose Rennovia*, CA, U.S.A.

1,6-Hexanediol polyurethanes glucose Rennovia*, CA, U.S.A.

Levoglucosenone,

Hydroxymethyl

Butyrolactone

Drugs, herbicides,

flavourings and fragrances

cellulose Circa, Australia

Abbreviations: FDCA – 2,5 furan dicarboxylic acid; PEF – polyethylene furanoate; PLA –

polylactic acid; BDO – butanediol; * Currently in finatial recuperation.

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Table 2. Laboratory scale bioprocess for non-natural chemicals production

Compound Titer (g/L) Microrganism Substrate Systems Biology

Strategy

Reference

1,4 BDO 18.0 Escherichia coli glucose SimPheny

Biopathway

Predictor;

Promiscuous

enzyme screen;

OptKnock

deletions

Yim et al.,

2011

1,4 BDO 0.5 Escherichia coli D-xylose Promiscuous

enzyme screen;

dynamic control;

orthogonality

Liu and Lu,

2015

1,4 BDO 16.5 Escherichia coli D-xylose Promiscuous

enzyme screen;

enzyme rational

design

Tai et al., 2016

1,4 BDO 0.2 Escherichia coli D-xylose Enzyme rational

design

Wang et al.,

2017b

1,3 BDO 2.4 Escherichia coli glucose Promiscuous

enzyme screen

Nemr et al.,

2018

1,3 PDO 0.416 Escherichia coli glucose,

propionate

- Kataoka et al.,

2017

1,2‐Phenylethanediol 1.23 Escherichia coli glucose

Promiscuous

enzyme screen

Mckeena et al.,

2013

4-Methyl-1,3-

pentanediol

0.072 Escherichia coli glucose,

isobutyrate

- Kataoka et al.,

2017

1,2,4-Butanetriol 0.003 Escherichia coli glucose,

glycolate

- Kataoka et al.,

2017

Isobutanol 22.0 Escherichia coli glucose - Atsumi and

Liao, 2008

Isobutanol 1.62 Saccharomyces

cerevisiae

glucose - Matsuda et al.,

2013

Isobutanol 25.4 Corynebacterium

glutamicum

glucose - Yamamoto et

al., 2013

Isobutanol 5.4 Clostridium

thermocellum

cellulose - Lin et al., 2015

Isobutanol 2.22 Pichia pastoris glucose Promiscuous

enzyme screen

Siripong et al.,

2018

2-Methyl-1-

butanol 1.25 Escherichia coli glucose

Promiscuous

enzyme screen

Cann and Liao,

2008

2-Methyl-1-

butanol 0.37

Corynebacterium

glutamicum glucose

- Vogt et al.,

2016

Isopentanol 9.5 Escherichia coli glucose

- Connor et al.,

2010

Isopentanol 0.086 Escherichia coli glucose

- Guo et al.,

2014

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

1-Pentanol 1.137 Escherichia coli

trans-2-

pentanoate

and formate

Promiscuous

enzyme screen Tseng and

Prather, 2012

1-Pentanol 4.3 Escherichia coli glucose

Enzyme rational

design

Chen et al.,

2017

1-Pentanol 0.137 Escherichia coli

1,2

pentanediol

Enzyme rational

design Dai et al., 2017

1-Pentanol 0.118 Escherichia coli glucose

Enzyme rational

design; genome

mining

Mak et al.,

2015

1-Hexanol 0.341 Escherichia coli glucose Enzyme rational

design; genome

mining

Mak et al.,

2015

1-Heptanol 0.269 Escherichia coli glucose Enzyme rational

design; genome

mining

Mak et al.,

2015

Nonane 0.327 Escherichia coli glucose

- Choi and Lee,

2013

Dodecane 0.136 Escherichia coli glucose

- Choi and Lee,

2013

Tridecane 0.064 Escherichia coli glucose

- Choi and Lee,

2013

2-Methyl-

dodecane 0.042 Escherichia coli glucose

- Choi and Lee,

2013

Tetradecane 0.009 Escherichia coli glucose

- Choi and Lee,

2013

Isobutyl acetate 0.051 Pichia pastoris glucose

Promiscuous

enzyme screen

Siripong et al.,

2018

2,4-

dihydroxybutyric

acid 1.8 Escherichia coli glucose

Thermodynamic

and stoichiometric

analysis;

enzyme

rational design;

pathway balancing

Walther et al.,

2017

Muconic acid 36.6 Escherichia coli glucose

- Niu et al.,

2008

Muconic acid 59 Escherichia coli cathecol

- Kaneko et al.,

2011

Muconic acid 0.141

Saccharomyces

cerevisiae glucose

FBA; Pathway

balancing

Curran et al.,

2013

Muconic acid 2.1

Saccharomyces

cerevisiae glucose

Biosensor Leavitt et al.,

2017

Muconic acid 64.2

Pseudomonas

putida cathecol

Pathway balancing Kohlstedt et

al., 2018

Muconic acid 3.1 Escherichia coli glucose

Thermodynamic;

Elementary Flux

Mode Analysis

Thompson et

al., 2017

Muconic acid 1.8

Corynebacterium

glutamicum lignin

- Becker et al.,

2018

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Muconic acid 85

Corynebacterium

glutamicum cathecol

- Becker et al.,

2018

Muconic acid 5.1

Saccharomyces

cerevisiae glucose

- Pyne et al.,

2018

Muconic acid 52.3

Pseudomonas

putida

benzoate,

coumarate,

corn stover

-

Salvachua et

al., 2018

Adipic acid <0.001 Escherichia coli glucose - Yu et al., 2014

Adipic acid 2.5 Escherichia coli glycerol Cheong et al.,

2016

Adipic acid 0.003 Saccharomyces

cerevisiae

glucose -

Raj et al., 2018

Adipic acid 68 Escherichia coli glucose Pathway balancing Zhao et al.,

2018

Acrylic acid 0.12 Escherichia coli glucose

Promiscuous

enzyme screen

Chu et al.,

2015

4-

Hydroxybenzoic

Acid 1.82 Escherichia coli glucose

-

Noda et al.,

2016

4-

Hydroxybenzoic

Acid 36.6

Corynebacterium

glutamicum glucose

Promiscuous

enzyme screen Kitade et al.,

2018

2-

hydroxyisovaleric


- Cheong et al.,

2018

2-Methylsuccinic


Promiscuous

enzyme screen

Wang et al.,

2018

Polylactic acid 3%CDW Escherichia coli glucose

MOMA; FBA Jung et al.,

2010

Butyrolactam 54.14 Escherichia coli glucose

In-silico flux

response analysis;

thermodynamic

analysis;

promiscuous

enzyme screen

Chae et al.,

2017

Valerolactam 1.18 Escherichia coli glucose

In-silico flux

response analysis;

thermodynamic

analysis;

promiscuous

enzyme screen

Chae et al.,

2017

Caprolactam <0.001 Escherichia coli glucose

In-silico flux

response analysis;

thermodynamic

analysis;

promiscuous

enzyme screen

Chae et al.,

2017

Caprolactam 0.160 Escherichia coli glucose Promiscuous Turk et al.,

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

enzyme screen 2016

6-Aminocaproic


Pathway balancing Zhou et al.,

2015

5-Aminovaleric

acid 33.1

Corynebacterium

glutamicum glucose

- Shin et al.,

2016

D-Phenyllactic


Promiscuous

enzyme screen

Yang et al.,

2018

Poly (3HB-co-D-

phenyllactate) 13.9 Escherichia coli glucose

Promiscuous

enzyme screen

Yang et al.,

2018

Shinorine 0.019

Corynebacterium

glutamicum

sodium

gluconate

- Tsuge et al.,

2018

Shinorine 0.154

Streptomyces

avermitilis glucose

Genome mining Miyamoto et

al., 2014

Styrene 0.3 Escherichia coli glucose

Promiscuous

enzyme screen

Mckenna and

Nielsen, 2011

Styrene oxide 1.32 Escherichia coli glucose

Promiscuous

enzyme screen

Mckeena et al.,

2013

Styrene 0.029

Saccharomyces

cerevisiae glucose

- Mckenna et al.,

2014

Styrene 0.35 Escherichia coli glucose

Promiscuous

enzyme screen;

pathway balancing Liu et al., 2018

3,4-

Dihydroxystyrene 0.063 Escherichia coli glucose

- Kang et al.,

2015

4-Hydroxy-3-

methoxy styrene 0.064 Escherichia coli glucose

- Kang et al.,

2015

4-Hydroxy

styrene 0.355 Escherichia coli glucose

- Kang et al.,

2015

FDCA 30.1

Pseudomonas

putida 5-HMF

- Koopman et

al., 2010

FDCA 13.9

Raoultella

ornithinolytica 5-HMF

- Hussain et al.,

2016

FDCA 34.5

Raoultella

ornithinolytica 5-HMF

Pathway balancing Yuan et al.,

2018

Phenol 1.69 Escherichia coli glucose

- Kim et al.,

2013


- Ren et al.,

2015


Thermodynamic

analysis;

promiscuous

enzyme screen;

elementar flux

modes

Thompson et

al., 2018

C50 astaxanthin

770 μg

gDCW−1 Escherichia coli

glucose

Directed evolution Furubayashi et

al., 2015

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Abbreviations: FBA – Flux Balance Analysis; MOMA – Minimization of Metabolic

Adjustment; FDCA - 2,5-Furandicarboxylic acid; 5-HMF – Hydroxymethylfurfural.

Figure 1. Synthetic pathways for non-natural chemicals. 1,4 BDO can be produced from

xylose by manipulating native xylose catabolism in two different ways or from the central

metabolism, using ketoglutarate and succinyl-CoA as intermediates. 1,3 BDO can be

produced from acetyl-CoA either using acetaldehyde or butyryl-CoA as intermediates.

Acrylic acid can be produced by extrapolating from the propanoate metabolism, using

acryloyl-CoA as a precursor. Adipic acid can be produced either from benzoate or from

central metabolism by using a reverse benzoate degradation pathway, or from shikimate

pathway, using dehydroshikimate (DHS) as intermediate and muconic acid, a non-natural

chemical itself, as precursor. Shikimate pathway can also be extended for producing 4-

hydroxybenzoic acid or phenol. In addition, several amino acids are key intermediates for

non-natural chemical production. Isoleucine and valine biosynthesis pathways can be

manipulated for producing 2-methyl-1-butanol, isobutanol and isobutyl acetate. Styrene and

styrene derivatives can be produced from tyrosine, by extrapolating from phenylpropanoid

biosynthesis. Finally, lysine and glutamate degradation pathways can be manipulated for

producing 5-aminovalerate, valerolactam, 6-aminocaproate, caprolactam and butyrolactam.

Figure 2. Typical workflow for implementation of systems biology based tools in

microorganisms. (A) Genomic, transcriptomic and proteomic data are pulled from databases

and analyzed. (B) Pathway prediction tools are used to compute intermediates from the

target product to the starting compound. (C) Natural enzymes that catalyze pathway

reactions are pulled from databases; if activities are low or not available, rational engineering,

directed evolution or de novo enzyme design may be applied. (D) Strain design algorithms

are applied to genome-scale metabolic models to determine a set of genetic perturbations for

achieving an engineering objective. (E) Engineered enzymes and genetic perturbations are

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

applied to the host strain in vivo. (F) Several strains are screened for optimal activity using

high-throughput methods. (G) Scale-up engineering at the bioreactor level is completed for

high-performing strains.

Figure 3. Key steps of in-silico non-native pathway design. (A) Databases for known

biochemical reactions and molecules and databases for hypothetical extended reaction

network. (B) Network representation used in different pathway computation tools. In a simple

graph network, only one major substrate and one product are kept for each reaction. In a

bipartite graph, the two types of nodes represent reaction operators and intermediate

compounds, respectively. In a (directed) hypergraph, each hyper arc represents a different

reaction connecting multiple reactants and products. (C) Three types of molecular and

reaction representation incorporated in different tools, 1. BEM or bond-electron-matrix

representation of reactants and products uses a matrix to include all the bond information

between each pair of non-hydrogen atoms, 2. SMILES representation of molecular structure

and SMARTS/SMIRKS way of writing reaction rule, 3. Atom mapping tracks each atom in

the reactants. (D) Different search algorithms include breadth-first search, depth first search

(from source compounds), retrosynthesis (search starting from target compound) and double-

direction search. (E) Various pathway ranking/prioritization methods include estimating

chemical structural similarity, pathway length, enzyme docking, thermodynamic feasibility

and number of non-native reaction steps.

Figure 4. Enzyme engineering approaches. (A) Directed evolution process starts with mining

genomic metadata for suitable starting enzymes or identifying key residues for catalytic

activity in an existing starting enzyme. The next step involves creating a combinatorial

library by random mutagenesis of the gene of interest (error-prone PCR, DNA-shuffling) or

targeted mutagenesis of the residues of interest (site-directed mutagenesis, site-saturated

mutagenesis). The variants are then expressed in vivo and those with adequately robust

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

growth (assayed by fluorescence-based methods) (ie. FACS), culturing in liquid medium or

on LB Agar plates, or phage display are selected for production screens using a variety of

methods of measurement, including gas and liquid chromatography, biosensors and mass

spectrometry. For combinatorial libraries generated by random mutagenesis, the variants that

pass the screen for production are iteratively mutated, selected for, and screened until those

that remain exhibit a satisfactory level of performance. For libraries obtained by targeted

mutagenesis, the enzyme structure is updated to include the most successful mutation and

may undergo targeted mutagenesis for another key residues one-by-one or simultaneously

(MAGE). (B) Biosensors take the production of the target compound as an input signal, and

output either an increased expression of a reporter gene such as GFP or an increased

expression of an antibiotic resistance gene. (C) In the “inside out” enzyme design, the

catalytic mechanism is first chosen by defining the reactants and products, after which the

transition state of the reaction is established. Then, complex calculations are performed

computationally to obtain the 3-D structure of the amino acids around the transition state to

maximize stability (the ‘theozyme’). Finally, the RosettaDesign algorithm is used to predict

the 3-D structure of amino acids around the theozyme to stabilize it by matching a protein

scaffold.

Figure 5. (A) Mixed-Integer Linear Programming methods such as OptKnock can suggest

knockout strategies that result in strong or weak growth coupling. (B) Minimal cut sets

(MCS) eliminate undesired phenotypes by incorporating a minimum product yield as a

constraint. (C) Constrained minimal cut sets (cMCS) build on minimal cut sets by

incorporating an additional minimum biomass yield as a constraint. (D) MoVE suggests

metabolic valves to switch between a growth state and a production state. (E) Metaheuristic

methods have decoupled cellular and engineering objectives. (F) Kinetic models are

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

burdened by parameterization requirements which may be addressed using computational

alternatives.

Figure 6. Orthogonal production and dynamic control as alternatives to growth-coupled

production. (A) Pathways featuring a greater number of reactions shared between the biomass

or growth (native metabolism) and production branches will suffer from interference more

characteristic of growth-coupled systems. (B) Minimizing the number of common enzymatic

steps leads to a reduction in the cross-talk and an increase in orthogonality can be attained at

the organism level. Such orthogonal networks are characterized by minimal cross-talk

between growth and biosynthetic pathways. (C) Dynamic control tools such as genetic toggle

switches can reversibly divert flux through the biomass and production branches at any given

time. This is achieved by an external signal such as the presence of chemical inducer, absence

of substrate, or a change in temperature. The bistable switch allows for the minimization of

the trade-off between growth and production that usually acts as a barrier for both to be

optimal. In state 1, the chemical inducer is represented as pink molecules and it leads to the

induction of expression of the pink repressor protein as well as the pink gene. The pink

repressor blocks expression of the yellow gene and the yellow repressor protein. In state 2,

the chemical inducer is represented as yellow molecules and it leads to the induction of

expression of the yellow repressor protein and the yellow gene instead. Consequently, the

yellow repressor blocks the expression of the pink gene and the pink repressor protein.

Hence, this bistable switch alternates between states during which one of the genes is

expressed, and the other is repressed. CP refers to a common precursor shared between the

biomass and production branches of the network.

Figure 7. Non-natural redox cofactors may be used to increase the orthogonality of

biosynthetic pathways. (A) When the redox states in a biosynthetic pathway of interest are

mediated by natural redox cofactors, cross-talk will exist with the cell’s natural metabolism.

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

(B) The use of non-natural redox cofactors for a biosynthetic pathway while the cellular

metabolism continues recognizing natural cofactors can reduce interference and increase the

orthogonality at the pathway level. The most popular non-natural redox cofactors used for

this purpose include nicotinamide cytosine dinucleotide (NCD) and nicotinamide flucytosine

dinucleotide (NFCD), both based on the natural backbone of nicotinamide adenine

dinucleotide (NAD), represented in (C) as nicotinamide nucleobase dinucleotide (NND).

ACCEPTED MANUSCRIPT

ACC

EPTE

D M

ANU

SCR

IPT

Highlights

Pathways to more than 60 Non-natural chemicals described

Computational tools for pathway prediction and host optimization for non-natural chemical

summarized

Enzyme discovery and engineering methods using directed evolution, biosensors and de novo

design presented

Emerging concepts in orthogonal pathway design and dynamic control discussed

ACCEPTED MANUSCRIPT

systems biology based metabolic engineering for non-natural … · 2019. 4. 17. · mcs minimal cut...

Documents