cancer metastatic processes contextualized by networks ...€¦ · cancer metastatic processes...

Cancer Metastatic Processes Contextualized by Networks: Role of Entropy, Symmetry and Controllability.Networks are pervasive in biomedicine, especially in the realm

of experimental studies and computational applications. Their

relevance reflects the ability to model systemic complexity. While

entropy has been central to many studies focused on equilibrium,

more recently other concepts like symmetry, controllability and

synchronization have emerged. In light of such properties,

networks’ impacts are still partially unknown in domains like

cancer research, for which a thorough assessment is here provided.

Depending on the question we ask, the use of networks can be

different. In general, the analysis of complex processes such

carcinogenesis involve the consideration of the so-called cancer

hallmarks. In particular, it would be important to establish how

the metastatic processes take place and progress through the

disease stages. One way to tackle the problem is to analyze how

network states can reflect these metastatic processes, possibly

identifying dysregulation paths whose characteristic features

might guide inference. Our main goal is to cast into network

cancer domain information such that it can be translated into

well-defined and interpretable features. Therefore we consider a

spectrum of entropies, at both local and global network scale,

and we look at both symmetry and controllability. A few examples

are provided to demonstrate the role that typical network

constraints may play when measures or indicators of metastasis

are available.

by Enrico Capobianco

Introduction

1.1. Complexity through the lens of Networks

Cancer research is a paradigmatic complex

context representing the natural setting for

network science studies and applications.

It is well-known that multiple factors must

be considered to explain the carcinogenetic

process, from endogenous ones like DNA

replication or cellular interactions, to

exogenous ones, like for instance lifestyle,

diet and environmental influences. Given the

redundancy (number, variety, etc.) of these

influencing factors, the impact exerted on

cancer heterogeneity is clear. Heterogeneity

is the most inherently complex characteristic

of cancer, the one that makes hard to cure

it because the one that eventually fuels the

resistance mechanisms that negatively affecting

treatment response1.

Redundancy in cancer is typically recognized

because of the multiple bioentities to be

considered and the various types of associ-

ations. Especially through the recourse to

genomics studies, redundancy has become

even more an issue. Genomes can reveal

differentiated somatic alterations of normally

functional cells between and within

cancet types2. Most importantly, functional

redundancy is fundamental to establish a sort

of biological robustness, and the complexity

of the relationship between redundancy and

robustness is often described by quantitative

65

PR

EC

ISIO

N T

OO

LS

66

approaches3. Finally, only a fraction of all

possible genomic measurements can be

detected through profiling techniques: it is

especially important at the translational level

the fact that relatively few examples are really

useful to either biological or clinical validation

purposes. In fact, most experimental validation

techniques, including clinical trials, are not

designed for high-throughput redundant

analysis, thus reducing the list of candidate

genes or proteins playing the role of putative

diagnostic biomarkers or therapeutic targets.

Another property of absolute relevance for

cancer treatment purposes is synergy.

Synergistic interactions are typically found

in cancer contexts, and they involve genes,

proteins, drugs etc.4. However, synergies

are quite hard to explain5. In principle,

exploiting synergy in a complex system means

to be able to limit the overall redundancy

effects. Equivalently, its informative contents

can be assigned to a reduced number of

components able to combine together and

amplify their individual contributions. Gene

regulation, pathway activation or clinical

markers imply the role of synergy, especially

because with such ensembles the use of

pairwise associations cannot ensure a global

understanding of complex dynamics. Despite

the clear need of identifying synergistic

regulation or dysregulation for inference

scopes, most of the times the identifications

that one can get are computationally

intractable, such that one turns again to

pairwise relationships. This is clearly a

coarse approximate solution.

In general, networks support the co-existence

of redundancy and synergy, but offer also

means to deal with them. Specifically in the

cancer context, as we aim to show. First of all,

consider modularity. A modular architecture

is known to add information content. This

is the case when the identified modules and

sub-networks are associated with functional

aspects and even complex functions,

possibly part of processes for which a

sequence of states can be defined. Cell types

can indeed be subjects to transitions between

the different states, depending on their

reference conditions. Intuitively, cellular

states could be studied by discrete-time

Markov models. Knowing the transition rates

for healthy and diseased cells could be

important for understanding disease

mechanisms, for estimating the temporal

effect of a certain perturbation or stressor,

and also for predicting the composition of

the cell state at a given time.

Other concepts addressing states and disease

phenotypes of interest involve the recourse

to so-called combinatorially dysregulated

subnetworks, which are obtained by genomic

systems decompositions aimed to identify

states that are most informative for pheno-

types 6,7. Synergistic dysregulation can leverage

differential expression, in particular when

synergy acts towards those genes for which

significant measure is lacking. The presumed

functional similarity in biological modules

legitimates the importance assigned to

coordinated actions that may exert effects

over phenotypes, including cancer ones.

1.2 Cancer Networks Taxonomy

Networks are powerful tools to infer the

dynamics of cancer systems because their

structural and functional characteristics

combine with the ability of visualizing and

abstracting complex features of experimental,

computational, clinical conditions. To this end,

networks can effectively synthesize different

types of data (qualitative and quantitative),

as well as inputs from various data generating

processes, such as sources of genomic,

metabolomics, proteomic, and imaging

data. Moreover, they provide visibility into

cross-disciplinary problems and areas, not

just for the aims of reframing and envisioning.

Networks guide discovery, deliver solutions,

and aid in managing the transition between

phases of complex processes.

A taxonomy of network types and models has

emerged in the recent years across disciplines.

Notably, this taxonomy remains still partial

in biomedicine, but progresses diffusively

in cancer research due to the advances in

epigenetics, non-coding RNA, imaging, just to

mention only a few examples. Multiple steps

are involved in the definition of ‘Next

Generation Networks’, following what has

been already seen in the Information

Technology and in the Telecommunication

fields, and providing ubiquitous connectivity

with pervasive accessibility to service,

application, content and information. Both

algorithms and methods are destined to

adapt to emerging applications covering

topics at the intersection of disciplines,

such as optimization of network design and

functionality, measuring influences of physical

network structural characteristics in

characterizing robust solutions, cross-layer-

ing algorithms for in-depth understanding of

cellular networks, considering the interactions

between multitype networks while balancing

their coexistence at the infrastructural level,

processing information packets instead of

single entities, surveying anomaly detection

approaches and monitoring early warnings,

facing ever increasing heterogeneity across the

application contexts.

When considering the exposome, it is clear

that this is becoming a crucial source of data

contributing significantly to cancer prevention

by prospectively reducing the global burden

of disease while remaining a prominent public

health issue8. The translation of environmental

epigenetics research to environmental policy

and public health solutions will enhance

chemical risk assessment and enable clinicians

to identify at-risk populations prior to disease

onset. For instance, environmental toxicants

exert their impact on health and disease

in part by modifying the epigenome, DNA

modifications that do not affect the underlying

sequence but can result in altered gene

expression and downstream phenotypic

changes. Research addressing perturbation

mechanisms at the epigenome level, from the

biological and functional significance of small

epigenetic changes (e.g. DNA methylation,

histone modifications, and non-coding RNA

expression), to the impact of epigenetic change

longitudinally on health outcomes are key

aspects.

The network taxonomy that is here considered

specific to the cancer context but is general-

izable to other diseases, is described in view

of challenges centered on three main network

properties: Entropy, Symmetry, Controllabil-

ity. In particular, entropy can be associated

with quantification of uncertainty and this

affects the discovery process; symmetry can

be associated with system robustness and from

here with resilience, but also generalization;

controllability concerns the ability to perform

feasible and accurate selection and to improve

prioritization. It is in particular the multifaceted

metastatic process that will be used in the

presented examples and use cases.

2 Results

2.1 Entropy to decipher Complexity.

There are many possible ways to introduce

entropy in complex biological systems,

including cancer ones; it is perhaps useful

to look at concepts whose potential is not

yet completely exploited. For example, a

prominent principled concept is set

complexity9-11. In biomedicine, this might

help the interpretation of network constraints

of functional relevance from a biological

viewpoint. The associated measure is called

differential interaction information, and the

relevance at biological levels is conveyed by

the global dependence idea and its collective

flavor, typical say of regulatory dynamics

between transcription factors and genes, or

between microRNAs and genes, to mention

two examples among other possible causal

associations revealed in multiple types of

networks. Biological networks are clearly

neither random not regular, instead they are

interesting mixtures of the two properties.

In particular, the network modular architecture

is able to maximize the set-complexity of a

graph. Set complexity is defined below for

two nodes i,j of a network as:

1where S is the Shannon entropy and MI

is the mutual information between nodes i,j.

Modularity is basically enforcing the

connectivity between nodes and represents

a characterizing structural feature, which is

quantitatively reflected in the MI measure.

As said, the gain comes from functional

significance of the dependence that goes

beyond simple correlation. In genomics,

this might be relevant to capture nonlinear

or high-order interactions due to highly

complex regulation signatures.

A second relevant concept is basin entropy,

used to parametrize the dynamical uncertainty

of a network from the connectivity of its

components, thus probing its behavior with

varying parameters12. Seen in relation with

basins of attraction that link initial conditions

to final states, this measure is useful to quantify

the possible unpredictability of the dynamical

system underlying the network13. For gene

regulatory networks, basin entropy can be

defined by considering the basins as the state

space components, as follows14:

S = - Σi si log(si)

2where si indicates the size of the

basin I, and S is minimal when one basin has

all the states and maximal when the states are

distributed into different attractors.

Since the state of a network defines its activity

through the component nodes, a state is clearly

a dynamic entity destined to fluctuate according

to trajectories determined by the various

transitions occurring from state to state. In

general, trajectories are purely random in a

stochastic system, and in noisy systems these

trajectories can exhibit multiple dynamical

regimes, unlike in deterministic systems. It may

also be possible that states tend to recur, in

such case there are attractor states, or simply

attractors. Other states may tend to flow into

attractors forming their basin, or basin of

attraction. The important aspect is thus that

the size of an attractor is contributed by the

number of basin states gravitating around it.

Entropy can be interestingly computed

for both these types of states, and being

state-space partitions not unique for a given

network, the same holds for the related state

entropy. This offers a rationale for relying on

average entropy measures, computed on

network ensembles.

Speaking of average entropies, a reference

goes to the centrality of fluctuation theorems 15.

They describe systems’ non-equilibrium

states analytically, through the statistical

fluctuations in time-averaged properties. This

can be rephrased as stating that the resilience

(or robustness) R in a system is correlated to

its level of uncertainty, hence its entropy S,

according to:

dR dS > 0

Ψ = c ∑ i ∑ j max{ Si Sj} MIij (1-MIij)

67

PR

EC

ISIO

N T

OO

LS

32.2 Metastasis: Stochastic Dynamics.

An entropy increase in the cancer context

can depend on the robustness of cancer cells

to perturbing factors or environmental

stressors, including the tumor microenviron-

ment16. However, it was observed that

differential expression, which leads to cancer

identification, and differential network

entropy are indeed two anticorrelated

entities17. An example is offered with

overexpressed proliferation genes (carrying

oncogenes) that were found associated with

reduced network entropy across several

cancers. Notably, the impact in cancer research

is relevant because this relationship suggests

that the metastatic potential could be

established at early stages of disease progres-

sion through proper entropy measurements.

It has also been established that from the

analysis of transcriptional signatures associated

with perturbed cancer-related genes, metastatic

cancer seems to fall into two main subtypes:

epithelial to mesenchymal transition or ETM-

like associated to an inflammation state, and

proliferative state (increased metabolism and

systemic stress signaling)18. Typically, ETM

transitions occur in cell morphology with

a corresponding reduction of cell function

synchronization and increased motility levels.

Another possible finer subtype derived from

the adaptive stress signaling could be referred

to the so-called phenotype switching, which

occurs after the inhibition of receptor

(non-receptor) kinases, or in response to

(chemo-) therapy. Most importantly, signaling

events occur not only dynamically but rapidly

in cancer cell that respond to therapy, and

then can turn to survival or not depending on

resistance. Adaptive stress signaling has been

indicated as the reference emerging research

area to such developments19.

Metastasis evolves over time and in response to

therapy. Therefore, cancer might in principle

be modelled according to a stochastic sampling

process, and the genotypes would be the result

of random drawing from mutational pooling,

this latter acting as the process underlying

the data generating mechanism20. A mutation

ensemble would thus be composed of various

states to which perturbed networks can be

associated depending on the probability of

state activation or repression following a

certain mutation. Under conditions of

equilibrium, or stationarity, the observed

experimental data endowed with the

phenotypic response (metastatic or non)

can be considered the outcome of ensemble

averages operating over many states. These

states have differentiated entropy degrees,

and because they interact between them

and with the environment, the influences at

systems level fluctuate depending on entangled

versus decoherent conditions.

A stochastic matrix can be formed by the

transition probabilities between states Xi

and Xj, assuming they are constant over time

and depend only on the reference state. This

way, the transition matrix would define a

discrete-time Markov process explaining state

evolution dynamics of the population of cells

under consideration21,22. At a network scale, a

dynamic rewiring occurs through node

interactions that determine the evolving

network state. When no change of state is

observed over time, an attractor can be

observed, or an attractor landscape if all

possible state trajectories are considered.

Both genetic and epigenetic alterations can

fuel such network evolution process, and

genomic instability or loss of proteostasis are

among the possible observed causes of

uncontrolled cell proliferation23.

A steady-state uncertainty measure of the

information flow is provided by the network

entropy computed from the entropy rate ∆S24:

∆S = ∑iπi Si

4considering the stationary distribution

of the transition probabilities π and with the

entropies Si computed locally (at each state).

A change in entropy rate would imply that

the effects of signaling are likely present in

response to some type of perturbation.

While an interesting interpretation has been

offered in a context of primary and metastatic

versus non-metastatic cancer, showing that

in the former case the entropy increases and

thus becoming an indicator of cancer progres-

sion, a finer formulation would be required to

consider network nodes instead of the states.

The local entropy for a given node would be a

special case:

Si = -[log( k)]-1 ∑i pi log pi

5with k as the degree of the reference

node i, and pi the probability linking the

reference node with other nodes. An

important consequence is that cancer

phenotypes can be assessed by the use of

their local entropies from node-wise degrees.

The distribution obtained this way can allow

the assessment of network heterogeneity, and

in particular how randomness characterizes

metastatic vs non-metastatic networks.

A principled elucidation of the dynamical

responses of cancer networks naturally

involves therapy, and the starting point is

measuring the differential gene expression

that determines phenotype-driven network

configurations. Under the influence of external

factors, a phase transition may occur once

certain critical levels are reached. A normal

cell can undergo a transition and thus become

not apoptotic due to cumulated mutations

too. Epigenetics play a central role in such

69

PR

EC

ISIO

N T

OO

LS

regards25. However, relevance goes to genome

stability too, in parallel and in light of genomic

rearrangement studies elucidating concepts like

chromothripsis and chromoplexy that underlie

catastrophic events. In general, some types of

compensatory perturbations might be designed

and used to direct a network to a desirable

state, thus deviating from undesirable ones.

This topic is re-considered next, within the

control arguments.

While a phase space represents all possible

states in a dynamical system, it is important

to note that biological systems tend to be

sparse, with a high number of degrees of

freedom reflected over the big set of parameters

characterizing the system. Similarly, generators

of complexity are the interactions among

bio-entities like proteins, many of them being

false negatives or missing values, or the profiles

of genes, only in part significant due to noisy

signals and presenting multiple transcripts.

Depending on the possibility to limit the

overall uncertainty, this complexity can be

mitigated by dimensionality reductions aiming

at preserving the structural information, Due

to sparsity, the local structures are those that

usually orient the phase space while the system

evolves. For example, multiple evolutionary

trajectories are observed in cancers even

before achieving a transformed state, such as a

metastatic one, thus establishing the role of the

temporal dimension of cancer heterogeneity.

2.3. Symmetry: General concepts.

Biological signals that cause a shift in the

normal balance of symmetric and asymmetric

cell division can trigger differentiation arrest,

and for instance cancer progression26.

Studies on symmetric stem cell divisions

have considered their role in homeostasis27. While asymmetric division is very

important in generating diversity during

development, its dysregulation is relevant

for promoting oncogenesis. It is well-known

that a small fraction of cancer stem cells or

cancer-initiating cells exist in the affected

cell populations exhibit resistance to many

anticancer drugs and produce many heteroge-

neous tumor cell populations, resulting in

a high frequency of tumor recurrence

and metastasis Cancer stem cells undergo

asymmetric cell division, which is a physio-

logical event of normal stem cells. It occurs

during development and tissue homeostasis

and maintains a balance between self-renewal

stem cells and differentiated cells in a single

division, therefore between the stem cell

pool and the progenitor cell pool28,29.

Symmetry is also a key factor in brain network

functionality, to which neural activity must

be coordinated according to mechanisms that

remain largely unknown. Recently, behavioral

hypotheses have emerged, and they suggest

that symmetry breaking of network connectivity

shapes the timescale hierarchy that eventually

ends in some target functional subspace. In

turn, the behavior emerges when the appropriate

conditions imposed upon the couplings are

satisfied, justifying the conductance property

of synaptic couplings30,31.

The structure of complex networks is increas-

ingly chosen as the context of application

of symmetry concepts. This depends quite

naturally on the presence of connectivity

patterns that are rarely unique in the same

network. Therefore, of interest here is how

these patterns that may determine symmetry

can influence structurally and functionally

biological networks.

A few open questions remain:

l Do symmetric patterns represent localized

network dynamics, such as modules, clusters,

motifs? If yes, how specifically vs redundantly

so?

l Are such patterns robust or fragile structures

with regard to symmetry breaking?

When considering the context of biomedicine,

the following questions emerge in relation

with cancer progression and therapy:

l What influences do symmetry patterns exert

at a global network disease scale? Are they

diffusive, heterogeneous, redundant?

l Is some form of control possible over these

patterns?

Detecting symmetries facilitates the achieve-

ment of reliable inference, particularly in

presence of multimodal distributions or

spurious correlations. For instance, likelihood

functions can have symmetries. Consider

sample data x ϵ X and the choice of model

M(x,θ), given the parameters θ ϵ θ*. These

parameters may be non-identifiable due to the

underlying presence of symmetry. Namely, a

symmetry s: θ θ is a measurable function

that makes θ no longer identifiable from the

data x. In such cases, the recourse to symmetry

breaking solutions can be needed to augment

inference performance. The natural way to

obviate to the effects of symmetry is to change

the model parameterization. A re-parametriza-

tion means to enable a transform θ ψ, with

ψ ϵ ψ*, or as an alternative strategy to

constraint the given parameters, say θc for

given c. There are several types of symmetries

and some are local (when likelihood-preserv-

ing) and some are not. Common symmetries

are permutation ones, but there are also

scaling and translation ones.

When the reference parametric context is a

network N, it has been shown that mesoscale

symmetries imply the existence of localized

dynamical modes. These might be unstable no

matter what the structure of the network is, or

might be stable, suggesting that the symmetry

allows to nodes reduction without influencing

the overall dynamics32. In general, defining

symmetries is something inherent to nodes and

their permutations. Two nodes are permuted

when their links are rewired and thus a switch

occurs. Collecting all nodes into the adjacency

matrix A, in which Aij = 1 (if node i and node

j are adjacent) or 0 (otherwise), a symmetry is

present when a permutation P is applied to A,

leaving it unchanged, i.e. PA = A. This estab-

lishes a so-called automorphism, indicating

that nodes are topologically equivalent if

their permutation does not affect the network

structure. This is useful to collapse redundancy

71

PR

EC

ISIO

N T

OO

LS

into network quotients or skeletons33-35, in

which no structural repetition is present but

other structural network properties remain,

together with the inherent complexity. Note

that symmetries may also be combined, and

all the participating nodes form a symmetric

structure. Then, an orbit is present with a set

of structurally equivalent nodes, i.e. all vertices

in the same orbit have the exact same degree.

Formally, given a set of nodes in a network N

and a set of automorphisms A(N) (forming a

group), the orbit of a node n ϵ n(N) is the set

(see36):

O(n) = {λn ϵ n(N) : λ ϵ A(N)}

7as the associated probabilities (ratios

between AP cell nodes and network nodes).

The normalized entropy is then defined simply

as:

(SAP)* = SAP/log N

8Of a certain utility is also to know

that the eigenvalues of a symmetric structure

are decomposable into two types: redundant

and non-redundant eigenvalues. The former

correspond to the eigenvectors localized

on the symmetric structure, while the latter

refers to the eigenvectors whose eigenvalues

are the same in relation with the nodes of a

given orbit. This is relevant for considering the

network based on orbits instead of nodes, thus

eliminating redundancy. Another important

aspect is the consideration of a system at the

steady state versus a possible departure from it.

Such dynamics are regulated by the previously

addressed eigenvalues, regardless the network

structure.

With a probability measure inserted in the

network, a probability distribution P is defined

on its nodes, thus leaving with the network

entropy as a measure of the randomness degree

in the network:

SN(P) = min ∑i pi log pi

9and this implies the importance of the

stochasticity of symmetries (see fuzzy

symmetries38) based on network ensembles.

Furthermore, network communities, which are

cohesively connected sets of likely functionally

similar nodes, exert also an impact on another

important aspect, synchronization39. The

relationship between network topology and

synchronization are not completely clear yet.

The synchronization depends on the spec-

tral properties of networks, as the topology

is summarized by the Laplacian, a symmetric

matrix in general. The second eigenvalue of

this matrix may tend to 0, implying that a state

of synchronization cannot be reached by the

network. An influence comes also from the

highest of the eigenvalues. Besides determining

or not the property, no information is available

from these two eigenvalues on the relation-

ships with network topology, of interest here.

For instance, betweenness is defined as the

fraction of shortest paths between node pairs

that cross the given node, and indicates the

influence that the node exerts in terms of

information flow at network scale. It clearly

refers to synchronization capability. This latter

is also enhanced with high clustering in the

network, measuring the number of neighbors

of adjacent nodes, especially because the

higher is clustering value the closer are the

nodes. Quite intuitively, synchronization is

also enhanced by degree heterogeneity, as this

reduces the average distance, and it has also

been shown to be affected by the ratio be-

tween the highest and lowest degree nodes40.

5Therefore, an orbit is the set whose

nodes can be obtained from one another by

simple permutations in A(N), and a symmetric

network partitioning of nodes into orbits

establishes disjoint equivalence classes for

each node, i.e. an automorphism partition (AP) 37.

Of interest is the corresponding entropy,

a measure of the network structural

heterogeneity, such that for n=1,N:

SAP = - ∑n pn log pn

6Pn = |ni| / nN

72

There are cases in which the functional

similarity refers to symmetry. In such cases,

node similarity could not be due to the

community effect. A typical example is

provided by brain areas. Note that the role

of symmetries with reference to network

synchronization is investigated in41, while the

computational methods to break such

symmetries (isolated desynchronization)

are discussed in42. The construction of

functional networks depends on the relation-

ships between their coupling components,

which makes synchronization motifs central

features. Functional networks are in general

heterogeneous, thus non-symmetrical

structures. This reflects the fact that disruption

of the couplings generates symmetry-breaking

in the network, and also loosens the inherent

synchronization motifs distribution. However,

if the couplings are able to sustain synchroni-

zation, then symmetry will be characterizing

the functional network43.

2.4. Controllability: Relevance for Cancer

In general, networks evolve through a series

of transitions between states, but also by

changing structure. The typical hierarchical

structure presents clear directionality, while

in the symmetric structure more fluctuations

can be observed, inducing alternating

order-disorder transitions. In cancer, different

sets of control parameters may be present

to characterize the cancer type and to drive

the transition from normal to cancer state.

Using signaling as a reference, a human cancer

signaling network allowed the identification of

driver nodes and classification of their role in

the structural controllability of the network.

The underlying hypothesis is that nodes that

are critical for achieving centralized control

are therapeutically important. In particular,

anti-cancer drugs primarily act through such

genes that serve as their targets, thus revealing

utility for analysis of disease networks with

therapeutic implications.

Cancer forms a robust system that maintains

stable functioning (cell sustenance and

proliferation) despite perturbations. Cancer

progression occurs stage-wise over time

with increasing aggressiveness and worsening

prognosis. The characterization of these

stages and identification of the genes driving

the transitions are critical steps for an

understanding of cancer progression and the

development of effective anti-cancer therapies.

Valuable insights into cancer progression

was recently revealed by a few factors:

l The presence of interactions involved in core

cell-cycle and DNA-damage repair pathways

that are significantly rewired in tumors,

indicating significant impact to key

genome-stabilizing mechanisms;

l Several flipped genes that are serine/

threonine kinases which act as biological

switches, reflecting cellular switching

mechanisms between stages;

l Different sets of genes flipped during the

initial and final stages, indicating a

progressive pattern.

Based on these results, the robustness of

cancer can be derived from exchange

mechanism between genes at different stages

and from different biological processes and/

or cellular components. These are all involved

in progression stages and thereby allow

cancerous cells to evade targeted therapy,

therefore suggesting that a possible effective

therapy should target a “cover set” of these

genes. The control problem can in general be

concentrated on the search for a master

regulator of perturbations affecting target

bioentities. The principle of structural

controllability can thus be used to identify

a minimal set of driver nodes that exert

control the network states.

Since nonlinearity induces dynamics hard to

control, the attractor principle and associated

landscape analysis can be also be usefully

considered for differentiating between normal

and cancer states and optimize possible control

strategies44-47. Co-existence of multiple

attractors allows stepping to an attractor

network, or equivalently a coarse representa-

tion of the system phase space, would then be

possible by identifying all possible attractors

in the system, and once nodes are assigned to

them the gain for control purposes appears

from the fact that driving the network to a

desired state in a finite time would be more

efficiently done48. A control kernel would be

identified as the minimal set of nodes that

once regulated would drive the network from

any initial state to any target state49. Clearly

enough, such kernel inspires the possibility

of identifying its components with cellular

phenotypes, and possibly with drug targets,

and such kernel structure leverage the inherent

property of state coherency (distribution over

the attractors).

Because noise is always a possible inducer of

transitions between states and also network

components, a complementary aspects of

such developments refers to the role that

compensatory perturbations used to direct

the network to a desired state, by taking

advantage of its basin of attraction and thus

exerting a so-called network reprogramming50.

This concept refers to seminal work on survival

signaling in lymphocyte leukemia51,52 and on

cancer subtypes53.

73

PR

EC

ISIO

N T

OO

LS

74

2.5 Controllability: Working Definitions

Controllability, namely the ability of a

dynamical system to step between states, say

from an initial to a final state in finite time,

can be verified quite straightforwardly in

linear time-invariant systems by the so-called

Kalman’s rank condition54. In particular, given

a canonical system:

dz = Az + Bv

10 with state vector z, and A

and B as state and control parameter matrices,

respectively, the condition to obey is:

rank{B,AB,….., AP-1 B} = P

11Naturally enough, things

complicate when the system’s parameters are

unknown. Furthermore, a common strategy

to establish network controllability turns to

the identification of a minimum set of driver

nodes. This implies that controlling the set is

equivalent to controlling the entire system55-57.

Referring back to symmetry, it is of interest

to consider the impact of individual rather

than global dynamics, departing therefore

from the analysis of network topology and

instead considering the diversity of individual

dynamics. This condition is helpful particularly

with real-world networks, as it can be

expected that some assumptions, for instance

the independence of parameters, must be

relaxed. Notably, a global symmetry aspect

seems to emerge, one at which the highest

controllability or lowest number of key nodes

is achieved. Thus, the specific intrinsic

dynamics become less relevant for controlla-

bility purposes than the densities with which

they present themselves58.

This seems coherent with what anticipated

before, namely that the real networks

complexity may be reduced to a context

considering node-specific hidden variables

that once transformed, may reveal latent

symmetries. If at each network N is assigned

a probability P(N), then statistical ensembles

of graphs can be designed by considering fam-

ilies of such stochastic networks59. These will

be stochastically symmetric under a transfor-

mation if each member network has the same

property under the same transformation. An

entropy optimization problem is then present-

ed when searching for the maximum entropy

probability. This problem usually implicates the

existence of a topologically constrained net-

work, particularly when the ensemble network

functions as a null model.

2.6 Applications

A series of examples on cancer systems is

now presented. References are recent

studies in which metastatic cancers have

been analyzed through various methodologies,

and differentially expressed gene profiles have

been obtained.

Use Case 1: Metastatic Breast Cancer

We built networks in Figure 1 and in Figure

2 with data from the study in60 on 123 paired

primary and metastatic tissues from breast

cancer patients. Here, 47 genes (16 significantly

over-expressed and 31 significantly under-

expressed) were found differentially expressed

in metastatic ones. The whole protein-protein

interaction network (source STRING db,

confidence level = 0.7) is shown for the

over-expressed protein coding genes (counting

for 375 nodes and 476 edges) and for the

under-expressed protein coding genes

(counting for 772 nodes and 1113 edges). As

these two networks are big, their minimum

connected configuration is also computed (28

nodes and 52 edges for over-expressed protein

coding genes, and 53 nodes and 141 edges for

under-expressed protein coding genes).

The modules from the reduced networks are

also computed using the algorithm WalkTrap 61

(Label Propagation62 and InfoMap63

were applied too). The second type of

constraint we examine for these data, is gene

regulation according to transcription factors

(TF) (we use ENCODE Chip-seq data as the

source db). The whole network we found for

the overexpressed genes accounts for 217

nodes and 413 edges (reduced to 51 nodes

and 136 edges when the minimum connected

network is considered), and is visibly dense in

communities (each of the employed algorithms

find several modules, while in the reduced

one only WalkTrap finds 6 modules). With

the under-expressed genes the whole TF-gene

network accounts for 313 nodes and 1208

edges, while the reduced one accounts for 73

nodes and 354 edges, and here the WalkTrap

algorithm finds 4 modules.

The metastatic BC data state space has been

thus sequentially constrained, first by the

sign of the gene differential expression, then

by both the retrieved interactions between

DEGs and the TF regulation over the target

genes. Modularity is an additional topological

constraint applied to both the gene network

and the TF-gene network. Modularity remains

present also after reducing the original

network to a minimally connected one.

Therefore, this fraction of the metastatic

profile is highly structured and organized in

modules which imply specific functional

activity. This holds for both over-expressed

and under-expressed genes in the metastatic

BC profile, with a richer modularity map found

for the under-expressed fraction of the gene

profile, possibly induced by the bigger size

more than by inherent structure.

A claim in the original paper on BC subtyping

was that large absolute expression changes

at the RNA level between primary and

76

metastatic disease were not identified, and

indeed this fact weakens the constraints that

can be applied to the state space from gene

profiling. A certain pivotal role (top overex-

pressed gene) was assigned to FGFR4 or

Fibroblast Growth Factor Receptor 4

(well-known player in both ovarian and BC

tumorigenesis, and its overexpression in BC is

characteristic of the HER2-E intrinsic subtype),

and in Figure 1 it appears that it plays a role of

modular driver at whole network scale, but

less at reduced network scale (not in the blue

group). Therefore FGFR4 exerts an influence

but this is specific to a subtype, and therefore

explains why it is not present when the

minimally connected configuration is used.

Being the subtype a specific state of cells, a

lower entropy would be expected due to a

decreased number of degrees of freedom

in the system. Being metastatic both the

configurations we displayed, this local entropy

decrease could also represent a more general

signature.

Overall, the presence of constraints in a

network tends to naturally decrease the value

of its entropies. The topology of the network

establishes several types of constraints. It is

surely more complicated to interpret the

entropy changes in light of the context in

which they occur, say cancer. As a general rule,

higher entropies are associated with increased

network plasticity, as when a re-modulation

takes place, this might be observed from

DEG profiles or activated pathways or wired

modules. It was reported that over-expressed

gene profiles show reduction of entropy

compared to under-expressed gene profiles,

and according to the fluctuation theorem the

latter could indicate superior robustness, for

instance by adapting to the selective pressure

of the tumor microenvironment. Conversely,

lower entropies are typically induced by

constraints, and these limit the state activities

and therefore, at systems scale, the degrees of

freedom.

Figure 1. PIN from 16 metastatic over-expressed genes in BC. Top panel: Left, whole network; Right, modular map

of minimum connected sub-network. Bottom panel: left, whole TF-gene network, Right, modular map of minimum

connected sub-network.

Figure 2. PIN from 31 metastatic under-expressed genes in BC. Top panel: Left, whole network; Right, modular

map of minimum connected sub-network. Bottom panel: left, whole TF-gene network, Right, modular map of

minimum connected sub-network.

77

PR

EC

ISIO

N T

OO

LS

Use Case 2 – Metastatic Stomach Cancer

In a proposed study with 21 metastatic

samples and 11 non-metastatic samples of stom-

ach adenocarcinoma, 2979 (of which

1251 overexpressed, 1728 under-expressed,

and 2593 specific) DEGs were obtained in the

first group and 491 in the second group64. Other

386 DEGs appeared in both groups, therefore

showed similar expression patterns. We took

the top-50 DEGs in this group and built a

network from them (Figure 3, top panel). We

can see that a quite organized and modular

structure appears, even after suitable reduction

(42 nodes and 58 edges).

When we consider specific metastatic

sub-groups (top-ten overexpressed and

under-expressed DEGs), we see that the

small induced networks are not modular,

but centered on only two hubs, HSPD1

(over-expressed) and ESRRG (under-

expressed). Therefore, the topological

constraint of modularity is not due to

metastatic characteristics, and very likely a

substantial degree of entropy in the system

remains undirected to any specific phenotype.

The subtypes that are well-known and usually

investigated are mainly a few, say 4, thus a

relatively low number compared with other

cancers. HSPD1 and ESRRG are considered

promising metastasis biomarkers for this

cancer, the former explaining how the cancerous

cells escape from apoptosis, the latter is one of

the 5 gastric cancer signature genes

Use Case 3 – Melanoma state transitions

In a recently published paper on melanoma 65, the metastatic process is investigated by

the analysis of transcriptional reprogramming

occurring during transition from proliferative

to invasive states. Many genomic regulatory

regions were revealed for the melanoma states

from joint transcriptome and methylome

profiling, and master regulators were identified

in SOX10/MITF (SOX-10 is a marker for

melanocytic differentiation, and its dysfunction

Figure 3. PIN from 21 metastatic and 11 non-metastatic stomach adenocarcinoma genes. Top panel: whole (left)

and reduced networks induced by mixed gene group; Bottom panel: over-expressed (left) and under-expressed

hubsFigure 3. PIN from 21 metastatic and 11 non-metastatic stomach adenocarcinoma genes. Top panel: whole (left)

and reduced networks induced by mixed gene group; Bottom panel: over-expressed (left) and under-expressed hubs.

Figure 4. PIN from proliferative DEG signature in melanoma. Top: whole network. Bottom: reduced networks,

with top150 over-expressed values (left) and the rest (right).

GeneticProfiling

Disease ManagementProtocol

Patient Informatio

78

impairs MITF, the Microphthalmia transcription

factor expression, as well as melanocytic

development and survival), and AP-1/TEAD

(together, they critically regulate transcriptional

and functional mechanisms in cancer cells,

particularly invasion and resistance66).

We present in Figure 4 and in Figure 5 the

protein-protein interactions networks obtained

from two signatures, proliferative (over-

expressed) and invasive (under-expressed)

genes, together with their reductions when

only the top-150 over-expressed (proliferative

DEGs) and top-150 under-expressed (invasive

DEGs) are taken into consideration. It is

interesting to note that while the two global

signature networks are densely built around

key hubs (PDGFRB, ITGB1, EGFR, NFKBIA,

JUN in invasive, and CDK2, STAT5A, TP53,

MYC, BCL2, PTEN, MAP3K1, KDR, KIT in

proliferative), lots of communities appear to

be part of their structures (data not shown).

We thus considered the top-150 DEGs in

both signatures, and the rest of the DEGs,

and plotted their networks. In both signatures,

particularly in the proliferative one, the

contribution to modularity of the top DEGs

is strong compared to the rest of the DEGs,

indicating that the strong differentially

expressed values are driving the interaction

dynamics and establishing through modularity

a more constrained network configuration

with invasive and especially proliferative genes.

Since the transition between the two states,

from proliferative to invasive cells, is a sign of

disease progression and metastasis, it might be

justified to a certain extent that communities

appear characterized as shown.

Figure 5. PIN from invasive DEG signature in melanoma. Top: whole network. Bottom: reduced networks, with

top 150 down-expressed values (left) and the rest (right).

80

Further evidence is obtained when other sets

of DEGs are considered. For instance, following

TEAD knockdown experiments, we notice in

Figure 6 that the under-expressed DEGs are

densely bulky while the over-expressed DEGs

are sparsely modular, while the set of the TEAD

target DEGs with log2fc > |1| expression is highly

modular, indicating a strong influence on

modularity induced by TEAD through its target

genes, i.e. a subset of the previous two DEG

sets, and therefore a prevalent sparsification of

this feature (and targets neutralization as well)

after knockdown of the regulator.

Figure 6. PIN from TEAD knockdown experiments. Top-left under-expressed DEGs; Top-right: over-expressed

DEGs.. Bottom: TEAD target DEGs.

3. Discussion

Due to the importance of state transitions in

cancer networks, many dynamic properties

that are currently analyzed through entropy,

symmetry and controllability can become

interpretable from a biological and hopefully,

in the future, clinical standpoint. Among the

most important network states exerting

influences in such regards, the attractor states

are stable states and can be considered

differentiated cell types, thus also phenotypes

such a subtypes or tumor cell characteristics.

We know that cell state transitions have

associated patterns of gene expression, and

from such patterns we can investigate the

network configurations associated to them

A system likely spends most of the time within

the basins of attraction, and relatively little

time moving between them. This depends

on the fact that stationarity tends to recur

and non-stationarity transiently breaks it.

Underlying both states there are differenti-

ation processes whose stochastic dynamics

can be tracked by monitoring the inter-state

trajectories covering different regions of the

state-space. The activation of the states is

the element inducing the observed changes,

therefore the measurable trajectories. The

signaling activity patterns usually measured by

gene profiles and by the related pathways once

cast within network representations, allow

more dimensions and more constraints to be

investigated through the topological properties.

The control of a large dynamical complex

network may be a very hard task in biological

networks67-70. It might be intuitively

efficient to control the state of subsets of

nodes instead of single nodes, but identifying

functional sets as markers or targets is a

highly context-specific problem that requires

complex algorithms rather than simplified

solution paths. The identification of

symmetries in a complex system can be

very important in order to decipher its

organizational principles and rules. The key

question is thus to understand the role of

symmetries in reconstructing or controlling

network dynamics.

Naturally enough, it is important to compute

an optimal network decomposition into

observable/controllable and unobservable/

uncontrollable sub-networks likewise into

symmetry-driven versus non-symmetry-driven

sub-networks to study how they synchronize

or desynchronize. The problems remain

much harder in nonlinear networks with

symmetries, due to the fact that observability

and controllability present more complicate

dependence relationships.

References

1. Lavi, O.. Redundancy: a critical obstacle to improving cancer therapy. Cancer Res., 2015; 75(5), 808-12. doi: 10.1158/0008-5472.CAN-14-3256.• Cereda, M, Mourikis, T.P, Ciccarelli, F.D. Genetic Redundancy, Functional Compensation, and Cancer Vulnerability Trends in Cancer, 2016, 2(4), 160–2. • Altrock, P.M., Liu, L.L Michor, F. The mathematics of cancer: integrating quantitative models. Nature Rev. Cancer, 2015, 15(12), 730-45. doi: 10.1038/nrc4029.• Deming, D.A., Leystra, A.A., Nettekoven, L., Sievers, C., Miller, D., Middlebrooks, M., Clipson, L., Albrecht, D., Bacher, J., Washington, M.K., et al. PIK3CA and APC mutations are synergistic in the development of intestinal cancers. Oncogene, 2014, 33(17), 2245-54. doi: 10.1038/onc.2013.167..• Gerlee, P., Altrock, P.M..Complexity and stability in growing cancer cell populations. Proc. Natl. Acad. Sci. U S A., 2015, 112(21), :E2742-3. doi: 10.1073/pnas.1505115112.• Chowdhury, S.A., Nibbe, R.K., Chance, M.R., Koyutürk, M. Subnetwork state functions define dysregulated subnetworks in cancer. J. Comput. Biol., 2011;18(3), 263-81. doi: 10.1089/cmb.2010.0269.• Chowdhury, S.A., Koyutürk, M.. Identification of coordi-nately dysregulated subnetworks in complex phenotypes. Pac. Symp. Biocomput. 2010,133-44.• Vineis, P., Chadeau-Hyam, M., Gmuender, H., Gulliver, J., Herceg, Z., Kleinjans, J., Kogevinas, M., Kyrtopoulos, S., Nieuwenhuijsen, M., Phillips, D.H., et al. EXPOsOMICS Consortium. The exposome in practice: Design of the EX-POsOMICS project. Int. J. Hyg. Environ. Health. 2017; 220(2 Pt A),142-151. doi: 10.1016/j.ijheh.2016.08.001. • Galas, D.J., Nykter, M., Carter, G.W., Price, N.D., Shmulevich, I. Biological information as set-based complexity. IEEE Trans. Inform. Th. 2010, 56(2), 667-77. • Galas, D.J., Sakhanenko, N.A., Skupin, A, Ignac, T. Describing the Complexity of Systems: Multivariable Set Complexity and the Information Basis of Systems Biology. J. Comput. Biol.. 2014 , 21(2), 118–140. doi: 10.1089/cmb.2013.0039.• Ignac, T.M. Sakhanenko, N.A., Galas, D.J. Relations between the set-complexity and the structure of graphs and their sub-graphs. EURASIP J. Bioinform. Syst. Biol., 2012, 2012(1), 13. doi: 10.1186/1687-4153-2012-13.• Krawitz, P., Shmulevich, H. Basin entropy in boleean network ensembles. Phys Rev Lett 2007; 98: 158701. • Daza, A., Wagemakers, A., Georgeot, B., Guéry-Odelin, D., Sanjuán, M.A.. Basin entropy: a new tool to analyze uncertainty in dynamical systems. Sci Rep. 2016, 6, 31416. doi: 10.1038/srep31416. • Ahnert, S.E., Fink, T.M.A., Form and function in gene regulatory networks: the structure of network motifs determines fundamental properties of their dynamical state space. J. R. Soc. Interface 2016, 13, 20160179; DOI: 10.1098/rsif.2016.0179. • Sevick, E.M., Prabhakar, R., Williams, S.R., Searles, D.J. Fluctuation Theorems. Annual Rev. Phys. Chem. 2008, 59(1), 603-633. • Teschendorff AE, Severini S. Increased entropy of signal transduction in the cancer metastasis phenotype. BMC Syst. Biol. 2010; 4:104. https://doi.org/10.1186/1752-0509-4-104. • West J, Bianconi G, Severini S, Teschendorff AE. Differential network entropy reveals cancer system hallmarks. Sci Rep. 2012; 2:802. doi: 10.1038/srep00802. • Robinson, D.R., Wu, Y.M., Lonigro, R.J., Vats, P., Cobain, E., Everett, J., Cao, X., Rabban, E., Kumar-Sinha, C, Ray-mond, V. et al. Integrative clinical genomics of metastatic cancer.. Nature. 2017, 548(7667),:297-303. doi: 10.1038/nature23306.• Pazarentzos, E., Bivona, T.G. Adaptive stress signaling in targeted cancer therapy resistance. Oncogene. 2015,34(45), 5599-606. doi: 10.1038/onc.2015.26. • AlQuraishi, M., Koytiger, G. Jenney, A. MacBeath, G., Sorger, P.K. A multiscale statistical mechanical frame-work integrates biophysical and genomic data to assemble cancer networks. Nat Genet., 2014, 46(12), 1363-1371. doi: 10.1038/ng.3138. • Gupta, P.B., Fillmore, C.M., Jiang, G., Shapira, S.D., Tao, K., Kuperwasser, C., Lander, E.S. Stochastic state transi-tions give rise to phenotypic equilibrium in populations of cancer cells. Cell. 2011; 146(4), 633-44. doi: 10.1016/j.cell.2011.07.026.

Buder, T., Deutsch, A., Seifert, M., Voss-Böhme, A. Cell-Trans: An R Package to Quantify Stochastic Cell State Transitions. Bioinform Biol Insights. 2017, 11, 1177932217712241. doi: 10.1177/1177932217712241.• Kwang‐Hyun, C., Jae Il, J., Dongkwan, S., Dongsan, K., Sang‐Min, P. The reverse control of irreversible biological processes. WIREs Syst Biol Med 2016, 8, 366-377. doi: 10.1002/wsbm.1346.• Banerji, C.R., Miranda-Saavedra, D., Severini, S., Wid-schwendter, M., Enver, T., Zhou, J.X., Teschendorff, A.E..Cellular network entropy as the energy potential in Waddington’s differentiation landscape. Sci Rep. 2013; 3, 3039. doi: 10.1038/srep03039.• Davies, P.C., Demetrius, L., Tuszynski, J.A. Cancer as a dynamical phase transition. Theor Biol Med Model. 2011; 8: 30. doi: 10.1186/1742-4682-8-30.• Shahriyari, L., Komarova, N.L. Symmetric vs. Asymmetric Stem Cell Divisions: An Adaptation against Cancer? PLOS One, 2013, 8(10). https://doi.org/10.1371/jour-nal.pone.0076195. • Bajaj, J., Zimdahl, B., Reya, T. Fearful symmetry: subversion of asymmetric division in cancer development and progression. Cancer Res. 2015; 75(5): 792-7. doi: 10.1158/0008-5472.CAN-14-2750. • Morrison SJ, Kimble J. Asymmetric and symmetric stem-cell divisions in development and cancer. Na-ture 2006; 441:1068-74; http://dx.doi.org/10.1038/na-ture04956• Meacham CE, Morrison SJ. Tumour heterogeneity and cancer cell plasticity. Nature 2013; 501:328-37; http://dx.doi.org/10.1038/nature12624• Izumi, H., Kaneko, Y. Symmetry breaking in human neuroblastoma cells. Mol Cell Oncol. 2014; 1(4): e968510.. doi: 10.4161/23723548.2014.968510• Pillai, A.S., Jirsa, V.K. Symmetry Breaking in Space-Time Hierarchies shapes Dynamics and Behavior. Neuron. 2017; 94(5):1010-1026. doi: 10.1016/j.neuron.2017.05.013.• Aufderheide, H, Rudolf, L, Gross, T. Mesoscale symme-tries explain dynamical equivalence of food webs. New J. Phys., 2012, 14, 105014.• Xiao, Y., MacArthur, B.D., Wang, .H, Xiong, M., Wang, W.. Network quotients: structural skeletons of complex systems. Phys. Rev. E, 2008, 78, 046102.• MacArthur, B.D., Sanchez-Garcia, R.J. Spectral characteris-tics of network redundancy. Phys. Rev. E, 2009, 80, 026117.• Wang, H., Yan, G., Xiao, Y. Symmetry in world trade network. J. Syst. Sci. Complex. 2009, 22, 280-90.• Xiao, Y.H., Wu, W.T., Wang, H., Xiong, M., Wang, W. Symmetry-based stricture entropy pof complex networks. Physica A, 2008, 387, 2611-19. • Garrido, A. Symmetry in Complex Networks. Symmetry 2011, 3, 1-15.• Garlaschelli, D., Ruzzenenti, F., Basosi, R. Complex networks and symmetry I: a review. Symmetry 2010, 2(3), 1683-1709; doi:10.3390/sym2031683.• Comellas, F., Gago, S. Synchronizability of complex networks. J Phys. A: Math. Theor. 2007, 40, 4483–4492 doi:10.1088/1751-8113/40/17/006.• Motter, A.E., Zhou, C., Kurths, J. Network synchroniza-tion, diffusion, and the paradox of heterogeneity. Phys. Rev. E 2005, 71(1 Pt 2): 016116. • Nicosia, V., Valencia, M., Chavez, M., Diaz-Guilera, A., Latora, V. Remote synchronization reveals network symmetries and functional modules. Phys. Rev. Lett. 2013, 110, 174102. Pecora, L.M., Sorrentino, F., Hagerstrom, A.M., Murphy, T.E., Roy, R. Cluster Synchronization and isolated desyn-chronization in complex networks with symmetries. Nat. Comm. 2014, 5, 4079. doi: 10.1038/ncomms5079. • Malagarriga, D., Villa, A.E.P., Garcia-Ojalvo, J., Pons, A.J. Consistency of heterogeneous synchronization patterns in complex weighted networks. Chaos, 2017, 27, 031102.• Cho, S.H., Park, S.M., Lee, H.S., Lee, H.Y., Cho, K.H. Attractor landscape analysis of colorectal tumorigenesis and its reversion. BMC Syst. Biol. 2016, 10(1), 96.• Taherian Fard, A., Ragan, M.A. Modeling the Attractor Landscape of Disease Progression: a Network-Based Approach. Front. Genet. 2017, 8, 48. doi: 10.3389/fgene.2017.00048.

4. Materials and Methods

The datasets used for the application

section and the described examples have been

retrieved from published sources reported

in each use case. Standard network software

(http://igraph.org/, http://www.cytoscape.

org/) was used to build the graphical evidences

from annotation sources such as

https://string-db.org.

Conflicts of Interest. The author declares

no conflicts of interest.

81

PR

EC

ISIO

N T

OO

LS

Choi, M., Shi, J., Jung, S.H., Chen, X., Cho, K.H. Attrac-tor landscape analysis reveals feedback loops in the p53 network that control the cellular response to DNA damage.Sci. Signal. 2012, 5(251), ra83. doi: 10.1126/scisignal.2003363.• Wang, P, Song, C., Zhang, H., Wu, Z., Tian, X.J., Xing, J. Epigenetic state network approach for describing cell phenotypic transitions. Interface Focus. 2014, 4(3), 20130068. doi: 10.1098/rsfs.2013.0068.• Wang, L.Z., Su, R.Q., Huang, Z.G., Wang, X., Wang, W.X., Grebogi, C., Lai, Y.C. A geometrical approach to control and controllability of nonlinear dynamical networks. Nat Commun. 2016, 7, 11323. doi: 10.1038/ncomms11323.• Kim, J. Park, S-M., Cho, K-H. Discovery of a kernel for controlling biomolecular regulatory networks. Sci. Rep. 2013; 3: 2223. doi: 10.1038/srep02223.• Cornelius, S.P., Kath,W.L. Motter, A.E. Realistic Control of Network Dynamics. Nat Commun. 2013; 4: 1942. doi: 10.1038/ncomms2939.• Zhang, R. Vinod Shah, M. Yang,J. Nyland,S.B. Liu,X. Yun, J.K., Albert, R. Loughran, T.P. Network model of survival signaling in large granular lymphocyte leukemia. Proc Natl Acad Sci U S A. 2008; 105(42): 16308–16313. doi: 10.1073/pnas.0806447105.• Saadatpour, A., Wang, R.S., Liao, A, Liu, X, Loughran, T.P., Albert, I, Albert, R.Dynamical and structural analysis of a T cell survival network identifies novel candidate therapeutic targets for large granular lymphocyte leukemia. PLoS Comput Biol. 2011; 7(11):e1002267. doi: 10.1371/journal.pcbi.1002267. • Maetschke, S.R., Ragan, M.A. Characterizing cancer subtypes as attractors of Hopfield networks. Bioinformat-ics. 2014, 30(9), 1273-9. doi: 10.1093/bioinformatics/btt773. • Kalman, R.E. Mathematical description of linear dynami-cal systems. J. Soc Indus. Appl Math Ser A 1963, 1, 152-192.• Luenberger, D.G. Introduction to Dynamic Systems: Theory, Models and Applications. J. Wiley & S., NY 1979.• Liu, Y.Y, Slotine, J.J., Barabasi, A.L. Controllability of complex networks. Nature 2011, 473, 167-173. • Whalen, A.J., Brennan, S.N., Sauer, T.D., Schift, S.J. Observability and controllability of nonlinear networks: the role of symmetries. Phys Rev X, 2015, 5, 011005. • Zhao, C., Wang, W.X., Liu, Y.Y., Slotine JJ. Intrinsic dynamics induce global symmetry in network controllability. Sci. Rep. 2015, 5, 8422. • Marras, E., Travaglione, A., Capobianco, E. Sub-modular resolution analysis by network mixture models. Stat. Appl. Genet. Mol. Biol. 2010, 9, Art19 (2010) doi: 10.2202/1544-6115.1523. • Cejalvo, J.M., Martínez de Dueñas, E., Galván, P., García-Recio, S., Burgués Gasión, O., Paré, L., Antolín, S., Martinello, R., Blancas, I., Adamo, B., et al. Intrinsic Subtypes and Gene Expression Profiles in Primary and Metastatic Breast Cancer. Cancer Res. 2017, 77(9), 2213-2221. doi: 10.1158/0008-5472.CAN-16-2717. • Pons, P. Latapy, M. Computing communities in large networks using random walks. ISCIS’05 Proc. of 20th Int Conf Comp Inf Sci, 284-93. • Raghavan, U.N. Albert, R. Kumara, S. Near linear time algorithm to detect community structures in large-scale networks, Phys. Rev. E 2007, 76, 036106.• Rosvall, M., Bergstrom, C.T. Maps of random walks on complex networks reveal community structure. Proc. Nat. Ac. Sci. U.S.A. 2008, 105(4), 1118–1123. doi: 10.1073/pnas.0706851105.• Liang, L., Ye, L., Cui, D., Yang, D. Gene expression signature associated with metastasis of stomach adenocar-cinoma. Int. J. Clin. Exp. Med. 2017, 10(2), 3016-26. • Verfaillie,A., Imrichova, H., Atak,Z.K., Dewaele,M., Rambow,F. Hulselmans, G., Christiaens,V. Svetlichnyy, D. Luciani, F. Van den Mooter, L. et al. Decoding the regulatory landscape of melanoma reveals TEADS as regulators of the invasive cell state. Nat. Commun. 2015, 6, 6683. doi:10.1038/ncomms7683• Liu,X., Li,H. Rajurkar,M. Li, Q.Cotton, J.L, Ou,J. Zhu, L.J., Goel, H.L. Mercurio, A.M., Park J-S., Davis,R.J., Mao, J. Tead and AP1 coordinate transcription and motility. Cell Rep. 2016, 14(5), 1169–1180.doi:10.1016/j.celrep.2015.12.104

• Asgari, Y,, Salehzadeh-Yazdi, A., Schreiber, F., Masou-di-Nejad, A. Controllability in Cancer Metabolic Networks According to Drug Targets as Driver Nodes.. PLoS One. 2013, 8(11), e79397. doi: 10.1371/journal.pone.0079397.• Srihari, S., Raman, V., Leong, H.W., Ragan, M.A. Evolution and Controllability of Cancer Networks: A Boolean Perspective. IEEE/ACM Trans Comput Biol Bioinform. 2014;11(1), 83-94. doi: 10.1109/TCBB.2013.128.• Vinayagam, A., Gibson, T.E., Lee, H.J., Yilmazel, B., Roesel, C., Hu, Y., Kwon, Y., Sharma. A., Liu, Y.Y., Perrimon, N., Barabási, A.L. Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets. Proc. Natl. Acad. Sci. U S A. 2016, 113(18), 4976-81. doi: 10.1073/pnas.1603992113. • Sharma, A., Cinti, C., Capobianco, E.. Multitype Network-Guided Target Controllability in Phenotypically Characterized Osteosarcoma: Role of Tumor Microen-vironment. Front. Immunol., 2017; 8, 918. DOI=10.3389/fimmu.2017.00918.

Enrico Capobianco, core expertise is in quantitative

methods and computational modeling for conducting

inference on complex data.

In particular, network science and stochastic methods

from statistics and machine learning have been

instrumental to his wide spectrum work in biomedicine.

While participating in worldwide academic and scientific

activities at several international institutes, Enrico

collaborates with major research agencies, and organizes

scientific programs especially designed to advance

Systems and Precision Medicine, and Big Data in Health.

Enrico holds a Doctorate in Statistical Sciences from the

University of Padua (IT) and after conducting postdoc

research in computational fields at Stanford University

(1994-98), he received in 1999 a NATO-CNR grant in

Denmark at the Niels Bohr Institute and at the Danish

Technical University, before becoming in 2001-02 an

ERCIM (European Research Consortium for Informatics

and Mathematics) fellow at the Center for Mathematics

and Computer Science in Amsterdam, the Netherlands.

After returning to the USA, he worked as a Staff Scientist

at the Mathematical Sciences Research Institute in

Berkeley (2003), and as a Senior Scientist at Boston

University, Biomedical Engineering (2004-05). In 2005,

Enrico was appointed Head of Methods at Serono,

Evry, France, and in 2006 he joined the Center for

Advanced Studies, Research and Development in

Sardinia at the Polaris Science and Technology Park,

leading a quantitative systems biology group till 2011.

Enrico received a Professorship from the Chinese

Academy of Sciences, Shanghai University, in 2011,

was a Visiting Professor at the Fiocruz Foundation in

Brazil (2008-10), and was Visiting Scientist at the

Institut des Hautes Études Scientifiques in France (2011).

Enrico leads since 2011 the Computational Biology and

Bioinformatics program at the Center of Computational

Science, University of Miami, FL (USA). He has been

associate scientist with the National Research Institute

in Italy, at the Institute of Clinical Physiology in Pisa,

where he founded LISM, the Laboratory of Integrative

Systems Medicine (2012–15) – and where he has

coordinated Big Data in Health activities, in Milan,

Siena and Pisa (2015-17).

82

cancer metastatic processes contextualized by networks ...€¦ · cancer metastatic processes...

Documents