testing models of mrna localization reveals robustness ... › content › 10.1101 ›...

14
Biophysical Journal Volume: 00 Month Year 1–14 1 Testing models of mRNA localization reveals robustness regulated by reducing transport between cells J. U. Harrison, R. M. Parton, I. Davis and R. E. Baker Abstract Robust control of gene expression in both space and time is of central importance in the regulation of cellular processes, and for multicellular development. However, the mechanisms by which robustness is achieved are generally not identified or well understood. For example, mRNA localization by molecular-motor-driven transport is crucial for cell polarization in numerous contexts, but the regulatory mechanisms that enable this process to take place in the face of noise or significant perturbations are not fully understood. Here we use a combined experimental-theoretical approach to characterize the robust- ness of gurken/TGF-alpha mRNA localization in Drosophila egg chambers, where the oocyte and 15 surrounding nurse cells are connected in a stereotypic network via intracellular bridges known as ring canals. We construct a mathematical model that encodes simplified descriptions of the range of steps involved in mRNA localization, including production and trans- port between and within cells until the final destination in the oocyte. Using Bayesian inference, we calibrate this model using quantitative single molecule fluorescence in situ hybridization data. By analyzing both the steady state and dynamic behaviours of the model, we provide estimates for the rates of different steps of the localization process, as well as the extent of directional bias in transport through the ring canals. The model predicts that mRNA synthesis and transport must be tightly balanced to maintain robustness, a prediction which we tested experimentally using an over-expression mutant. Surprisingly, the over-expression mutant fails to display the anticipated degree of overaccumulation of mRNA in the oocyte predicted by the model. Through careful model-based analysis of quantitative data from the over-expression mutant we show evidence of saturation of transport of mRNA through ring canals. We conclude that this saturation engenders robustness of the localization process, in the face of significant variation in the levels of mRNA synthesis. Statement of significance 1 For development to function correctly and reliably across a 2 population, gene expression must be controlled robustly in 3 a repeatable manner. How this robustness is achieved is not 4 well understood. We use modelling to better study the local- 5 ization of polarity determining transcripts (RNA) in fruit 6 fly development. By calibrating our model with quantitative 7 imaging data we are able to make experimentally testable 8 predictions. Comparison of these predictions with data from 9 a genetic mutant reveals evidence that saturation of RNA 10 transport contributes to the robustness of RNA localization. 11 1 Introduction 12 Biological systems are constantly subjected to noise from 13 both internal and external sources, and also to random muta- 14 tions. It is crucial that key driving mechanisms are able to 15 buffer against such insults, thereby enabling organisms to 16 display phenotypes that are robust to perturbation. Local- 17 ization of mRNA is a fundamental, and crucially important, 18 mechanism that enables the polarization of cells such as 19 oocytes and early embryos [1–3]. For example, the axes of 1 Drosophila are established through the regulation of gradi- 2 ents in bicoid, gurken, oskar and nanos mRNA [4]. In fact, 3 Drosophila and other species rely heavily on the asymmetric 4 localization of mRNAs to coordinate early development pro- 5 cesses both spatially and temporally. Further, mRNA local- 6 ization has been observed in a variety of species and cell 7 types, including Drosophila and Xenopus oocytes, neurons, 8 chicken fibroblasts, yeast and bacteria [5–9], demonstrating 9 that the process is ubiquitous and not limited to large cells. 10 However, despite the widespread nature of mRNA localiza- 11 tion, there is still much that we do not understand, in partic- 12 ular what ensures robustness. To this end, here we combine 13 mathematical modelling with quantitative experimental data 14 to investigate the regulatory mechanisms controlling mRNA 15 localization, capturing, in particular, the roles of transport 16 and production. 17 We concentrate on early development of the model 18 organism Drosophila, where maternal mRNA prepatterns 19 the oocyte [1–3]. Maternal mRNA is produced in the 15 20 nurse cells neighbouring the oocyte [10]; it is then packaged 21 into complexes with various proteins [11] and transported 22 © 2019 The Authors . CC-BY 4.0 International license available under a not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprint (which was this version posted May 29, 2019. ; https://doi.org/10.1101/533133 doi: bioRxiv preprint

Upload: others

Post on 26-Jan-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

  • Biophysical Journal Volume: 00 Month Year 1–14 1

    Testing models of mRNA localization reveals robustness regulated byreducing transport between cells

    J. U. Harrison, R. M. Parton, I. Davis and R. E. Baker

    Abstract

    Robust control of gene expression in both space and time is of central importance in the regulation of cellular processes,and for multicellular development. However, the mechanisms by which robustness is achieved are generally not identifiedor well understood. For example, mRNA localization by molecular-motor-driven transport is crucial for cell polarization innumerous contexts, but the regulatory mechanisms that enable this process to take place in the face of noise or significantperturbations are not fully understood. Here we use a combined experimental-theoretical approach to characterize the robust-ness of gurken/TGF-alpha mRNA localization in Drosophila egg chambers, where the oocyte and 15 surrounding nurse cellsare connected in a stereotypic network via intracellular bridges known as ring canals. We construct a mathematical modelthat encodes simplified descriptions of the range of steps involved in mRNA localization, including production and trans-port between and within cells until the final destination in the oocyte. Using Bayesian inference, we calibrate this modelusing quantitative single molecule fluorescence in situ hybridization data. By analyzing both the steady state and dynamicbehaviours of the model, we provide estimates for the rates of different steps of the localization process, as well as the extentof directional bias in transport through the ring canals. The model predicts that mRNA synthesis and transport must be tightlybalanced to maintain robustness, a prediction which we tested experimentally using an over-expression mutant. Surprisingly,the over-expression mutant fails to display the anticipated degree of overaccumulation of mRNA in the oocyte predicted bythe model. Through careful model-based analysis of quantitative data from the over-expression mutant we show evidence ofsaturation of transport of mRNA through ring canals. We conclude that this saturation engenders robustness of the localizationprocess, in the face of significant variation in the levels of mRNA synthesis.

    Statement of significance1

    For development to function correctly and reliably across a2population, gene expression must be controlled robustly in3a repeatable manner. How this robustness is achieved is not4well understood. We use modelling to better study the local-5ization of polarity determining transcripts (RNA) in fruit6fly development. By calibrating our model with quantitative7imaging data we are able to make experimentally testable8predictions. Comparison of these predictions with data from9a genetic mutant reveals evidence that saturation of RNA10transport contributes to the robustness of RNA localization.11

    1 Introduction12

    Biological systems are constantly subjected to noise from13both internal and external sources, and also to random muta-14tions. It is crucial that key driving mechanisms are able to15buffer against such insults, thereby enabling organisms to16display phenotypes that are robust to perturbation. Local-17ization of mRNA is a fundamental, and crucially important,18mechanism that enables the polarization of cells such as19

    oocytes and early embryos [1–3]. For example, the axes of 1Drosophila are established through the regulation of gradi- 2ents in bicoid, gurken, oskar and nanos mRNA [4]. In fact, 3Drosophila and other species rely heavily on the asymmetric 4localization of mRNAs to coordinate early development pro- 5cesses both spatially and temporally. Further, mRNA local- 6ization has been observed in a variety of species and cell 7types, including Drosophila and Xenopus oocytes, neurons, 8chicken fibroblasts, yeast and bacteria [5–9], demonstrating 9that the process is ubiquitous and not limited to large cells. 10However, despite the widespread nature of mRNA localiza- 11tion, there is still much that we do not understand, in partic- 12ular what ensures robustness. To this end, here we combine 13mathematical modelling with quantitative experimental data 14to investigate the regulatory mechanisms controlling mRNA 15localization, capturing, in particular, the roles of transport 16and production. 17

    We concentrate on early development of the model 18organism Drosophila, where maternal mRNA prepatterns 19the oocyte [1–3]. Maternal mRNA is produced in the 15 20nurse cells neighbouring the oocyte [10]; it is then packaged 21into complexes with various proteins [11] and transported 22

    © 2019 The Authors

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 2 1 INTRODUCTION

    through intracellular bridges, known as ring canals, into the1oocyte [12]. Ring canals are formed by incomplete cell divi-2sions [13], and result in the 15 nurse cells and the oocyte3being connected in a characteristic pattern [14]. A schematic4of the early Drosophila egg chamber is shown in Figure 1,5together with a microscopy image showing the ring canals6(highlighted with an actin (phalloidin Alexa488) marker),7and a diagram illustrating the characteristic connections8between nurse cells.9

    (a)

    (b) (c)

    14 μm

    Phalloidin Alexa488 (actin)grk smFISH

    Figure 1: Stage 6 Drosophila egg chamber. (a) A microscopyimage of a fixed egg chamber labelled in green withpahlloidin-alexa488 (actin) and in magenta with grk smFISHprobes (stellaris, Cal Fluor 590). Localization of gurkenRNA is observed at the posterior of the oocyte. Ring canalsproviding connections between nurse cells and to the oocytecan be seen highlighted by the actin marker. (b) Schematicof the early egg chamber. (c) Diagram of the connectionsbetween nurse cells and the ooctye via ring canals. Theoocyte (red) is labeled 1 and the nurse cells are labeled 2to 16.

    In this work, we develop a coarse-grained ordinary dif-10ferential equation (ODE) model of gurken (gurken) transport11through the Drosophila egg chamber, including all 15 nurse12cells and the developing oocyte, with terms representing both13

    the production and transport of mRNA 1. Using Bayesian 1inference methods, we fit this simple model to quantitative 2imaging data obtained using smFISH (single molecule flu- 3orescence in situ hybridization) [15, 16]. Combining quan- 4titative modelling approaches with experimental data in this 5way allows us to explore underlying biological mechanisms 6through the generation of testable model predictions with 7appropriate quantification of uncertainties. In particular, it 8enables estimation of the mRNA production and transport 9rates in the model, and predicts that there is a tight reg- 10ulatory balance between production and transport for the 11localization of gurken in the Drosophila egg chamber. 12

    In the process of fitting the model to microscopy imag- 13ing data, we examine the formation of higher-order RNA- 14protein complexes in the oocyte. In the nurse cells, mRNA 15is assembled into complexes containing both mRNA and 16various proteins, and these RNA complexes are then trans- 17ported into the ooctye. An important aspect of the trans- 18port process is that the RNA complexes are remodelled (so 19that more mRNA transcripts are contained in each complex) 20upon transit through the ring canals that connect the nurse 21cells to the oocyte [12, 17], leading to larger complexes in 22the oocyte [18]. We use quantitative data analysis to provide 23an estimate of the extent of this assembly of higher-order 24complexes for gurken. 25

    In addition, we explore the question of whether trans- 26port of RNA complexes occurs unidirectionally or bidirec- 27tionally through ring canals in the Drosophila egg chamber. 28Some evidence that transport through ring canals is unidi- 29rectional has previously been provided [17]. However, since 30ring canals are small relative to the nurse cells, the passage 31of complexes through them is relatively difficult to observe 32in vivo. Our study offers further evidence in support of 33the hypothesis that transport through ring canals is strongly 34biased towards the oocyte, and provides quantification of this 35process within a model-based framework. 36

    Finally, we use the coarse-grained ODE model to 37make testable predictions about the behaviour of an over- 38expression mutant with increased production of mRNA. We 39demonstrate strong agreement between the model predic- 40tions and observed data for nurse cells close to the oocyte, 41but find discrepancies for nurse cells far from the oocyte. 42Surprisingly, we find the numbers of complexes localized 43in the oocyte of the over-expression mutant are very simi- 44lar to wild type, whereas the model predicts numbers should 45increase significantly. To probe the reasons for this dis- 46parity, we consider a suite of extended models incorporat- 47ing inhomogeneous production, density dependent transport 48and crowding-induced blocking of transport through ring 49canals. We show, via statistical techniques that allow quan- 50titative model comparison, that the crowding-induced block- 51ing mechanism is best supported by the data, and that a 52

    1In referring to production, we include mRNA transcription, exportthrough the nuclear pore complex, and assembly of RNA-protein com-plexes.

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 2.2 Single molecule FISH 3

    model incorporating blocking is capable of producing distri-1butions of RNA complexes similar to those observed experi-2mentally. In strong support of this model-based prediction,3we observe accumulation of complexes at ring canals in4smFISH microscopy data for the over-expression mutant,5consistent with this crowding-induced blocking mechanism.6

    Despite their increasing use across the biologi-7cal sciences, mathematical and computational modelling8approaches have not to-date been widely adopted in the study9of mRNA localization. Trong et al. [19] have demonstrated10how localization of mRNA can be achieved in a Drosophila11oocyte using a partial differential equation (PDE) description12of mRNA dynamics and a stochastic model of cytoskele-13ton dynamics. Ciocanel et al. [20] used a PDE model of14RNA localization in Xenopus oocytes to quantify aspects15of the transport process using FRAP (fluoresence recov-16ery after photobleaching) data, and others have considered17models of RNA transport to examine the behaviour of indi-18vidual molecular motors [21, 22], or the cytoskeleton struc-19ture [23]. However, these works have focused on dynam-20ics within a single cell, the oocyte, as has other work in21model systems such as mating budding yeast and the neu-22ronal growth cone [24, 25]. Recent work by Alsous et al. [26]23has quantified collective growth and size control dynam-24ics in Drosophila nurse cells. Similarly, our work considers25dynamic behaviours in multiple cells. However, our focus is26to examine the regulation and robustness of mRNA local-27ization by considering a coarse-grained ODE model of the28entire Drosophila egg chamber, and using it to interro-29gate quantitative experimental data. Further, our work high-30lights that by combining an incredibly simple mathematical31model with quantitative experimental data one can draw rel-32evant conclusions and make testable predictions about key33biological processes that would not, otherwise, have been34possible.35

    2 Methods and materials36

    2.1 Fly strains and tissue preparation37

    Stocks were raised on standard cornmeal-agar medium38at 25◦C. The wild type was Oregon R (OrR). Over39expression of gurken RNA was obtained by crossing40UASp grk3A (based upon genomic sequence DS02110,41which includes the full 3’ and 5’ UTRs [27]) to mater-42nal tubulin driver TubulinGal4-VP16. Tagging of gurken43RNA for live imaging was achieved using the MS2-44MCP(FP) system: gurken-(MS2)12 [28], P[nos-NLS-MCP-45mCherry] [29]; P[w+nanos-5’-NLS-MCP-Dendra2 K10-3’]46(Rippei Hayashi). GFP protein trap lines: Moesin::GFP [30];47Tau-GFP 65/167 (D. St Johnston, University of Cambridge).48Oocytes were isolated for live imaging as described in Parton49et al. [31], and for fixation and single molecule fluorescence50in situ hybridization (smFISH), as described in Davidson51et al. [32].52

    2.2 Single molecule FISH 1

    smFISH detection of gurken RNA was performed as 2described in Davidson et al. [32]. Stellaris Olignoucleotide 3probes 20nt in length complementary to the gurken tran- 4script (CG17610; 48 probes) conjugated to CAL Fluor Red 5590 were obtained from Biosearch technologies. Labelled 6oocytes were mounted on slides in 70% Vectashield (Vector 7laboratories). 8

    2.3 Fixed-cell imaging 9

    Confocal imaging of fixed Drosophila oocytes was per- 10formed using an inverted Olympus FV3000 six laser line 11spectral confocal system fitted with high sensitivity gallium 12arsenide phosphide (GaAsP detectors) and using a 60x SI 131.3 NA lens. For smFISH detection of gurken transcripts, 14settings were optimized with a pinhole of 1.2 airy units 15and increasing laser power over the depth of the sample 16(between 30-50µm from the cover slip) to compensate for 17signal attenuation. 18

    2.4 Image analysis 19

    Basic image handling and processing was carried out in FIJI 20(ImageJ V1.51d; fiji.sc, [33]). Image data was archived in 21OMERO (V5.3.5) [34]; image conversions were carried out 22using the BioFormats plugin in FIJI [35]. RNA particles in 23the nurse cells and oocyte are identified and located using the 24FISH QUANT software [36]. We combine use of this soft- 25ware with manual segmentation of the nurse cells and oocyte 26performed in three dimensions using a GUI (graphical user 27interface) designed for the annotation of training and evalu- 28ation data for machine learning via software available from 29QBrain [37]. This enables quantification of the total num- 30bers of complexes observed in each cell. Identification of 31cells within the egg chamber is performed by counting the 32number of ring canals connecting each cell, and which cells 33these connect. Data is processed from n = 16 egg cham- 34bers between stage 5 and stage 8 of oogenesis. Since the 35RNA complexes are sparse in the nurse cells, using counts 36of particles is a valid method of quantifying the total RNA 37expression [38]. 38

    2.5 Identification of developmental time via an 39exponential growth model 40

    To identify a precise time variable for each egg chamber, 41we apply an exponential growth model to measurements 42of the area of the median section. Effectively this allows 43us to use log(A), where A is the area of the median sec- 44tion, as a rescaled variable for the timescale of develop- 45ment, by fitting the following model by linear regression: 46log(A) = log(A0) + τt, where the intercept t0 = log(A0) 47

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    http://fiji.schttps://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 4 3 RESULTS

    corresponds to the start of gurken mRNA production, and τ1gives a rescaling of time. To fit this model, we use data from2Shimada et al. [39] relating the developmental stage to the3area of egg chambers and take the time point corresponding4to the midpoint of each stage as an estimate of its age (Fig-5ure S1). This approach is supported by the results of Jia et al.6[40], who use logA in a similar way to assist with automatic7stage identification for Drosophila egg chambers.8

    2.6 Implementation of Monte Carlo methods9

    We use the open source software package Stan mc −10stan.org (version 2.17.0) [41] to perform MCMC (Markov11chain Monte Carlo) sampling from the posterior distribution.12Stan uses Hamiltonian Monte Carlo techniques to navigate13the geometry of the posterior distribution efficiently [42],14along with automatic differentiation to compute gradients.15We run four chains in parallel for 2000 iterations with a16burn in of 1000 iterations. Traceplots illustrating conver-17gence of each of the chains are shown in Figure S2. Code18to reproduce the analysis described in this work is available19at https://github.com/shug3502/rstan analysis.20

    2.7 Model comparison21

    We perform model comparison via leave-one-out cross val-22idation [43–45]. From the family of models under consid-23eration, we make testable predictions for the behaviour of24a genetic over-expression mutant and compare these with25experimental data. We compute pseudo-BMA+ (Bayesian26Model Averaging) weights [46] for each model, where the27largest weight will give the model closest in Kullback-28Leibler divergence to the data generating model. However,29these weights can potentially be misleading in the case where30the true model is not contained in the set of models con-31sidered, as is the case here. In this M-open case, stacking32weights [46] offer a more conservative choice of models. We33present pseudo-BMA+ and stacking weights, since both are34informative. For further details see Supplementary Material35Section K.36

    3 Results37

    3.1 A coarse-grained model for gurken mRNA localization38

    To investigate the mechanisms governing robustness of39mRNA localization, we developed a minimal, compartment-40based ODE model of gurken localization in a Drosophila egg41chamber. We view each cell as a separate compartment and42assume that RNA complexes are produced in each of the43nurse cells [10]. We assume that there is transport of RNA44complexes between cells that are connected by ring canals45(Figure 1), and that there is negligible degradation (there is46

    evidence that the majority of maternal mRNAs are stable 1during oogenesis over the timescales of interest [47, 48]). 2

    The model can be written as 3

    dydt

    = a v + bB(ν)y, (1)

    where the i th entry of the vector y is the number of RNA 4complexes in cell i (indexed as in Figure 1). RNA com- 5plexes are produced at constant rate a > 0 in units of 6[particles hr−1], and the vector v ∈ R16 lists the cells that 7produce mRNA; for the Drosophila egg chamber, where 8mRNA is produced in all cells except the ooctye, v = 9(0, 1, . . . , 1)ᵀ since the oocyte is transcriptionally silent for 10most of oogenesis [49]. Transport of complexes between 11cells takes place at constant rate b > 0 in units of

    [hr−1

    ]. The 12

    matrix B describes the network of connected nurse cells (so 13that entry (i, j) is non-zero only when cells i and j are con- 14nected by a ring canal). Transport bias (towards the oocyte) 15is represented by parameter ν so that complexes in a given 16nurse cell move towards the oocyte at relative rate 1− ν and 17away from it at relative rate ν (Figure 4).The precise form 18of B is provided in Supplementary Material Section A. Note 19that a similar approach has been employed by Alsous et al. 20[26] to show that differences in cell sizes in the Drosophila 21egg chamber result from the characteristic cell network gen- 22erated through incomplete divisions, and the resulting ring 23canal connections. As an initial condition, we assume there 24are no RNA complexes in any of the cells at time t0 (where 25t0 is determined from the linear model described in Section 262.5). The system of ODEs is solved numerically using the 27fourth order Runge-Kutta scheme implementation contained 28in the Boost C++ library [50]. 29

    Typical behaviour of the model is shown in Figure 2. 30Much of the biologically relevant behaviour occurs in the 31quasi-steady-state regime where we have linear growth in the 32number of RNA complexes in the oocyte, and a characteristic 33distribution of RNA complexes across the nurse cells. Typi- 34cal dynamic behaviour for each nurse cell is shown in Figure 35S3. 36

    3.2 Bayesian inference framework 37

    We connect the model directly to quantitative experimental 38data so that it can provide relevant predictions and insight. 39We use a Bayesian inference framework to take account of 40both measurement and process uncertainty, incorporating the 41mechanistic model stated in Equation (1), a model of the 42measurement process, and prior knowledge of parameters. 43

    The biological process model (Equation (1)) can be 44related to the observed data via a measurement model. The 45measurement model accounts for any errors in processing 46the data, in addition to raw experimental error. However, 47we first note that RNA complexes consist of multiple indi- 48vidual mRNA transcripts, and recall that upon entry to the 49

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    mc-stan.orgmc-stan.orgmc-stan.orghttps://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 3.3 Quantification of higher-order assembly of gurken RNA complexes in the oocyte 5

    500

    1000

    1500

    2000

    2500

    0

    1000

    2000

    15 20 25 30

    1 4 8 12 16

    Time (hrs)

    Cell ID

    mR

    NA

    com

    ple

    xes

    mR

    NA

    com

    ple

    xes

    a)

    b) Time: 32 hrs

    Figure 2: Predictions from the ODE model (1). (a) Dynamicbehaviour of the number of complexes in the oocyte; theremaining nurse cells are shown in Figure S3. (b) The dis-tribution of complexes across the cells in the egg chamberat t = 32 hrs demonstrates the accumulation of mRNAin the oocyte. The numbering for the Cell ID on the xaxis corresponds to that given in Figure 1. Parameters area = 10 particles hr−1, b = 0.20 hr−1 and ν = 0.10.

    oocyte there is some higher-order assembly of these com-1plexes [12, 17, 18]. Since we have quantitative data on the2number of RNA complexes in each cell, rather than the num-3ber of individual transcripts, we account for this higher-order4assembly of complexes in the oocyte in the measurement5model. We assume that in the oocyte we observe φ y1 com-6plexes, where φ ∈ (0, 1] is a scaling parameter. We can7interpret φ as a ratio between the number of transcripts per8complex in the nurse cells as compared to the oocyte.9

    We use a negative binomial distribution for the measure-10ment model, so that measured RNA complex counts, z, are11given by12

    z ∼ NB(Φy, σ), (2)where Φ is a diagonal matrix with entries [φ, 1, 1, . . . , 1], σ13is a parameter controlling the magnitude of the measurement14error, and y is the solution of the biological process model15(Equation (1)), giving the numbers of RNA complexes in16each cell. Details on the parameterization for the negative17binomial distribution are given in Supplementary Material18Section E. We take the product of the likelihoods over each19of the observed cells (including nurse cells and oocyte),20with each biological sample corresponding to a unique time21point, to give the full likelihood. Full details on the choice of22

    prior distributions for parameters are given in Supplemen- 1tary Material Section F. The parameters to be inferred are a, 2b, ν, φ and σ. 3

    3.3 Quantification of higher-order assembly of gurken 4RNA complexes in the oocyte 5

    Considering directly the total integrated intensity of RNA 6complexes in the oocyte compared to those in the nurse cells 7allows us to directly estimate the relative numbers of mRNA 8transcripts in an RNA complex in the oocyte compared to 9the nurse cells. We calculate the total integrated intensity 10for 448 foci in n = 13 example datasets for egg chambers 11from stage 5 to stage 8, including both the nurse cells and 12oocyte. By subtracting the background intensity, and divid- 13ing by the mean nurse cell intensity, we can obtain an esti- 14mate for φ, as shown in Figure 3: the median estimate gives 15φ = 0.345± 0.048. Fitting a mixture of Gaussians model to 16the intensity data reveals how transcripts are packed within 17RNA complexes via multimodal distributions of intensities 18(Supplementary Figure S5). 19

    (d)

    (a) (c)(b)

    BGD OONCRegion

    Nor

    mal

    ised

    inte

    nsity

    0

    5

    10

    15

    20

    (d)

    (a) (c)(b)

    BGD OONCRegion

    Nor

    mal

    ised

    inte

    nsity

    0

    5

    10

    15

    20(d)

    (a) (c)(b)

    BGD OONCRegion

    Nor

    mal

    ised

    inte

    nsity

    0

    5

    10

    15

    20

    (d)

    (a) (c)(b)

    BGD OONCRegion

    Nor

    mal

    ised

    inte

    nsity

    0

    5

    10

    15

    20

    Figure 3: Assembly of higher order gurken complexes inthe oocyte and quantification of the size of the assembliesin terms of equivalent complexes in the nurse cells. Typicalcomplexes in background (BGD), nurse cells (NC) and theoocyte (OO) are identified from the circled regions indicated.By measuring total integrated intensity of these complexes inn = 13 egg chambers, we obtain distributions in each region,normalized by subtracting the average background value andscaling to a single nurse cell complex. We find that com-plexes in the oocyte are equivalent to a median of 2.5 timesa single complex in the nurse cells.

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 6 3 RESULTS

    We can now use this independent estimate of φ to spec-1ify a strong prior for φ when fitting the full Bayesian model.2We obtain posterior estimates for the assembly parameter,3φ, with 0.40 as a median value, and [0.31, 0.49] as a 95%4credible interval. This gives an estimate of between two and5four nurse cell RNA complexes assembled into one equiva-6lent oocyte RNA complex. The posterior distribution for φ is7shown in pairwise plots of all the model parameters in Figure8S6.9

    3.4 Biased transport through ring canals revealed by10model comparison at quasi-steady state11

    We now consider the question of whether transport of RNA12complexes through ring canals in Drosophila is unidirec-13tional (towards the oocyte) or bidirectional (both towards and14away from the oocyte). In addition, if transport is bidirec-15tional, how biased is it towards the ooctye? Within the math-16ematical model (Section 3.1), the network structure (encoded17in the matrix B) and the bias parameter, ν, impose a distribu-18tion of RNA complexes across the nurse cells. By inferring a19posterior distribution for ν, through comparison of the model20with quantitative data, we can estimate the extent to which21transport through the ring canals is biased.22

    To simplify the model and ensuing Bayesian analysis,23we consider the model behaviour in the large time, quasi-24steady-state limit (which corresponds to the time scale on25which observations are made). In this limit, we have y ≈2615ak2t, where k2 is the quasi-steady-state distribution that27can be uniquely calculated for a given matrix B (see Sup-28plementary Material Section H). Normalizing both the vec-29tor k2 (so that the normalized RNA complex count in the30oocyte is unity), and the equivalent distribution measured31from smFISH data, enables comparison of the distribution32of complexes across the egg chamber under the assumption33of a Gaussian measurement error model, N(0, ξ2). We note34that a different measurement error model is required here35compared to Section 3.2, since the normalized mRNA distri-36bution takes positive real values rather than integer counts.37Priors are described in detail in Supplementary Material38Section F.39

    In Figure 4, we show the marginal posterior distribu-40tion for the transport bias parameter, ν. The median of the41marginal posterior distribution for ν is 0.02, with a 95%42credible interval of [0.00, 0.05]. The posterior predictive dis-43tribution (Figure 4(c)) contains the observed data points44within the 95% credible region for the complex distribu-45tion in each cell. Together these results strongly support the46hypothesis that transport of complexes is significantly biased47towards the oocyte.48

    3.5 Estimates of production and transport rates show 1production and transport are in tightly regulated 2

    balance 3

    We obtain estimates for the rates of production, a, and 4transport, b, of RNA complexes by applying the modelling 5approach outlined in Equation (1) and Section 3.2. We con- 6sider the full dynamic behaviour of the model, together with 7the smFISH dataset of n = 16 egg chambers (see Section 2) 8and infer all model parameters (a, b, ν, φ and σ). 9

    Upon sampling from the posterior for the full model, we 10find that the production rate, a, takes a median value of 10.0 11and lies within a 95% credible interval of [7.7, 14.1] (in units 12of [particles hr−1]). The transport rate, b, has a median value 13of 0.22 and lies within a 95% credible interval of [0.16, 0.31] 14(in units of [hr−1]). The marginal posterior distributions for 15the rate parameters a and b are shown in Figure 5, and the 16posterior distribution for all the model parameters is shown 17pairwise in Figure S6. 18

    Using these parameter estimates, we can compare the rel- 19ative contributions of the production and transport terms in 20Equation (1) to the rate of change in the number of RNA 21complexes. We have 10.0 ≈ a ≈ b〈ỹ〉 ≈ 21.4, where a 22is the complex production rate, b is the transport rate and 23〈ỹ〉 = 97 is the median number of gurken RNA complexes 24at time t considered across all cells of the egg chamber. This 25result indicates that the RNA complex production rate is of 26the same order of magnitude as the rate of complex trans- 27port, and therefore suggests that production and transport 28are tightly balanced. We provide further evidence to sup- 29port this hypothesis by evaluating the sensitivity of the model 30described in Section 3.1 to changes in the rate parameters a 31and b (see Supplementary Material Section J). 32

    3.6 Perturbing mRNA production yields results 33inconsistent with model predictions 34

    To further validate our results, we explored whether the 35model can predict the response of the system to a perturba- 36tion in the RNA complex production rate by considering an 37over-expression mutant with multiple copies of the gurken 38gene. We make the assumption that the mutant has RNA 39complex production rate γa, where a is the production rate 40in wild type, and γ > 1 is a scale factor, but that all other 41model parameters are unchanged. 42

    The model specified in Equation (1) predicts that the 43total number of RNA complexes in an egg chamber should 44increase at rate 15a in wild type, and 15γa in the over- 45expression mutant. Comparing this prediction with experi- 46mental data, we estimate a median value for γ of 2.23, with 47a 95% credible interval of [1.38, 3.06] (Supplementary Mate- 48rial Section L). We take the value γ = 2.23 for the remainder 49of this work. 50

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 3.6 Perturbing mRNA production yields results inconsistent with model predictions 7

    c)

    a) b)

    Figure 4: Results of MCMC sampling for the coarse grained model at steady state to determine the bias in transport throughring canals, as described in Section 3.4. In a), we display a schematic diagram of biased transport between two compartments,with transport from cell 2 to cell 1 at rate 1 − ν, and transport from cell 1 to cell 2 at rate ν. In b), we display a density plotof the marginal posterior distribution for the transport bias parameter ν, compared to the prior for the same parameter whichis uniform over [0.0, 0.5]. This shows evidence of strong bias in transport through ring canals. In c), we show the posteriorpredictive distribution with the raw data shown as red points. The shaded region shows a 95% credible interval of the distri-bution of predictions from the model. Here the distribution of mRNA across cells in the egg chamber is normalized such thatthe amount of mRNA in the oocyte is 1. The model provides a good fit to the data, since most of the data points lie within thegrey envelope of the 95% credible interval.

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 8 3 RESULTS

    0

    3

    6

    9

    15 11 9

    16 14 10 5 1

    12 7 3

    8 6 2

    4

    010002000300040005000

    0500

    10001500

    0250500750

    0200400600

    0100200300400

    0200400600

    0100200300400

    0100200300

    0100200300

    050

    100150200

    0100200

    0100200300

    050

    100150200

    0100200

    050

    100150200

    a b

    13

    10 15 20 0.2 0.3 0.40.000.050.100.150.200.25

    15 20 25 30 15 20 25 30 15 20 25 30 15 20 25 30 15 20 25 30

    050

    100150200

    Time (hrs)

    mR

    NA

    a)

    b)

    Figure 5: Results of MCMC sampling for the model described in Section 3.2 and sensitivity to model parameters. In a), wedisplay the posterior marginal distribution for the rate parameters a and b. In b), we show the posterior predictive distribution,which gives the predictions from the model with parameters fitted to the data. Each panel gives the prediction for each cell(numbered 1 for the oocyte through 2 to 16 for the nurse cells as in Figure 1) as a solid line, with measured experimentalparticle counts shown as points. A 95% credible interval for the predictions is shown as a shaded region. The blue points areobserved data used to train the model, whilst the red points are observed data used to test the model (note that these red pointsare egg chambers for which only the oocyte and neighbouring nurse cells have been segmented). The subplots are arranged tohighlight distance from the oocyte within the network of connections between nurse cells shown in Figure 1c).

    We now use the model to predict the distribution of RNA1complexes in each of the cells of the egg chamber by sam-2pling from the posterior predictive distribution generated3in Section 3.5 using wild type data only. In doing so, the4implicit assumption is that, in the over-expression mutant,5the only process significantly affected is the RNA complex6production rate. For the oocyte and neighbouring nurse cells,7

    the model predictions are consistent with the observed data 1(Figure 6). However, for the nurse cells furthest away from 2the oocyte (such as 8, 12, 14, 15, 16 which are at least three 3‘steps’ away from the oocyte) the observed experimental 4data contains nurse cells with much greater accumulation of 5RNA complexes than predicted by the model (Figure 6). We 6

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 3.9 Model comparison reveals blocking of ring canals as the most plausible mechanism 9

    highlight further the phenotypic differences in the distribu-1tion of mRNA across cells based on distance from the oocyte2in Figure S9.3

    Crucially, the lack of agreement between predictions and4observed data clearly indicates that the model does not cap-5ture key mechanistic details of the mRNA localization pro-6cess that are relevant to the over-expression mutant. The aim7of the ensuing analysis is to tease apart the nature of these8missing mechanistic details.9

    3.7 Localization of RNA in the oocyte of the10over-expression mutant reveals robustness11

    Quasi-steady-state analysis of the ODE model in Equa-12tion (1) predicts that RNA complex numbers in the oocyte,13y1, grow linearly at a rate proportional to 15a (Supplemen-14tary Material Section H). Therefore, in addition, the model15predicts that in the over-expression mutant, which has an16increased production rate of γa, the number of RNA com-17plexes localized to the oocyte at a given developmental time18should increase by a factor of γ compared to wild type.19However, rather than this predicted increase, we observe the20number of RNA complexes in the over-expression mutant21oocytes to be similar to that in wild type (Figure 7). This22results demonstrates an incredible robustness of the system23in the sense that the same amount of gurken RNA is local-24ized to the oocyte, despite large changes to the production25rate. A key question is now what mechanism gives rise to26the observed robustness?27

    3.8 Biological mechanisms to explain the over-expression28data can be represented as alternative models29

    We propose several plausible biological mechanisms to30explain the behaviour of the localization process in the over-31expression mutant, as listed below, and evaluate their ability32to recapitulate data from the over-expression mutant.33

    Inhomogeneous production. Previously we assumed equal34production rates of RNA complexes across all nurse cells.35However, the over-expression of gurken in nurse cells is36driven by the GAL4-UAS system, which can result in patchy37expression across cells. We now relax the assumption of38equal production rates, and estimate the production rates of39individual nurse cells by quantifying the number of nascent40gurken transcripts (Supplementary Material N).41

    Blocking or queuing at ring canals. With increased levels42of gurken in the over-expression mutant, the environment43inside the nurse cells is more crowded. We hypothesize that44the transport of RNA complexes through ring canals could be45blocked or restricted due to crowding. Blocking in this way46restricts transport both towards, and away from, the oocyte.47We can represent this hypothesis in the model by changing48

    Production BlockingDensity

    dependenttransport

    PseudoBMA+weights

    Stackingweights

    M0 0 0 0 0.04 0.00M1 0 1 0 0.94 0.40M2 0 0 1 0.00 0.08M3 1 0 0 0.01 0.23M4 1 0 1 0.00 0.00M5 1 1 0 0.00 0.29M6 0 1 1 0.00 0.00M7 1 1 1 0.00 0.00

    Table 1: A description of the collection of models M ={M0,M1, . . . ,M7}, together with the weights generatedusing model comparison (see Section 2.7 for details).

    appropriate entries in the matrix B to zero: manual exami- 1nation of the data was used to predict which ring canals are 2blocked. 3

    Density dependent transport. The final hypothesis we con-sider is that increases in gurken in the over-expressionmutant lead to saturation of the transport mechanism. Thishypothesis is motivated by observations of a build up of RNAat nuclear pore complexes in the over-expression mutant,suggesting that nuclear export or the ability to form compe-tent mRNA particles for transport may be saturated. Insteadof assuming that transport rate between cells is linearly pro-portional to the number of RNA complexes, we assume asaturating transport rate of the form

    f(y) =y

    1 + βy,

    where β is a parameter describing the density dependence. 4

    We represent the hypotheses detailed above as a col- 5lection of models, M = {M0,M1, . . . ,M7}, that together 6represent all possible combinations of the additional mech- 7anisms (see Table 1 for details). Models including the 8crowding-induced blocking at ring canals and inhomoge- 9neous production mechanisms can be forward simulated 10using parameters from the posterior distribution based on 11wild type data, as in Figure 6. Models including density 12dependent transport must be fitted to the wild type data to 13estimate the parameter β. 14

    3.9 Model comparison reveals blocking of ring canals as 15the most plausible mechanism 16

    We use model comparison approaches (Section 2.7) to eval- 17uate the ability of the different models to explain the bio- 18logical observations. Table 1 indicates strong support for 19the blocking of ring canals as a mechanism to explain the 20observed over-expression data: Model M1 has a pseudo- 21BMA+ weight close to 1, indicating that of the models 22

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 10 3 RESULTS

    13

    15 11 9

    16 14 10 5 1

    12 7 3

    8 6 2

    4

    0 10 20 0 10 20 0 10 20 0 10 20 0 10 20

    0250050007500

    0

    1000

    2000

    3000

    0500

    100015002000

    0300600900

    1200

    0200400600800

    0500

    100015002000

    0250500750

    0300600900

    0

    500

    1000

    1500

    0

    500

    1000

    0

    500

    1000

    1500

    0500

    10001500

    0250500750

    1000

    0100200300400

    0250500750

    1000

    0300600900

    Time (hrs)

    mR

    NA

    Figure 6: Posterior predictive distribution for the over-expression mutant based on the ODE model (1) fitted to wild type data.Model predictions (gray envelope) show good agreement with observed data (red points) for cells close to the oocyte. Cellsfurther from the oocyte, such as cell 16, show discrepancy between model predictions and the observed data, indicating thereare biological mechanisms not captured by the model. The observed data for the over-expression mutant is shown as red dots,with the gray envelope showing a 95% credible interval of predictions from the model via the posterior predictive distribution.The subplots are arranged to highlight distance from the oocyte within the network of connections between nurse cells shownin Figure 1c).

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 11

    0

    500

    1000

    1500

    0 10 20 30Time

    mR

    NA

    com

    ple

    xes

    in o

    ocyte

    phenotypeOEWT

    Figure 7: The number of mRNA complexes in the oocyteincreases linearly over time for both the wild type (blue)and the over-expression mutant (red). Strikingly, the rate ofincrease of mRNA in the oocyte (shown by the gradient ofthe red and blue lines) is approximately equal in each case,whereas we would have expected this to be γ times greaterfor the over-expression mutant. The tight regulation of thelocalization process biologically ensures the right amount ofRNA reaches the oocyte at the right time, even under bio-logical perturbation. Here this is due to blocking of ringcanals. The points show observed data on number of RNAcomplexes, and the lines show linear models fit to these data.

    considered this is the most plausible model. The more con-1servative stacking weights approach also weights several2other models relatively highly, however these are predom-3inantly models that involve blocking as well as additional4mechanisms. These results suggest that none of the mod-5els considered perfectly describe the data generating pro-6cess (to be expected since the model is an incredibly sim-7plified description of a very complex biological process).8However, predictions of RNA complex distributions for the9over-expression mutant made using M1 (which incorpo-10rates only the blocking of ring canals) are substantially11improved compared to the predictions of the basic model.12Taken together, our results reveal crowding-driven block-13ing of ring canals as the most plausible explanation for our14experimental observations.15

    3.10 Reexamining microscopy data provides evidence of16queuing at ring canals17

    A natural question to ask is then whether it is possible18to validate this prediction experimentally? By re-examining19microscopy data from the over-expression mutant, we20demonstrate evidence for the crowding-related blocking of21ring canals in smFISH data: we observe clusters of RNA22

    complexes around ring canals (Figure 8). Similar observa- 1tions of a build up of RNA complexes near ring canals have 2also been reported upon injection of gurken mRNA in nurse 3cells [12]. Together, these results suggest that, while the 4transport of RNA complexes within nurse cells may be rela- 5tively fast, the efficiency of the transport process as a whole 6can suffer when transit through ring canals is blocked by 7crowding. Based on assessment of the mRNA counts across 8cells (Supplementary Figure S12), we find 64% (n = 14) 9of the over-expression mutant egg chambers show evidence 10of blocking behaviour. The mechanism leading to crowding- 11induced blocking in over-expression mutants may be present 12also in wild type, but such events occur rarely in wild type 13due to lower accumulation of RNA complexes within a nurse 14cell. 15

    Phalloidin Alexa488 (actin)grk smFISH

    (a)

    14 μm

    a)

    b)

    (b)

    Figure 8: Blocking of ring canals may result in significantaccumulation of mRNA in nurse cells far from the oocyte. a)A clustered distribution of mRNA complexes queuing nearthe ring canal (arrow). b) A comparison between simula-tions from the model with all ring canals transporting RNAequally, and a situation where the ring canal between cells8 and 16 is blocked. The schematic inset in a) shows theposition of the blocked ring canal within the tissue.

    4 Discussion 16

    Combining simple mechanistic mathematical models and 17Bayesian inference approaches together with experimen- 18tal evidence, we have investigated the dynamics of mRNA 19

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 12 4 DISCUSSION

    localization in Drosophila egg chambers, and made quanti-1tative predictions of the dynamics of the localization process.2Estimates of the production and transport rates of gurken3mRNA indicate that these processes are tightly balanced4in mRNA localization. To confer robustness on the sys-5tem, large perturbations must be buffered by the localization6mechanism to ensure that the ‘correct’ amount of RNA is7localized at the right time in development. We have shown,8using a simple model, that crowding-driven blocking of ring9canals is sufficient to regulate the localization of mRNA in10the oocyte.11

    In addition, our results suggest that transport of RNA12complexes through ring canals is strongly biased. The char-13acteristic network of connections between the nurse cells14specifies a given network structure that we represent via the15matrix B in Equation (1). By inferring the bias parameter, ν,16we have shown that the observed distribution of RNA com-17plexes across the egg chamber is consistent with strongly18biased transport of RNA complexes towards the oocyte. This19strong bias at a macroscopic cellular scale is nonetheless20consistent with weak bias in transport at a microscopic scale21on the microtubule network [51], since averaging over many22movements of many particles moving with weakly biased23random motion can result in strong bias for the ensemble24average.25

    We have also provided measurements and estimates for26the assembly of higher order RNA complexes in the oocyte27following remodelling upon transit through the ring canals28between the oocyte and the nurse cells. Measurements of29RNA content for bicoid, oskar and nanos RNA complexes30have been performed previously [18, 52], although largely31for later stages of development, and we note that the esti-32mates given here are consistent with similar measurements33on the distribution of multimeric RNA complexes of oskar34upon entry to the oocyte at stage 10b [18]. This process35of remodelling of RNA complexes upon transit into the36oocyte may play a role within wider changes in how RNA37is packaged in the oocyte compared to the nurse cells.38

    We attempted to validate the mathematical model (Equa-39tion (1)) by making predictions about behaviour of an over-40expression mutant. The predictions of this simple model41revealed a discrepancy in the accumulation of mRNA in42the nurse cells furthest from the oocyte compared with the43observed experimental data: in some cases, far more mRNA44is observed in these cells than predicted by the model. In45addition, the observed rate of localization of mRNA in the46oocyte is similar in the over-expression mutant as compared47to wild type, contrary to predictions from the simple model.48We resolved this discrepancy by using model comparison49approaches to compare the ability of a range of biological50mechanisms to replicate the observed data. Our results sug-51gest crowding-induced blocking of ring canals as a plausible52explanation for the discrepancy between model prediction53and experimental observations, and more generally that the54transport process becomes overloaded in the over-expression55

    mutant. We therefore argue that the crowding-induced block- 1ing of ring canals is effectively a regulatory mechanism 2that helps ensure the robustness of gurken localization in 3Drosophila oocytes. Precise control of gene expression dur- 4ing development can be facilitated by indirect methods [53]; 5blocking of ring canals is an example of such indirect 6regulation. 7

    We have considered generalizations of the simple model 8(Equation (1)) to incorporate inhomogeneous variance 9across cells in our observations or a decay term with decay 10of RNA at rate δ. Details are described in Supplementary 11Material Sections P and Q. Our conclusions about the block- 12ing mechanism hold also for these more general descrip- 13tions. The simple model is favoured by model comparison 14when compared to these models, which do not generalize as 15well to predictions for the over-expression data. Alternative 16hypotheses to account for the discrepency between model 17predictions and observations, based on different degradation 18rates for the different phenotypes, may have some merit, but 19they are hard to assess within the framework of prediction 20conditioned on wild type data used to compare the other 21candidate models (see Supplementary Material Section P). 22

    23

    To further assess the crowding-induced blocking mech- 24anism, we suggest directly measuring transit through ring 25canals in the wild type and over-expression mutant condi- 26tions. However, this task is nontrivial and would require 27extended time lapse imaging to track movements of com- 28plexes through representative subsets of all ring canals. 29

    30

    Finally, we highlight that the conclusions drawn as a 31result of this combined modelling-experiment study were 32made possible through use of Bayesian inference approaches 33that allowed us to interrogate and interpret quantitative 34data, making full use of the information contained therein. 35It is important to note also that our conclusions were 36reached through the use of an incredibly simple, analytically 37tractable, coarse-grained model, that allowed us to abstract 38many of the intricate details, and focus on salient mecha- 39nisms. We hope that our work serves as an exemplar for 40future studies in this area. 41

    Author contributions 42

    JUH and RMP designed and performed biological experi- 43ments and JUH carried out primary analysis and interpre- 44tation of the data. JUH and REB developed the modelling 45approaches and JUH wrote the code and performed statisti- 46cal inference. All authors discussed the results and conclu- 47sions, commented on and contributed to the revision of the 48manuscript. REB, RMP and ID supervised the project. 49

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 13

    Acknowledgements1

    This work was supported by funding from the Engineer-2ing and Physical Sciences Research Council (EPSRC) (grant3no. EP/G03706X/1). R.E.B. is a Royal Society Wolfson4Research Merit Award holder and a Leverhulme Research5Fellow, and also acknowledges the BBSRC for funding6via grant no. BB/R000816/1. This work was additionally7supported by a Wellcome Trust Senior Research Fellow-8ship (096144) and Wellcome Investigator Award (209412)9to I.D. and supporting R.M.P. Thanks also to MICRON10(http://micronoxford.com, supported by Wellcome11Strategic Awards 091911/B/10/Z and 107457/Z/15/Z) for12access to equipment.13

    References14

    [1] Becalska, A. N., and E. R. Gavis, 2009. Lighting up15mRNA localization in Drosophila oogenesis. Development16136:2493–2503.17

    [2] Buxbaum, A. R., G. Haimovich, and R. H. Singer, 2014. In18the right place at the right time: visualizing and understanding19mRNA localization. Nature Reviews Molecular Cell Biology20.21

    [3] Parton, R. M., A. Davidson, I. Davis, and T. T. Weil, 2014.22Subcellular mRNA localisation at a glance. Journal of Cell23Science 127:2127–2133.24

    [4] Wolpert, L., R. Beddington, J. Brockes, T. Jessell,25P. Lawrence, and E. Meyerowitz, 1998. Principles of Devel-26opment. Oxford University Press, Oxford.27

    [5] Wilkie, G. S., and I. Davis, 2001. Drosophila wingless28and pair-rule transcripts localize apically by dynein-mediated29transport of RNA particles. Cell 105:209–219.30

    [6] Bobola, N., R. Jansen, T. H. Shin, and K. Nasmyth, 1996.31Asymmetric accumulation of Ash1p in postanaphase nuclei32depends on a myosin and restricts yeast mating-type switch-33ing to mother cells. Cell 84:699–709.34

    [7] Mowry, K. L., and D. A. Melton, 1992. Vegetal messen-35ger RNA localization directed by a 340-nt RNA sequence36element in Xenopus oocytes. Science 255:991–994.37

    [8] Rosbash, M., and R. H. Singer, 1993. RNA travel: tracks38from DNA to cytoplasm. Cell 75:399–401.39

    [9] Nevo-Dinur, K., A. Nussbaum-Shochat, S. Ben-Yehuda, and40O. Amster-Choder, 2011. Translation-independent localiza-41tion of mRNA in E. coli. Science 331:1081–1084.42

    [10] Cáceres, L., and L. A. Nilson, 2005. Production of gurken43in the nurse cells is sufficient for axis determination in the44Drosophila oocyte. Development 132:2345–2353.45

    [11] Bullock, S. L., and D. Ish-Horowicz, 2001. Conserved signals46and machinery for RNA transport in Drosophila oogenesis47and embryogenesis. Nature 414:611–616.48

    [12] Clark, A., C. Meignin, and I. Davis, 2007. A Dynein-49dependent shortcut rapidly delivers axis determination tran-50scripts into the Drosophila oocyte. Development 134:1955–511965.52

    [13] Spradling, A. C., 1993. Developmental genetics of oogen-53esis. In The development of Drosophila melongaster. Cold54Spring Harbour, NY, Cold Spring Harbour Laboratory Press,55

    1–70. 1[14] Alsous, J. I., P. Villoutreix, N. Stoop, S. Y. Shvartsman, and 2

    J. Dunkel, 2018. Entropic effects in cell lineage tree packings. 3Nature Physics 14:1016. 4

    [15] Femino, A. M., F. S. Fay, K. Fogarty, and R. H. Singer, 51998. Visualization of single RNA transcripts in situ. Science 6280:585–590. 7

    [16] Raj, A., P. Van Den Bogaard, S. A. Rifkin, A. Van Oude- 8naarden, and S. Tyagi, 2008. Imaging individual mRNA 9molecules using multiple singly labeled probes. Nature 10Methods 5:877–879. 11

    [17] Bohrmann, J., and K. Biber, 1994. Cytoskeleton-dependent 12transport of cytoplasmic particles in previtellogenic to mid- 13vitellogenic ovarian follicles of Drosophila: time-lapse anal- 14ysis using video-enhanced contrast microscopy. Journal of 15Cell Science 107:849–858. 16

    [18] Little, S. C., K. S. Sinsimer, J. J. Lee, E. F. Wieschaus, and 17E. R. Gavis, 2015. Independent and coordinate trafficking of 18single Drosophila germ plasm mRNAs. Nature Cell Biology 1917:558. 20

    [19] Trong, P. K., H. Doerflinger, J. Dunkel, D. St Johnston, and 21R. E. Goldstein, 2015. Cortical microtubule nucleation can 22organise the cytoskeleton of Drosophila oocytes to define the 23anteroposterior axis. eLife 4:e06088. 24

    [20] Ciocanel, M. V., J. A. Kreiling, J. A. Gagnon, K. L. Mowry, 25and B. Sandstede, 2017. Analysis of Active Transport by 26Fluorescence Recovery after Photobleaching. Biophysical 27Journal 112:1714–1725. 28

    [21] Bressloff, P. C., and J. M. Newby, 2013. Stochastic models of 29intracellular transport. Reviews of Modern Physics 85:135. 30

    [22] Howard, J., 2001. Mechanics of Motor Proteins and the 31Cytoskeleton. Sinauer, Sunderland, MA. 32

    [23] Hafner, A. E., and H. Rieger, 2018. Spatial cytoskeleton orga- 33nization supports targeted intracellular transport. Biophysical 34Journal 114:1420–1432. 35

    [24] Hawkins, R. J., O. Benichou, M. Piel, and R. Voituriez, 362009. Rebuilding cytoskeleton roads: Active-transport- 37induced polarization of cells. Physical Review E 80:040903. 38

    [25] Bressloff, P. C., and B. Xu, 2015. Stochastic active-transport 39model of cell polarization. SIAM Journal on Applied 40Mathematics 75:652–678. 41

    [26] Alsous, J. I., P. Villoutreix, A. M. Berezhkovskii, and S. Y. 42Shvartsman, 2017. Collective growth in a small cell network. 43Current Biology 27:2670–2676.e4. 44

    [27] Bökel, C., S. Dass, M. Wilsch-Bräuninger, and S. Roth, 452006. Drosophila Cornichon acts as cargo receptor for ER 46export of the TGFα-like growth factor Gurken. Development 47133:459–470. 48

    [28] Jaramillo, A. M., T. T. Weil, J. Goodhouse, E. R. Gavis, and 49T. Schupbach, 2008. The dynamics of fluorescently labeled 50endogenous gurken mRNA in Drosophila. Journal of Cell 51Science 121:887–894. 52

    [29] Weil, T. T., R. M. Parton, B. Herpers, J. Soetaert, T. Veenen- 53daal, D. Xanthakis, I. M. Dobbie, J. M. Halstead, R. Hayashi, 54C. Rabouille, and I. Davis, 2012. Drosophila patterning 55is established by differential association of mRNAs with P 56bodies. Nature Cell Biology 14:1305–1313. 57

    [30] Edwards, K. A., M. Demsky, R. A. Montague, N. Weymouth, 58and D. P. Kiehart, 1997. GFP-Moesin illuminates Actin 59

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

  • 14 4 DISCUSSION

    cytoskeleton dynamics in living tissue and demonstrates1cell shape changes during morphogenesis in Drosophila.2Developmental Biology 191:103–117.3

    [31] Parton, R. M., A. M. Vallés, I. M. Dobbie, and I. Davis, 2010.4Pushing the limits of live cell imaging in Drosophila. Live5cell imaging: a laboratory manual 387–418.6

    [32] Davidson, A., R. M. Parton, C. Rabouille, T. T. Weil, and7I. Davis, 2016. Localized translation of gurken/TGF-α8mRNA during axis specification is controlled by access to9Orb/CPEB on processing bodies. Cell Reports 14:2451–102462.11

    [33] Schindelin, J., I. Arganda-Carreras, E. Frise, V. Kaynig,12M. Longair, T. Pietzsch, S. Preibisch, C. Rueden, S. Saalfeld,13and B. Schmid, 2012. Fiji: an open-source platform for14biological-image analysis. Nature Methods 9:676.15

    [34] Allan, C., J. M. Burel, J. Moore, C. Blackburn, M. Link-16ert, S. Loynton, D. MacDonald, W. J. Moore, C. Neves,17and A. Patterson, 2012. OMERO: flexible, model-driven18data management for experimental biology. Nature Methods199:245.20

    [35] Linkert, M., C. T. Rueden, C. Allan, J. M. Burel, W. Moore,21A. Patterson, B. Loranger, J. Moore, C. Neves, and D. Mac-22Donald, 2010. Metadata matters: access to image data in the23real world. The Journal of Cell Biology 189:777–782.24

    [36] Mueller, F., A. Senecal, K. Tantale, H. Marie-Nelly, N. Ly,25O. Collin, E. Basyuk, E. Bertrand, X. Darzacq, and C. Zim-26mer, 2013. FISH-quant: automatic counting of transcripts in273D FISH images. Nature Methods 10:277–278.28

    [37] Hailstone, M., L. Yang, D. Waithe, T. J. Samuels, Y. Arava,29T. Dobrzycki, R. M. Parton, and I. Davis, 2017. Brain devel-30opment: machine learning analysis of individual stem cells in31live 3D tissue. bioRxiv .32

    [38] Yang, L., J. Titlow, D. Ennis, C. Smith, J. Mitchell, F. L.33Young, S. Waddell, D. Ish-Horowicz, and I. Davis, 2017.34Single molecule fluorescence in situ hybridisation for quan-35titating post-transcriptional regulation in Drosophila brains.36bioRxiv 128785.37

    [39] Shimada, Y., K. M. Burn, R. Niwa, and L. Cooley, 2011.38Reversible response of protein localization and microtubule39organization to nutrient stress during Drosophila early ooge-40nesis. Developmental Biology 355:250–262.41

    [40] Jia, D., Q. Xu, Q. Xie, W. Mio, and W. M. Deng, 2016. Auto-42matic stage identification of Drosophila egg chamber based43on DAPI images. Scientific Reports 6:18850.44

    [41] Carpenter, B., A. Gelman, M. Hoffman, D. Lee, B. Goodrich,45M. Betancourt, M. Brubaker, J. Guo, P. Li, and A. Riddell,462017. Stan: A Probabilistic Programming Language. Journal47of Statistical Software 76:1–32.48

    [42] Betancourt, M., S. Byrne, S. Livingstone, and M. Girolami,492017. The geometric foundations of Hamiltonian Monte50Carlo. Bernoulli 23:2257–2298.51

    [43] Piironen, J., and A. Vehtari, 2017. Comparison of Bayesian52predictive methods for model selection. Statistics and53Computing 27:711–735.54

    [44] Vehtari, A., A. Gelman, and J. Gabry, 2017. Practi-55cal Bayesian model evaluation using leave-one-out cross-56validation and WAIC. Statistics and Computing 27:1413–571432.58

    [45] Vehtari, A., J. Gabry, Y. Yao, and A. Gelman, 2018.59

    loo: Efficient leave-one-out cross-validation and WAIC for 1Bayesian models. https://CRAN.R-project.org/ 2package=loo, r package version 2.0.0. 3

    [46] Yao, Y., A. Vehtari, D. Simpson, and A. Gelman, 2018. 4Using stacking to average Bayesian predictive distributions. 5Bayesian Analysis . 6

    [47] Tadros, W., and H. D. Lipshitz, 2005. Setting the stage for 7development: mRNA translation and stability during oocyte 8maturation and egg activation in Drosophila. Developmental 9Dynamics 232:593–608. 10

    [48] Little, S. C., G. Tkačik, T. B. Kneeland, E. F. Wieschaus, and 11T. Gregor, 2011. The formation of the Bicoid morphogen 12gradient requires protein movement from anteriorly localized 13mRNA. PLOS Biology 9:e1000596. 14

    [49] King, R. C., and R. G. Burnett, 1959. Autoradiographic study 15of uptake of tritiated glycine, thymidine, and uridine by fruit 16fly ovaries. Science 129:1674–1675. 17

    [50] Schäling, B., 2011. The boost C++ libraries. Boris Schäling. 18[51] Parton, R. M., R. S. Hamilton, G. Ball, L. Yang, C. F. Cullen, 19

    W. Lu, H. Ohkura, and I. Davis, 2011. A PAR-1–dependent 20orientation gradient of dynamic microtubules directs poste- 21rior cargo transport in the Drosophila oocyte. The Journal of 22Cell Biology 194:121–135. 23

    [52] Trovisco, V., K. Belaya, D. Nashchekin, U. Irion, G. Siri- 24nakis, R. Butler, J. J. Lee, E. R. Gavis, and D. St Johnston, 252016. bicoid mRNA localises to the Drosophila oocyte ante- 26rior by random Dynein-mediated transport and anchoring. 27eLife 5. 28

    [53] Little, S. C., M. Tikhonov, and T. Gregor, 2013. Precise 29developmental gene expression arises from globally stochas- 30tic transcriptional activity. Cell 154:789–800. 31

    Biophysical Journal 00(00) 1–14

    .CC-BY 4.0 International licenseavailable under anot certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made

    The copyright holder for this preprint (which wasthis version posted May 29, 2019. ; https://doi.org/10.1101/533133doi: bioRxiv preprint

    https://CRAN.R-project.org/package=loohttps://CRAN.R-project.org/package=loohttps://CRAN.R-project.org/package=loohttps://doi.org/10.1101/533133http://creativecommons.org/licenses/by/4.0/

    IntroductionMethods and materialsFly strains and tissue preparationSingle molecule FISHFixed-cell imagingImage analysisIdentification of developmental time via an exponential growth modelImplementation of Monte Carlo methodsModel comparison

    ResultsA coarse-grained model for gurken mRNA localizationBayesian inference frameworkQuantification of higher-order assembly of gurken RNA complexes in the oocyteBiased transport through ring canals revealed by model comparison at quasi-steady stateEstimates of production and transport rates show production and transport are in tightly regulated balancePerturbing mRNA production yields results inconsistent with model predictionsLocalization of RNA in the oocyte of the over-expression mutant reveals robustnessBiological mechanisms to explain the over-expression data can be represented as alternative modelsModel comparison reveals blocking of ring canals as the most plausible mechanismReexamining microscopy data provides evidence of queuing at ring canals

    Discussion