help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

Originally published as Biophys J. BioFAST on January 11, 2007.
doi:10.1529/biophysj.106.093781
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
biophysj.106.093781v1
92/7/2350    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Goutsias, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Goutsias, J.
Biophysical Journal 92:2350-2365 (2007)
© 2007 The Biophysical Society

Classical versus Stochastic Kinetics Modeling of Biochemical Reaction Systems

John Goutsias

Whitaker Biomedical Engineering Institute, The Johns Hopkins University, Baltimore, Maryland

Correspondence: Address reprint requests to J. Goutsias, Tel.: 410-516-7871; E-mail: goutsias{at}jhu.edu.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 BIOCHEMICAL REACTION SYSTEMS
 A QUADRATIC AUTOCATALATOR
 CONCLUSIONS
 APPENDIX A: STOCHASTIC CHEMICAL...
 APPENDIX B: UNIDIRECTIONAL...
 APPENDIX C: QUADRATIC...
 APPENDIX D: SCKE APPROXIMATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
We study fundamental relationships between classical and stochastic chemical kinetics for general biochemical systems with elementary reactions. Analytical and numerical investigations show that intrinsic fluctuations may qualitatively and quantitatively affect both transient and stationary system behavior. Thus, we provide a theoretical understanding of the role that intrinsic fluctuations may play in inducing biochemical function. The mean concentration dynamics are governed by differential equations that are similar to the ones of classical chemical kinetics, expressed in terms of the stoichiometry matrix and time-dependent fluxes. However, each flux is decomposed into a macroscopic term, which accounts for the effect of mean reactant concentrations on the rate of product synthesis, and a mesoscopic term, which accounts for the effect of statistical correlations among interacting reactions. We demonstrate that the ability of a model to account for phenomena induced by intrinsic fluctuations may be seriously compromised if we do not include the mesoscopic fluxes. Unfortunately, computation of fluxes and mean concentration dynamics requires intensive Monte Carlo simulation. To circumvent the computational expense, we employ a moment closure scheme, which leads to differential equations that can be solved by standard numerical techniques to obtain more accurate approximations of fluxes and mean concentration dynamics than the ones obtained with the classical approach.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 BIOCHEMICAL REACTION SYSTEMS
 A QUADRATIC AUTOCATALATOR
 CONCLUSIONS
 APPENDIX A: STOCHASTIC CHEMICAL...
 APPENDIX B: UNIDIRECTIONAL...
 APPENDIX C: QUADRATIC...
 APPENDIX D: SCKE APPROXIMATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The design of predictive models of cellular regulation is an important problem in computational systems biology. The majority of models published in the literature assume that cells are well-stirred, homogeneous biochemical reaction systems at thermal equilibrium, an assumption that we also follow in this article. A widely used approach to modeling cellular regulation characterizes the dynamic evolutions of molecular concentrations by deterministic first-order ordinary differential equations, known as chemical kinetics equations (CKEs) (1Go). However, to take into account that reactions in cells occur by random collisions of reactant molecules, we must employ a stochastic approach to modeling cellular regulation. A popular approach characterizes the dynamic evolution of the joint probability mass function of the state of cellular regulation by a first-order partial differential equation known as the chemical master equation (CME) (2Go–4Go). This leads to a modeling methodology that has been employed in several biological settings with remarkable success (5Go–9Go).

It has been increasingly recognized that cellular regulation should be studied at the level of single cells. Despite a growing effort to develop experimental methods for observing biochemical activities in single cells (10Go–12Go), these methods can only be used to simultaneously observe a limited number of molecular dynamics. Most experimental techniques used today estimate molecular concentrations in tissues containing a large number of cells (13Go,14Go). As a consequence, appreciable research activity is focused on studying the aggregate behavior of cellular regulation in a large population of cells.

For the purpose of this work, we may assume that a tissue is composed of K genetically identical cells that express the same set of genes independently from each other. This is a convenient albeit reasonable approximation, since it frees us from modeling tissue inhomogeneities and biological effects due to complex interactions among cells. We may model cellular activities in each cell by a stochastic biochemical reaction system that consists of N molecular species and use the random variable Xnk(t) to denote the number of molecules of the nth species present in the kth cell at time t. Since cellular regulation is observed by pooling together molecules extracted from all cells in the tissue, we may characterize its state at time t by the molecular concentrations Yn(t) = (Xn1(t) + Xn2(t) + ··· + XnK(t))/KAV, where V is the cellular volume and A = 6.0221415 x 1023 mol–1 is the Avogadro constant. The mean value and variance of Yn(t) are given by E(Yn(t)) = µX,n(t)/AV and Var(Yn(t)) = vX,nn(t)/KA2V2, where µX,n(t) and vX, nn(t) are the mean and variance of Xnk(t), respectively. This implies that the mean value of the molecular concentration Yn(t) of the nth species is independent of the number of cells in the tissue, whereas its variance tends to zero as the number of cells grows to infinity (provided that the variance vX,nn(t) is finite). As a consequence, we may approximately characterize cellular regulation in a large population of cells by the mean concentration vector

Formula 1(1)
where Formula 1 denotes the N x 1 mean vector with elements µX,n(t), n = 1, 2, ..., N.

An important question that arises here is whether the molecular concentration dynamics predicted by the CKEs coincide with the mean concentration dynamics predicted by the underlying CME and Eq. 1. It turns out that, given the CME, we can uniquely construct the corresponding CKEs and vice versa. Therefore, we may expect that the two approaches lead to the same dynamics. However, by analyzing a number of simple chemical reactions such as 2A -> B, A + B -> C, A -> B, A -> B -> C, 2A -> B -> C, and Formula 1, it was previously shown in the literature (15Go–17Go) that this may not be true in general. A notable exception occurs at the thermodynamic limit, in which the number of molecules and cellular volume tend to infinity while the molecular concentrations remain finite, or when all reaction mechanisms are linear. However, both of these cases are clearly not realistic.

In Zheng and Ross (18Go), they extended the previous work by focusing on the autocatalytic cubic Schlögl model Formula 1, Formula 1. These investigators noted that differences between classical and stochastic chemical kinetics are due to a coupling of correlation effects with system nonlinearities. By focusing on parameter values that lead to the same stationary states (concentrations of the molecular intermediate B) for both models, they showed that the deterministic model may result in quantitatively different transient behavior for the mean concentration of B than the corresponding stochastic model, with the maximum deviation between the concentration trajectories decreasing as the model parameters are modified toward a linear kinetic mechanism.

In view of the fact that cellular regulation is controlled by a complex network of biochemical reactions, it is necessary to investigate the relationship between classical and stochastic chemical kinetics in a more general setting than the one considered in the literature (15Go–18Go). In this article, we derive fundamental relationships between the two approaches for a biochemical reaction system that consists of elementary (monomolecular or bimolecular) irreversible reactions. We can use this system to model any set of biochemical reactions, since we can decompose any reaction that involves more than two molecules (a rare possibility in practice) into a cascade of bimolecular reactions and split a reversible reaction into two separate irreversible reactions (19Go). We have chosen to illustrate our results by employing two reaction mechanisms: a unidirectional dimerization and a quadratic autocatalator with positive feedback. These mechanisms allow us to clearly demonstrate that intrinsic stochastic fluctuations may appreciably influence the qualitative and quantitative behavior of cellular regulation and to analytically investigate the origins of such influence. Note, however, that our approach is very general and can be applied to more complex regulatory mechanisms as well.

In this article, we show that the mean concentration dynamics predicted by the CME are governed by first-order ordinary differential equations similar to the ones obtained by classical chemical kinetics, expressed in terms of the stoichiometry coefficients and the time-dependent fluxes of the underlying reactions. However, the flux is now decomposed into a macroscopic and a mesoscopic term. The macroscopic term is analytically identical to the classical flux and accounts for the effect of mean reactant concentrations on the rate of product synthesis, whereas the mesoscopic term accounts for the effect of statistical correlations among interacting reactions. When all mesoscopic fluxes are zero, a situation that occurs when the biochemical reaction system consists of only monomolecular reactions (which leads to linear reaction mechanisms), the concentration dynamics predicted by the CKEs will be identical to the mean concentration dynamics predicted by the CME. However, and by using the two aforementioned examples, our analytical and numerical investigations show that nonzero mesoscopic fluxes may induce appreciable qualitative and quantitative differences in transient and stationary system behavior from that predicted by classical chemical kinetics. In addition to the conclusions reached by Zheng and Ross (18Go), we show that the mean concentration dynamics predicted by the CME may converge to different stationary values than those predicted by the CKEs, thus supporting the fact that intrinsic stochastic fluctuations may play an important role in determining a cell's phenotype by quantitatively influencing cell regulation at steady state. Moreover, we analytically and numerically demonstrate that intrinsic stochastic fluctuations may also affect the epigenetic properties of cell regulation in a qualitative manner by introducing novel modes of stationary behavior not accounted for by the CKEs. Hence, a sufficiently accurate model of mean concentration dynamics must necessarily include all mesoscopic fluxes in its formulation. These developments provide a theoretical understanding of the role that intrinsic stochastic fluctuations may play in inducing biochemical function.

The mesoscopic fluxes cannot be evaluated analytically. We can estimate them by Monte Carlo simulation, but the resulting method is computationally intensive in most cases of interest. To circumvent the computational expense, we employ a moment closure scheme that allows us to approximate the underlying covariances (and thus the mesoscopic fluxes) by first-order ordinary differential equations that are similar to the CKEs and can be solved by standard numerical techniques. We show that, at least for the quadratic autocatalator, this approximation leads to more accurate predictions of fluxes and mean concentration dynamics than the CKEs.

We should mention that, in a recent work (20Go), Samoilov et al. have used a simple example (an enzymatic futile cycle) to analytically and numerically demonstrate that extrinsic stochastic fluctuations in biochemical reaction systems may also produce dynamic behavior not accounted for by classical chemical kinetics. Our work is complementary to theirs, since it focuses on the effects of intrinsic stochastic fluctuations on system behavior. Moreover, it supports, both analytically and computationally, the general belief that stochastic fluctuations may play an important role in determining biological function in cells and, therefore, must be accounted for by computational models of cellular regulation.


    BIOCHEMICAL REACTION SYSTEMS
 TOP
 ABSTRACT
 INTRODUCTION
 BIOCHEMICAL REACTION SYSTEMS
 A QUADRATIC AUTOCATALATOR
 CONCLUSIONS
 APPENDIX A: STOCHASTIC CHEMICAL...
 APPENDIX B: UNIDIRECTIONAL...
 APPENDIX C: QUADRATIC...
 APPENDIX D: SCKE APPROXIMATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Deterministic description
In this article, we consider a well-stirred biochemical reaction system at thermal equilibrium that consists of M elementary (monomolecular or bimolecular) irreversible reaction channels. By assuming that the system contains N molecular species, we may characterize its state at time t ≥ 0 by an N x 1 deterministic vector q(t) whose dynamic evolution is governed by the following CKEs (1Go):

Formula 2(2)

In Eq. 2, Formula 2 is the N x M stoichiometry matrix of the underlying biochemical reactions. Moreover, {rho}(t) is an M x 1 (time-dependent) vector with elements {rho}m(t) = (1/V) d{xi}m(t)/dt, where {xi}m(t) is the extent of the mth reaction, defined as the amount (in moles) of a species produced or consumed by the mth reaction during the time interval [0,t), divided by the corresponding stoichiometric coefficient. Note that {rho}m(t) is the rate of change in the extent of reaction per unit volume at time t and, hence, it quantifies the reaction rate of the mth reaction. These parameters are frequently referred to as time-dependent fluxes (velocities of molecular flow) and play a fundamental role in the analysis of biochemical reaction systems (1Go). The mass action rate law implies that the mth element of {rho}(t) is given by

Formula 3(3)
where {kappa}m is the reaction rate constant of the mth reaction and {psi}m(q) is a product of the reactant concentrations of the mth reaction, given by

Formula 4(4)

Note that in classical chemical kinetics it is assumed that molecular concentrations are appreciably larger than 1/AV, in which case Formula 4 for bimolecular reactions with identical reactants. To account for the possibility that some molecular concentrations may be comparable to 1/AV, we set in this article {psi}m(q) = qn(qn – 1/AV).

Although it is commonly believed that q(t) provides a sufficient approximation to the mean concentration vector u(t), given by Eq. 1, because of stochastic fluctuations in biochemical activity, this may not be true. Moreover, the derivation of Eq. 2 requires that the numbers of molecules in the system are very large compared to 1. Otherwise, q(t) will not be a continuous function of t and differentiation of q(t) will not be possible. Therefore, we may not be able to justify the CKEs when modeling biochemical reaction systems with appreciable stochastic fluctuations and small numbers of reactant molecules. In view of the fact that reactions occur by random collisions of reactant molecules, it is intuitive to believe that it will be more appropriate if we employ a stochastic approach.

Stochastic description
If we use a stochastic biochemical reaction system to model cellular regulation in single cells, then the mean molecular concentrations u(t), given by Eq. 1, will satisfy the system of first-order differential equations

Formula 5(5)
where {nu}(t) is an M x 1 (time-dependent) flux vector with elements {nu}m(t) = (1/V) d{chi}m(t)/dt, and {chi}m(t) is the extent of the mth reaction at time t, which is now defined as the mean degree of advancement (DA) of the mth reaction at time t divided by the Avogadro number (see Appendix A for a brief review of stochastic chemical kinetics). Notably, the flux {nu}m(t) of the mth reaction is the rate of change in its mean DA per unit volume divided by the Avogadro number.

Equation 5 is similar to the CKEs with one important difference. The mth element {nu}m(t) of the flux vector {nu}(t) is now given by

Formula 6(6)
where

Formula 7(7)
as in the deterministic case, and (see Appendix A)

Formula 8(8)

In Eq. 8, {eta}m,kl is the second-order partial derivative of the propensity function of the mth reaction with respect to the DAs of the kth and lth reactions, whereas, vZ, kl(t) is the covariance between the DAs of those reactions. We refer to {rho}m(t) as the macroscopic flux and to {theta}m(t) as the mesoscopic flux of the mth reaction. Clearly, the macroscopic flux accounts for the effect of mean reactant concentrations on the rate of product synthesis, whereas the mesoscopic flux accounts for the effect of statistical correlations among interacting reactions on that rate. Because the propensity functions are at most quadratic functions of the DAs (see Appendix A), the mesoscopic flux of a given reaction depends only on the (nonzero) Hessian elements of the propensity function of that reaction and on the corresponding DA covariances. As a consequence of Eq. 8, the mesoscopic flux of a monomolecular reaction will be zero, since the propensity function of such reaction will be linear and the corresponding Hessian matrix will be zero (i.e., {eta}m,kl = 0, for every k, l). However, this may not be true for the mesoscopic flux of a bimolecular reaction, whose value will depend on the DA covariances between reactions that affect the molecular population of one reactant species and reactions that affect the population of the other reactant species.

If all reactions are monomolecular, the biochemical reaction system will be linear (i.e., all propensity functions will be linear). In this case, the second-order partial derivatives of the propensity functions with respect to the DAs will be zero, which, together with Eq. 8, implies that {theta}m(t) = 0, for every m (since {eta}m,kl = 0, for every m, k, l). Hence, when all reactions are monomolecular, Eq. 5 is identical to Eq. 2 of classical chemical kinetics. This result was derived in Gillespie (21Go).

Equations 58 extend the classical CKEs 2 and 3 to account for intrinsic stochastic fluctuations in biochemical activity and are an exact consequence of the conservation of mass and the CME underlying the biochemical reaction system (Eqs. 21 and 27). Note that

Formula 9(9)
where

Formula 10(10)

We refer to these equations (and Eqs. 58) as statistical chemical kinetics equations (SCKEs), where we use the term "statistical" to emphasize that the equations account for correlations among reactions. Like the CKEs, the SCKEs provide a macroscopic description of a biochemical reaction system. However, this description is now controlled by the mesoscopic behavior of the system through the forcing term {varepsilon}(t). If {varepsilon}(t) were known for every t ≥ 0, then we could evaluate u(t) by integrating Eq. 9 using standard numerical techniques. However, this is not true and u(t) must be estimated by computationally intensive Monte Carlo simulation using the Gillespie algorithm (22Go). In an effort to circumvent this computational expense, we later discuss a method that allows us to approximately evaluate u(t) by numerically integrating an appropriately derived system of first-order ordinary differential equations.

Deterministic versus stochastic description
Equation 9 reveals that intrinsic stochastic fluctuations may influence the mean concentration dynamics u(t) through the mesoscopic forcing term {varepsilon}(t). This fact is not considered by the CKEs and its importance should not be underestimated. If, for some species n, {varepsilon}n(t) != 0, for every t ≥ 0, then at least one function {psi}m will be quadratic, where m is a reaction that consumes or produces the nth molecular species, and the nth SCKE will therefore be nonlinear. Indeed, if {psi}m is linear, for any reaction m that consumes or produces the nth molecular species, then its second-order partial derivative {eta}m,kl will be zero, for every k, l, which implies that {theta}m(t) = 0, by virtue of Eq. 8. Since the mesoscopic forcing term {varepsilon}n(t) is given by Eq. 10, this implies that {varepsilon}n(t) = 0 for every t ≥ 0, which contradicts our assumption that {varepsilon}n(t) != 0 for every t ≥ 0. By combining this observation with the fact that a nonzero forcing term may substantially affect the solution of a nonlinear differential equation, we may expect that nonzero mesoscopic forcing terms could appreciably affect the mean behavior of a nonlinear biochemical reaction system.

It is clear from our previous discussion that we may use the CKEs to characterize the mean concentration dynamics if and only if, at any time t ≥ 0, the mesoscopic flux vector {theta}(t) is in the null space of the stoichiometry matrix Formula 10. In this case, Formula 10, for every t ≥ 0, and the SCKEs will be reduced to the CKEs. Eq. 8 suggests that this will happen if all underlying reactions are monomolecular (see also (18Go,21Go)). We may also use the CKEs when all covariances are zero, a condition that will be satisfied in the thermodynamic limit. However, most biochemical reaction systems of interest are nonlinear and there is no way to know a priori whether the covariances are zero or, more generally, whether {theta}(t) is in the null space of Formula 10. Therefore, to account for the influence of nonzero mesoscopic fluxes on system dynamics, we must include them in the formulation.

Numerical example
To provide a simple illustration of our discussion so far, we consider the following unidirectional dimerization reaction,

Formula 11(11)
with specific probability rate constant c1, initialized with S molecules P and S molecules Q (see Appendix B for details). In Fig. 1, we depict the normalized (with respect to the steady-state dimer concentration s = S/AV) dimer concentration and flux dynamics, predicted by the underlying SCKE (solid lines) versus the ones predicted by the corresponding CKE (dotted lines), for S = 1, in Fig. 1 A, and S = 10, in Fig. 1 B. We also depict the dynamics of the normalized mesoscopic forcing term. We have estimated the concentrations, fluxes, and forcing terms by Monte Carlo simulation using the Gillespie algorithm (22Go,23Go), and calculated the CKE concentrations and fluxes analytically (see Eq. 38). It turns out that, as t->{infty}, both models converge to the same steady-state concentration s.


Figure 1
View larger version (23K):
[in this window]
[in a new window]

 
FIGURE 1  Normalized dimer accumulation in the unidirectional dimerization reaction, given by Eq. 11, predicted by the SCKEs (solid lines) and CKEs (dotted lines). The dynamics obtained by the SCKEs have been computed by Monte Carlo simulation using the Gillespie algorithm, whereas the dynamics obtained by the CKEs have been computed analytically from Eq. 38. The system is initialized with (A) one molecule P and one molecule Q, (B) 10 molecules P, and 10 molecules Q. The associated normalized flux and mesoscopic forcing term dynamics are depicted as well. Parameters used are c1 = 10–3 s–1, V = 2 pL, and K = 6000 cells.

 
When the initial number of reactant molecules is very small (e.g., when S = 1 in Fig. 1 A), the CKE concentration dynamics do not match the SCKE dynamics obtained by Monte Carlo simulation. According to the results depicted in Fig. 1 A, small differences in flux dynamics may lead to substantial differences in concentration dynamics. However, a sufficient increase in the initial number of reactant molecules (e.g., by tenfold in Fig. 1 B) may drastically alleviate this difference. For sufficiently large S (S ≥ 100) the flux and concentration dynamics are virtually identical (data not shown).

The observed differences are due to the mesoscopic forcing term, which coincides in this case with the mesoscopic flux of the reaction. Since all reactants are eventually transformed into dimers, the mesoscopic forcing term tends to zero as t->{infty}. Its magnitude and rate of convergence to zero affect the SCKE concentration dynamics and the time it takes for the system to reach steady state. Because the flux is given by Eq. 37, larger values of the mesoscopic forcing term will promote faster reaction rates and thus faster relaxation to steady state (for this example, the mesoscopic forcing term is nonnegative). Fig. 1 shows that, when S = 1, the mesoscopic forcing term converges to zero slower than when S = 10. Our simulations show that the response predicted by the SCKE reaches steady state at ~2 h, whereas the response predicted by the CKE requires substantially more time (~24 h) to reach steady state. This example provides an analytical justification, by means of the mesoscopic forcing term, of a previously recognized fact that intrinsic stochastic fluctuations in biochemical activity may produce quantitative differences between the transient concentration dynamics predicted by classical and stochastic chemical kinetics (15Go,18Go).


    A QUADRATIC AUTOCATALATOR
 TOP
 ABSTRACT
 INTRODUCTION
 BIOCHEMICAL REACTION SYSTEMS
 A QUADRATIC AUTOCATALATOR
 CONCLUSIONS
 APPENDIX A: STOCHASTIC CHEMICAL...
 APPENDIX B: UNIDIRECTIONAL...
 APPENDIX C: QUADRATIC...
 APPENDIX D: SCKE APPROXIMATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Although the dimer concentration dynamics predicted by the CKE of the reaction equation of the previous example (Eq. 11) may follow a different trajectory than the dynamics predicted by the corresponding SCKE, eventually the two trajectories reach the same steady state. We will now show that this may not be necessarily true. To do so, we turn to a more complex example, governed by the following six reactions:

Formula 12(12)

These reactions convert substrate molecules S into proteins Q. An intermediate protein P is first produced by Reaction 1 and, subsequently, by Reaction 2 via transcription and translation in which P acts as a transcription factor to promote its own synthesis from a DNA template D. P is then transformed into Q via the intermolecular reactions 3 and 4, with P and Q, respectively. Finally, Reactions 5 and 6 model degradation of P and Q. Due to Reactions 2 and 3, we refer to this system as quadratic autocatalator with positive feedback, since Reaction 3 is autocatalytic with quadratic concentration dependence (see also (24Go)) and Reaction 2 applies (positive) feedback on the synthesis of P. The resulting system is similar to a reaction cascade considered in Kaufman et al. (25Go), which involves the autophosphorylation of protein tyrosine kinase activity in T cell stimulation, obtained by ignoring all dephosphorylation reactions. This simplification leads to a biologically relevant example, which allows us to analytically demonstrate that intrinsic stochastic fluctuations may appreciably affect, both qualitatively and quantitatively, the stationary behavior of a biochemical reaction system. We could use more complicated reaction schemes (e.g., schemes that involve phosphorylation/dephosphorylation, transcription, translation, etc.), but it would not be possible to proceed analytically.

Quantitative stationary behavior
The steady-state concentrations Formula 12 of P and Formula 12 of Q, predicted by the SCKEs associated with the quadratic autocatalator with feedback are given by (see Appendix C for details)

Formula 13(13)

Formula 14(14)
where

Formula 15(15)
provided that {alpha} > 0 and Formula 15 (see also the first row of Table 1). In these equations, s = S/AV and d = D/AV, where S, D are the numbers of S and D molecules, respectively. Moreover, Formula 15 is the sum of the mesoscopic fluxes of the third and fourth reactions at steady state, which depends on s, and the {kappa}-values are the reaction rate constants, given by Eq. 42. We use the notation Formula 15, Formula 15, and Formula 15 to explicitly denote that the stationary quantities Formula 15, Formula 15, and Formula 15 depend on the input substrate concentration s. By setting Formula 15 in Eq. 13 (i.e., by ignoring the mesoscopic fluxes), we obtain the steady-state concentrations predicted by the CKEs (see Table 2).


View this table:
[in this window]
[in a new window]

 
TABLE 1  Stationary concentrations of P, predicted by the SCKEs, in the quadratic autocatalator with feedback

 

View this table:
[in this window]
[in a new window]

 
TABLE 2  Stationary concentrations of P, predicted by the CKEs, in the quadratic autocatalator with feedback

 
According to Eq. 14, when {kappa}5 = {kappa}2d, the steady-state concentration Formula 15 of Q predicted by the SCKEs will be identical to the one predicted by the CKEs; this concentration is given by {kappa}1s/{kappa}6. However, this may not be true for the steady-state concentration Formula 15 of P, since this concentration depends on Formula 15, according to Eq. 13. If Formula 15, then Eqs. 13 and 14 imply that Formula 15 (this is also true when {kappa}5 != {kappa}2d; see Appendix C), in which case, both models will asymptotically (as s ->{infty}) reach the same steady-state concentration for P as well. However, for finite input substrate concentrations, we may not be able to ignore the steady-state mesoscopic forcing term Formula 15, in which case the steady-state concentration of P predicted by the SCKEs will be different than the one predicted by the CKEs, with the difference being controlled by the sign and magnitude of Formula 15. This is illustrated in Fig. 2, which depicts the concentration dynamics of P and Q and the flux dynamics of the third and fourth reactions predicted by the SCKEs (solid lines), estimated by Monte Carlo simulation using the Gillespie algorithm, and the CKEs (dotted lines), obtained numerically. In this case, the steady-state P concentration predicted by the CKEs is larger than the one predicted by the SCKEs, since Formula 15. Note the quantitative differences between the transient mean concentration dynamics and fluxes. As a matter of fact, the CKEs wrongly predict that the mean concentration of Q will be zero during the first minute, whereas the SCKEs predict a gradual increase in the mean concentration of Q from 0 pM to ~0.033 pM.


Figure 2
View larger version (22K):
[in this window]
[in a new window]

 
FIGURE 2  Protein accumulation in the quadratic autocatalator, given by Eq. 12, for the case when {kappa}5 = {kappa}2d, initialized with 10 molecules S (concentration of 8.30 pM), two molecules D (the number of DNA copies of a particular gene per eukaryotic cell), and zero molecules P and Q, predicted by the SCKEs (solid lines) and CKEs (dotted lines). The dynamics obtained by the SCKEs have been computed by Monte Carlo simulation using the Gillespie algorithm, whereas the dynamics obtained by the CKEs have been computed numerically. The flux dynamics of the third and fourth reactions are depicted as well. Parameters used are c1 = 0.002 s–1, c2 = 0.001 s–1, c3 = 0.005 s–1, c4 = 0.004 s–1, c5 = 0.002 s–1, c6 = 0.05 s–1, V = 2 pL, and K = 10,000 cells. Although the steady-state concentration of Q predicted by the CKEs is theoretically identical to the one predicted by the SCKEs, this is not true for the concentration of P. The dashed lines indicate the mean concentration and flux dynamics predicted by the second-order SCKEs discussed in this article.

 
Similar remarks apply when {kappa}5 != {kappa}2d. However, the steady-state concentrations of both P and Q depend now on Formula 15 (the concentration of Q depends on Formula 15 through the concentration of P; recall Eq. 14) and the CKEs may not provide good approximations at finite input substrate concentrations. We illustrate this case in Fig. 3.


Figure 3
View larger version (22K):
[in this window]
[in a new window]

 
FIGURE 3  Protein accumulation and flux dynamics in the quadratic autocatalator, given by Eq. 12, for the case when {kappa}5 > {kappa}2d. The parameters used are the same as in Fig. 2, but now c5 = 0.006 s–1. In this case, the steady-state concentrations of P and Q predicted by the CKEs (dotted lines) are both different than the actual steady-state concentrations predicted by the SCKEs (solid lines). The dashed lines indicate the mean concentration and flux dynamics predicted by the second-order SCKEs discussed in this article.

 
Our previous investigation shows that intrinsic stochastic fluctuations may produce appreciable quantitative differences between the stationary behavior of a biochemical reaction system predicted by classical chemical kinetics and the stationary behavior predicted by stochastic chemical kinetics. These differences are caused by nonzero mesoscopic forcing terms at steady state, which may influence stationary molecular concentrations and appreciably affect their values.

The stationary behavior of biochemical activity may affect cells in a biologically significant way. For example, it has been suggested that concentrations of regulatory proteins synthesized at steady state may be responsible for a cell's unique characteristics (phenotype) (26Go). As a consequence, the previous analytical and numerical investigations show that intrinsic stochastic fluctuations may quantitatively affect the epigenetic properties of cell regulation in a manner not accounted for by classical chemical kinetics. In addition, we show in the following that nonzero stationary mesoscopic forcing terms may influence the steady-state properties of a biochemical reaction system in a qualitative way, thus demonstrating the fact that intrinsic stochastic fluctuations may play a significant role in influencing cellular function.

Qualitative stationary behavior
In the quadratic autocatalator with positive feedback, the stationary concentration of P predicted by the SCKEs depends on the signs of parameters {alpha} and β(s), given by Eq. 15, and the value of the input flux {kappa}1s as compared to the value of the steady-state mesoscopic forcing term Formula 15. The resulting concentrations are summarized in Table 1 for {alpha} > 0 (similar results hold for {alpha} < 0). In particular, if Formula 15, the system has a unique stable stationary P concentration Formula 15, given by Eq. 13, regardless of the value of β(s). The situation, however, is different when Formula 15. If Formula 15 and β(s) ≥ 0, the system relaxes to a zero P concentration at steady state, whereas, if Formula 15 and β(s) ≥ 0, the system has no stationary P concentration. However, if Formula 15 and β(s) < 0, the system has two stationary P concentrations Formula 15 and Formula 15, given by

Formula 15
and

Formula 15

Note that Formula 15, with the two concentrations being equal when the input substrate concentration is set to Formula 15, where Formula 15 satisfies Formula 15 = Formula 15.

On the other hand, although the stationary concentration of P predicted by the CKEs still depends on the signs of parameters {alpha} and β(s), this concentration does not depend on the input flux values {kappa}1s but only on whether or not the input substrate concentration is zero. The resulting concentrations are summarized in Table 2, for {alpha} > 0. When s = 0 and β(0) ≥ 0, the CKEs predict zero stationary P concentration, whereas, when s = 0 and β(0) < 0, the CKEs predict two stationary P concentrations, Formula 15 and Formula 15, with the former being stable and the latter unstable. However, when s > 0, the CKEs predict a unique steady-state P concentration, regardless of the value of s and β(s), which is the same as the concentration Formula 15 predicted by the SCKEs, given by Eq. 13 with Formula 15.

To illustrate the previous analytical results and demonstrate their biological significance, we depict in Fig. 4 the stationary concentration of P as a function of the input substrate concentration s, predicted by the SCKEs (solid lines), estimated by Monte Carlo simulation using the Gillespie algorithm, and by the CKEs (dotted lines), obtained analytically. In Fig. 4 A, Formula 15, for every s > 0, in which case, the steady-state response curve predicted by the SCKEs will be given by Formula 15. The response curve Formula 15, predicted by the CKEs, is similar to the one predicted by the SCKEs, with Formula 15, since Formula 15. The situation, however, is very different in Fig. 4 B, in which Formula 15, for Formula 15, and Formula 15, for Formula 15, where Formula 15 is the input substrate concentration that satisfies Formula 15. In this case, the steady-state response curve predicted by the SCKEs is obtained by stitching together three stable stationary P concentrations, namely, Formula 15, for 0 ≤ s ≤ Formula 15 ~= 22.5 pM, Formula 15, for Formula 15 ≤ s ≤ Formula 15, and Formula 15, for s ≥ Formula 15. Note that the slope of Formula 15 is larger than the slope of Formula 15, whereas, the slope of Formula 15 tends to zero as s ->{infty}. As a consequence, and similarly to the behavior shown in Fig. 4 A, the system experiences appreciable protein amplification at low input substrate concentrations (i.e., for s ≤ Formula 15; see the open region in Fig. 4 B), a moderate amplification at intermediate input concentrations (i.e., for Formula 15—see the light shaded region in Fig. 4 B), and diminishing amplification at high input concentrations (i.e., for Formula 15; see the dark shaded region in Fig. 4 B). This behavior is essential to guarantee that, besides its normal operational range, the system responds quickly to low input substrate concentrations (high amplification) but very slowly to high concentrations (saturation). Note that the steady-state response curve predicted by the CKEs increases abruptly from 0 pM to ~0.95 pM at s = 0, thus failing to capture the previous "multistage" amplification property. However, since Formula 15, we have that Formula 15, and the two response curves predicted by the CKEs and the SCKEs will eventually coincide for a sufficiently large input substrate concentration.


Figure 4
View larger version (33K):
[in this window]
[in a new window]

 
FIGURE 4  The input flux k1s versus the stationary mesoscopic forcing term Formula 15, the stationary concentration of P as a function of s, predicted by the SCKEs (solid lines) and CKEs (dotted lines), and the ratio Formula 15 associated with the quadratic autocatalator, given by Eq. 12. The steady-state values obtained by the SCKEs have been computed by Monte Carlo simulation using the Gillespie algorithm, whereas the values obtained by the CKEs have been computed analytically. The system is initialized with two molecules D (the number of DNA copies of a particular gene per eukaryotic cell), and zero molecules P and Q. Parameters used are (A) c1 = 0.002 s–1, c2 = 0.0005 s–1, c3 = 0.005 s–1, c4 = 0.004 s–1, c5 = 0.004 s–1, c6 = 0.05 s–1, and (B) c1 = 0.0004 s–1, c2 = 0.02 s–1, c3 = 0.05 s–1, c4 = 0.04 s–1, c5 = 0.01 s–1, and c6 = 0.05 s–1. Moreover, V = 2 pL and K = 8000 cells. The heavy bold line in the middle figure depicts the steady-state response curve of P, calculated by Monte Carlo simulation using the Gillespie algorithm.

 
As a consequence of the previous investigations, intrinsic stochastic fluctuations may appreciably affect the qualitative properties of a biochemical reaction system at steady state. This can be analytically explained by the presence of nonzero mesoscopic forcing terms, which are responsible for introducing novel modes of behavior not accounted for by classical chemical kinetics. The significance of these modes should not be underestimated, since they may introduce behavior at low molecular concentrations that is essential for proper biological function.

Second-order SCKE approximation
It is unfortunate that we cannot compute the mesoscopic forcing term {varepsilon}(t) analytically. As a consequence, we cannot use standard numerical techniques to solve the SCKEs 9 (or Eqs. 58). Instead, we resort to Monte Carlo simulations using the Gillespie algorithm. However, to obtain accurate Monte Carlo estimates of mean concentration dynamics and fluxes, we need to uniformly sample the system state for a large number of DA trajectories. (Note that the variance of a Monte Carlo estimator with uniform sampling is ~1/K, where K is the number of samples used.) This approach is computationally intensive and especially burdensome when the biochemical reaction system is large and highly reactive.

To circumvent the computational expense of Monte Carlo simulation, we can approximate the SCKEs 5–8 by a system of first-order ordinary differential equations, which we can solve efficiently by the same numerical techniques we use to solve the CKEs. As a matter of fact, we can approximate the mean molecular concentrations u(t) by concentrations Formula 15 that satisfy the system of differential equations (see Appendix D) as

Formula 16(16)
with

Formula 17(17)
where the mth elements of Formula 17 and Formula 17 are given by

Formula 18(18)

Formula 19(19)

In Eq. 19, the terms Formula 19 approximate the DA covariances vZ,mm'(t) and satisfy the system of first-order ordinary differential equations,

Formula 20(20)
where {delta}mm' is the Krönecker delta given by Eq. 31, and {zeta}m,k is the first-order partial derivative of the propensity function of the mth reaction with respect to zk, given by

Formula 20

For reasons explained in Appendix D, we refer to Eqs. 1620 as second-order SCKEs.

We illustrate the quality of approximation obtained by the second-order SCKEs in Figs. 2 and 3, for the case of the quadratic autocatalator with feedback. For this example, the second-order SCKEs provide excellent approximations (dashed lines) of the mean concentration dynamics predicted by the CME (solid lines), which are clearly better than the approximations obtained with the CKEs (dotted lines). We also show in Fig. 5 that, by using the second-order SCKEs, we can obtain very good approximations of the dynamic evolutions of the coefficients of variation (CVs) associated with the intrinsic stochastic fluctuations in P and Q concentrations. (The CVs provide a measure of the relative dispersion, i.e., size, of stochastic fluctuations in the concentration of a molecular species from the mean value; see Appendix D.) It is clear that calculation of CVs is not possible with the CKEs. The reader may also refer to Fig. 2 and Fig. 6 in Goutsias (23Go) for results obtained with a more complex biological system, which includes transcription, translation, protein dimerization, and molecular degradation. Therefore, in addition to satisfactorily approximating the mean concentration dynamics, we may use the second-order SCKEs to characterize intrinsic fluctuations in a stochastic biochemical reaction system by approximating CV dynamics.


Figure 5
View larger version (11K):
[in this window]
[in a new window]

 
FIGURE 5  CV dynamics in the quadratic autocatalator, given by Eq. 12, associated with intrinsic stochastic fluctuations in the concentrations of P and Q, for the case when {kappa}5 = {kappa}2d, in panel A, and {kappa}5 > {kappa}2d, in panel B, predicted by the exact SCKEs (solid lines) and second-order SCKEs (dashed lines). The dynamics obtained by the exact SCKEs have been computed by Monte Carlo simulation using the Gillespie algorithm, whereas the dynamics obtained by the second-order SCKEs have been computed numerically. The parameters used are the same as in Figs. 2 and 3.

 

Figure 6
View larger version (22K):
[in this window]
[in a new window]

 
FIGURE 6  Absolute relative errors in the steady-state concentrations of P and Q associated with the quadratic autocatalator, given by Eq. 12, as a function of the input substrate concentration. (Solid lines) Second-order SCKEs with respect to the exact SCKEs; (dotted lines) CKEs with respect to the exact SCKEs; and (dashed lines) CKEs with respect to the second-order SCKEs. The steady-state values obtained by the exact and second-order SCKEs have been respectively computed by Monte Carlo simulation using the Gillespie algorithm and numerically, whereas the values obtained by the CKEs have been computed analytically from Eq. 44. The system is initialized with two molecules D (the number of DNA copies of a particular gene per eukaryotic cell), and zero molecules P and Q. Parameters used are c1 = 0.002 s–1, c2 = 0.001 s–1, c3 = 0.005 s–1, c4 = 0.004 s–1, c6 = 0.05 s–1, V = 2 pL, and K = 8000 cells. Moreover, c5 = 0.002 s–1 in panel A, and c5 = 0.006 s–1 in panel B.

 
Extensive simulations reveal that the quadratic autocatalator can be approximated very well by the second-order SCKEs for a wide range of parameter values and molecular concentrations (data not shown). One notable exception is at very low substrate concentrations s, in which case the biochemical reaction system will contain a very small number of molecules. We illustrate this in Fig. 6, which depicts the absolute relative errors in the steady-state mean concentrations of P and Q predicted by the second-order SCKEs with respect to the exact SCKEs (solid lines), by the CKEs with respect to the exact SCKEs (dotted lines), and by the CKEs with respect to second-order SCKEs (dashed lines), for the case when {kappa}5 = {kappa}2d, in Fig. 6 A, and {kappa}5 > {kappa}2d, in Fig. 6 B. Clearly, the second-order SCKEs provide consistently good and better approximations than the CKEs. As expected, when {kappa}5 = {kappa}2d, the errors in approximating the steady-state Q concentration are zero. Moreover, the errors in the concentrations of P and Q predicted by the two models gradually diminish for large input substrate concentrations. Note however that the accuracy of the second-order SCKEs decreases at very small input substrate concentrations.

A good match between the predictions obtained by the second-order SCKEs and the ones obtained by the exact SCKEs indicates that the mean and covariances provide a sufficient description of intrinsic stochastic fluctuations, in which case, the molecular distributions will approximately follow a Gaussian distribution. However, the observation that, at very small input substrate concentrations, the second-order SCKEs may not sufficiently approximate the dynamics obtained by the exact SCKEs strongly suggests that higher-order (≥3) central moments may play a significant role in determining these dynamics. In this case, the underlying reactions will be subject to appreciable higher-order statistical interactions and the underlying probability distributions will not be Gaussian. Recent findings suggest that molecular distributions are often non-Gaussian and that such distributions may play an important role in cellular regulation (27Go,28Go). In those cases, it will be necessary to derive higher-order SCKE approximations, by including higher-order moments in the formulation (see our discussion in Appendix D).


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 BIOCHEMICAL REACTION SYSTEMS
 A QUADRATIC AUTOCATALATOR
 CONCLUSIONS
 APPENDIX A: STOCHASTIC CHEMICAL...
 APPENDIX B: UNIDIRECTIONAL...
 APPENDIX C: QUADRATIC...
 APPENDIX D: SCKE APPROXIMATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
In this article, by adopting a general framework for modeling the macroscopic behavior of a biochemical reaction system consisting of elementary irreversible reactions, we have shown that a classical chemical kinetics approach to modeling biochemical reaction systems may not be appropriate. The flux of each reaction is decomposed into two terms, a macroscopic term that accounts for the effects of mean molecular concentrations on the macroscopic behavior of the system and a mesoscopic term that accounts for the effects of pairwise correlations among reactions. Based on this decomposition, we may characterize a biochemical reaction system by a system of exact first-order ordinary differential equations, the SCKEs, which provide a straightforward extension to the classical CKEs. The SCKEs require use of mesoscopic forcing terms, obtained by linearly transforming the mesoscopic fluxes through the stoichiometry matrix, whose calculation requires computationally expensive Monte Carlo simulations or evaluation of correlation dynamics among pairs, triplets, quadruplets, and larger groups of biochemical reactions.

To avoid such calculations, we have focused on a second-order approximation to the SCKEs, which includes only first- and second-order reaction statistics (i.e., means and pairwise correlations). These equations can be solved by standard numerical procedures and may lead to versatile tools for the analysis of biochemical reaction systems, similar to the ones used in classical chemical kinetics. Notably, a first-order approximation to the SCKEs produces the equations of classical chemical kinetics.

Our analysis indicates that pairwise correlation effects may lead, through mesoscopic forcing terms, to a dynamic behavior not accounted for by classical chemical kinetics. Numerical analysis of a quadratic autocatalator with positive feedback shows that the proposed second-order approximation faithfully reproduces system behavior, for a wide range of molecular concentrations and kinetic parameters. The success of this approximation demonstrates that the second-order SCKEs may provide substantial simplification in describing and analyzing stochastic biochemical reaction systems. Moreover, it suggests that pairwise statistical interactions among reactions may be sufficient for determining biological function and supports the use of multivariate Gaussian distributions for modeling biochemical reactions. However, this may not be true at very low molecular concentrations in which case higher-order approximations may be necessary. The need to include higher-order moments in the approximation highlights the importance of higher-order interactions among biochemical reactions and the inappropriateness of Gaussian modeling at very low molecular concentrations.


    APPENDIX A: STOCHASTIC CHEMICAL KINETICS
 TOP
 ABSTRACT
 INTRODUCTION
 BIOCHEMICAL REACTION SYSTEMS
 A QUADRATIC AUTOCATALATOR
 CONCLUSIONS
 APPENDIX A: STOCHASTIC CHEMICAL...
 APPENDIX B: UNIDIRECTIONAL...
 APPENDIX C: QUADRATIC...
 APPENDIX D: SCKE APPROXIMATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Since biochemical reactions occur by random collisions of reactant molecules, the number of molecules of a particular species present in the system at time t may fluctuate randomly. It is therefore appropriate to characterize the state of a biochemical reaction system at time t by an N x 1 random vector X(t) whose nth element Xn(t) is the number of molecules of the nth species present in the system at time t. In addition, we may use the degree of advancement (DA) Zm(t) to describe the (random) progress of the mth reaction during the time interval [0, t), where Zm(t) = z ≥ 0 means that the mth reaction has occurred z times during the time interval [0, t) (29Go). Note that, due to conservation of mass, we can uniquely determine X(t) from the M x 1 random vector Z(t) with elements Zm(t), m = 1, 2, ..., M, since

Formula 21(21)

Recall that our objective is to model the dynamic evolutions of molecular concentrations in a tissue containing a large population of cells by the N x 1 vector u(t) given by Eq. 1. By taking expectations on both sides of Eq. 21 and by dividing with AV, we obtain

Formula 22(22)
where u(0) = x(0)/AV and µZ(t) = E[Z(t)]. If we assume that the mean DA µZ(t) is differentiable with respect to t (see below why this is true), then, by differentiating both sides of Eq. 22, we obtain Eq. 5. The average reaction rate (flux) {nu}m(t) of the mth reaction is given by

Formula 23(23)
where {chi}m(t) is the average extent of the mth reaction defined as the mean DA of the reaction divided by the Avogadro number; i.e.,

Formula 24(24)

Computation of {nu}m(t) requires calculation of the derivative of the mean DA µZ,m(t) with respect to t. We show how to calculate this derivative in the following.

We denote by {varphi}m(x) the number of all possible distinct combinations of the reactant molecules of the mth reaction channel when the system is at state x, given by (compare with Eq. 4)

Formula 25(25)
We also denote by cm the specific probability rate constant of the mth reaction (i.e., the probability per unit time that a randomly chosen combination of reactant molecules will react through the mth reaction channel). Then, given that the biochemical reaction system is at state X(t) = x at time t, the probability that one mth reaction will occur during the time interval [t, t + dt) is {pi}m(x)dt + o(dt), for a sufficiently small dt, where o(dt) is defined so that o(dt)/dt -> 0, as dt -> 0, and

Formula 26(26)
is the propensity function of the mth reaction channel (30Go,31Go). Moreover, the probability that more than one reaction will occur during [t, t + dt) is o(dt).

If PZ(z;t) = Pr (Z(t) = z) is the probability that the DA vector Z(t) takes value z at time t, then (23Go,32Go,33Go)

Formula 27(27)
where em is the mth column of the M x M identity matrix, and

Formula 28(28)

This chemical master equation (CME) describes the dynamic evolution of the joint probability mass function of the DA process Z(t). As a consequence, we can show that the means µZ,m(t) and covariances vZ,mm'(t) of the DA process Z(t) satisfy the system of first-order ordinary differential equations (23Go),

Formula 29(29)

Formula 30(30)
for t ≥ 0, where {delta}mm' is the Krönecker delta, given by

Formula 31(31)

Note that the derivatives Z,m(t)/dt and dvZ,mm'(t)/dt always exist at finite times, regardless of the number of molecules present in the system, since the CME (Eq. 27) is valid only when the joint probability mass function PZ(z;t) is a continuous function of t, which in turn implies that the means µZ,m(t) and covariances vZ,mm'(t) are continuous functions of t as well.

If we expand the propensity function {alpha}m(z) by a Taylor series about the mean vector µz(t), we have

Formula 32(32)
where {zeta}m and Formula 32 denote the gradient vector (of the first-order partial derivatives with respect to z) and the Hessian matrix (of the second-order partial derivatives with respect to z) of {alpha}m(z), respectively, and T denotes vector (matrix) transposition. From Eqs. 25, 26, and 28, note that the propensity function {alpha}m is at most a quadratic function of the DAs. Therefore, its derivatives of order >2 are zero and Eq. 32 is exact. Moreover, the Hessian Formula 32 does not depend on z. By taking expectations on both sides of Eq. 32 and by using Eq. 29, we obtain

Formula 33(33)

In Eq. 33, {eta}m,kl is the (k, l) element of the Hessian matrix Formula 33 (i.e., the second-order partial derivative of the propensity function {pi}m(x) with respect to the DAs of the kth and lth reactions), given by

Formula 33
where {psi}m is given by Eq. 4 and

Formula 34(34)

Equation 34 relates the specific probability rate constants c with the reaction rate constants {kappa}. Equations 68 are now obtained from Eqs. 4, 2326, 28, 33, and 34.


    APPENDIX B: UNIDIRECTIONAL DIMERIZATION
 TOP
 ABSTRACT
 INTRODUC