help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

Originally published as Biophys J. BioFAST on May 19, 2006.
doi:10.1529/biophysj.106.081828
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
biophysj.106.081828v1
91/4/1235    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tanaka, R. J.
Right arrow Articles by Kimura, H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tanaka, R. J.
Right arrow Articles by Kimura, H.
Biophysical Journal 91:1235-1247 (2006)
© 2006 The Biophysical Society

Mathematical Description of Gene Regulatory Units

Reiko J. Tanaka, Hiroyuki Okano and Hidenori Kimura

Bio-Mimetic Control Research Center, RIKEN, Nagoya, Japan

Correspondence: Address reprint requests to H. Kimura, Bio-Mimetic Control Research Center, RIKEN, Shimo-shidami, Moriyama-ku, Nagoya 463-0003 Japan. Tel.: 81-52-736-5861; Fax: 81-52-736-5862; E-mail: kimura{at}bmc.riken.jp.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 COMPOUND CONTROL
 ILLUSTRATIVE EXAMPLE: LACTOSE...
 MATHEMATICAL DESCRIPTION OF...
 EXAMPLES
 CONCLUSION
 REFERENCES
 
Revealing the control mechanisms responsible for the cell's surprisingly well-organized functions should lead directly to a better understanding of how the cell adapts to extraordinarily changing environments. A general framework for describing models that can represent diverse biochemical regulatory functions systematically would help not only systematic interpretation of the various models proposed for certain systems but also further understanding of the general control mechanism and design principles underlying different biological systems. This article presents a unified mathematical framework for describing gene regulatory units. The proposed framework is fairly compatible with the classical control theoretical framework, so it should serve as a connecting bridge between engineering control theory and biological control mechanisms. It should also provide a unified view of different regulatory units and facilitate systematic comparison of different mathematical models proposed in a variety of literature.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 COMPOUND CONTROL
 ILLUSTRATIVE EXAMPLE: LACTOSE...
 MATHEMATICAL DESCRIPTION OF...
 EXAMPLES
 CONCLUSION
 REFERENCES
 
Theoretical approaches to molecular biology, mainly based on modeling of biological control mechanisms, have been increasingly appreciated and pursued with the aim of systematically uncovering the design principles of molecular systems (1Go–3Go). Although modeling studies of various specific biological systems have successfully provided insights into the structural properties of those systems, they lack systematic ways of constructing models even for the same biological phenomenon. On the one hand, this reflects the fact that models serve as formal and precise extractions of intrinsic mechanisms responsible for extremely complex living cell functions (4Go) and that the differences in models themselves and in their constructions are a natural consequence of focusing on different characteristics of interest for each study. On the other hand, too much freedom given to modelers in the construction of models leads to difficulty in comparing different models proposed for the same system by different researchers and based on different assumptions, resulting in some confusion and misunderstanding in the interpretation and construction of models. A general framework for describing models that can represent diverse biochemical regulatory functions systematically would help not only systematic interpretation of the various models proposed for certain systems but also further understanding of the general control mechanism and design principles underlying different biological systems. This in turn would promote better cooperation between modelers and experimentalists.

The similarity between biological systems and engineering systems has been pointed out from the viewpoint of well-designed architectures that achieve robustness (5Go,6Go). An argument is that both types of systems use tremendously complex regulatory mechanisms that might not be necessary for basic system functions under normal conditions but are necessary for robust functioning against unpredictable and complex external and internal disturbances and for other design specifications, such as noise rejection and efficiency. We further argue here that a unique and intrinsic characteristic of control in biological systems, compared to manmade control, is found in its simultaneous and effective administration of several appropriate regulations to perform a cell task in response to complex and quite often compound environmental changes. We call this characteristic mode of biological control, "compound control." It should be emphasized that the critical points of biological compound control lie in its enhanced flexibility. Different cell actions can be taken by the same control mechanism in response to compound environmental changes, and this characteristic can be considered robustness in the broad sense. The importance of control in molecular biology has been fully recognized by molecular biologists for a long time, ever since the pioneering works of Jacob and Monod (7Go). Revealing the control mechanisms responsible for the cell's surprisingly well-organized functions, some of which are the same as in engineering systems, should lead directly to a better understanding of how the cell adapts to extraordinarily changing environments (6Go) and, more generally, of the design principles of cellular systems, a primary interest of recent molecular biology.

Engineering control systems are a good example of advancement in systematic design and synthesis due to using a unified theoretical framework called "control theory." It is thus natural to search for a unified theory for biological control. If biological control mechanisms can be understood in a similar way as engineering control systems, i.e., based on a general theory, application of the principles will improve understanding of biological systems (8Go). Several studies have already shed light on control theoretical interpretations of certain biological control systems and have provided important insights into their regulatory mechanisms (8Go–11Go). Such mathematical analysis of biological control in terms of control theory is, however, still in its infancy in terms of generality.

As a first step toward building a general theory of biological control systems, we propose a unified mathematical framework for describing gene regulatory units; a rich theory can then be built on this base to help in revealing fundamental design principles of biological control systems. The framework is proposed from the compound control viewpoint and is fairly compatible with the classical control theoretical framework. We thus believe that it will serve as a connecting bridge between engineering control theory and biological control mechanisms. It should also provide a unified view of different regulatory units and facilitate systematic comparison of different mathematical models proposed in a variety of literature.

This article is organized as follows. Compound Control Section describes the concept of compound control in more detail, providing the base for the notion of gene regulatory units we will formulate. Illustrative Example: Lactose Utilization Network Section uses a classic system, a lactose utilization network, as an illustrative example to provide a flavor of the general framework introduced in Mathematical Description of Regulatory Units Section. This general description of regulatory units, the main product of this article, is clarified with various examples in the last section.


    COMPOUND CONTROL
 TOP
 ABSTRACT
 INTRODUCTION
 COMPOUND CONTROL
 ILLUSTRATIVE EXAMPLE: LACTOSE...
 MATHEMATICAL DESCRIPTION OF...
 EXAMPLES
 CONCLUSION
 REFERENCES
 
Biological control has a characteristic feature that is common in different levels of living organisms (from cellular to neural), compared to manmade control: the simultaneous and effective administration of several appropriate regulations in response to complex and quite often compound environmental changes to perform a task. There is a correspondence, which we call a task, between environmental changes and the actions required to achieve a behavior, which is a unit of activities with a definite purpose evoked by environmental changes. Each action results from the successive administration of appropriate regulations. It is worth noting that basically all regulations in biological control result from successive occurrences of simple control actions, such as firings of neurons at neural level and protein-protein or protein-DNA interactions (PPIs or PDIs) at cellular level. Spatial and temporal combinations of firings in different neurons and of different PPIs or PDIs can result in a tremendous variety of regulations that correspond to compound environmental changes. Homogeneity of regulatory elements, i.e., firings of neurons and PPIs or PDIs, is also a characteristic feature of biological control. The combination of simple homogeneous regulatory mechanisms adaptively creates surprisingly elaborate, heterogeneous, and thus complex regulations corresponding to the compound environmental changes that biological systems experience. We call this characteristic mode of biological control, common to different levels of living organisms (from cellular to neural) and which achieves smooth and versatile adaptation to compound environmental changes, "compound control." It is worth noting that the phrase "compound control" is used in brain science in a slightly different but essentially same context (12Go).

This article is concerned with compound control at the cellular level, whose good example can be found in the classic diauxie, manifested when cultures of Escherichia coli are grown in mixtures of two carbon sources, such as glucose and lactose (13Go). The diauxie is not apparent in the presence of only a single carbon source. In the presence of both glucose and lactose, the cells use glucose first as a carbon source and then use lactose when the glucose runs out, in principle. Transcription of lac operons coding for the enzymes necessary for lactose catabolism is repressed during the first phase when only glucose is used. The repression of lac transcription is thus precisely regulated based on the availability of both glucose and lactose, resulting in the efficient use of available carbon sources. In other words, the cellular control mechanism for lac transcription is so well-designed that it can complete the cell task at hand: namely, efficient catabolism of lactose, despite compound environmental changes, and the availability of both glucose and lactose, by simultaneous and effective administration of multiple PPI- and PDI-based regulations for transcriptions.

This article aims to describe a general framework for systematic description of a compound control scheme at the cellular level that will hopefully help in revealing the fundamental design principles of biological systems. The arguments above suggest that the key to revealing the complexity and uniqueness of biological control at the cellular level lies in understanding the PPI- and PDI-based regulatory actions in response to compound environmental changes. We will consider operons in particular as central physical entities responsible for compound control, since compound environmental changes are, in most cases, processed at the transcription level where the activity of most bacterial promoters is determined depending on multiple environmental cues, except for some rare cases where regulatory proteins can integrate multiple signals (14Go). Transcriptional regulations are frequently the main control mechanisms for realizing cell tasks by initiating intracellular biochemical processes in response to compound environmental changes. Given a cell task, defined as the correspondence between environmental changes and the required actions, we can specify an elementary building block, which we call a "regulatory unit" and for which we provide a mathematical description. We naturally define it as a task-oriented module from the compound-control viewpoint, although we do not claim its originality, since similar notions have been already proposed. Its inputs are the relevant environmental changes, and the operons situated at its core are responsible for the realization of the actions (see Mathematical Description of Regulatory Units Section for a general treatment of regulatory units).


    ILLUSTRATIVE EXAMPLE: LACTOSE UTILIZATION NETWORK
 TOP
 ABSTRACT
 INTRODUCTION
 COMPOUND CONTROL
 ILLUSTRATIVE EXAMPLE: LACTOSE...
 MATHEMATICAL DESCRIPTION OF...
 EXAMPLES
 CONCLUSION
 REFERENCES
 
Before proposing a general treatment of regulatory units, we provide in this section its flavor with an illustrative example using a classic system, the lactose utilization network (15Go).

Lac regulatory unit
A regulatory unit is first defined from the compound-control viewpoint for the system of interest, and then the general descriptions are proposed. The central task of the lactose utilization network is efficient catabolism of lactose, a nutritious sugar that a cell finds in its environment. The task is accomplished through appropriate actions, that is, efficient transcription of the lacZYA operon precisely regulated by several PPI- or PDI-based regulatory actions, in response to changes in the availability of carbon sources, especially lactose and glucose in the environment. Catabolite repression in the presence of glucose in addition to lactose is a complex regulation apparent at the transcription level of the lacZYA operon, encoding ß-galactosidase for conversion of lactose into allolactose, lactose permease for uptake of external lactose, and acetyltransferase for sugar metabolism.

Fig. 1 shows our proposed lac regulatory unit for handling compound environmental changes in the availability of external lactose and external glucose. The lacZYA operon is placed in the core of the unit (the hatched module in Fig. 1). The operon can be considered a functional operator that determines the concentration of transcribed mRNA from the concentrations of the transcription factors, cAMP-CRP and LacI, in this case. The final output of the regulatory units is the concentration of translated proteins from the mRNA (three green arrows in Fig. 1). Although we are interested in the dynamic change in the mRNA concentration of the core operon (lacZYA, in this example) or its resulting proteins with the environmental changes as inputs (two blue arrows in Fig. 1), there is an additional step between the external changes (inputs to regulatory units) and concentrations of the transcription factors (inputs to the operon). The external environmental changes are transmitted through the cell by the concentrations of second messengers, which in turn affect the concentrations of transcriptional factors. In other words, transcription factors couple the expression of the target gene to environmental signals, and they are regulated by second messengers. In this example, the second messengers are cAMP for cAMP-CRP and allolactose for LacI. Repressor LacI is inactivated when it is bound by four allolactose molecules. The concentrations of second messengers are determined by the availabilities of external glucose and lactose and also by the lactose permease produced from LacY (the long feedback arrow in Fig. 1). To be precise, the concentration of cAMP rises sharply as the glucose is depleted and then drops rapidly to almost the initial level when the cell begins to use lactose (16Go).


Figure 1
View larger version (9K):
[in this window]
[in a new window]
 
FIGURE 1  Lac regulatory unit of lactose utilization network.

 
General description of Lac regulatory unit
The dynamics of the Lac regulatory unit defined above are described as

Formula 1(1)

Formula 2(2)

Formula 3(3)

Formula 4(4)
where x, Formula 4, Formula 4, Formula 4, and Formula 4 are the concentrations of lacZYA mRNA, produced proteins (y1 for ß-galactosidase, y2 for lactose permease, and y3 for acetyltransferase), external carbon sources (e1 for glucose and e2 for lactose), second messengers (s1 for cAMP and s2 for allolactose), and regulator molecules (u1 for cAMP-CRP and u2 for LacI), respectively, and {alpha}, ß(= diag{ß1, ß2, ß3}), {gamma}(= diag{{gamma}1, {gamma}2}), and {delta}(= diag{{delta}1, {delta}2}) represent the degradation rates (together with the growth rate) for x, y, u, and s, respectively. Functions F, G, H, and K describe the production rates of x, y, u, and s, respectively, and are rational functions of the arguments. Note that they explicitly depend on u, x, s, and e, respectively, as is obvious from the signaling flows in Fig. 1 and K depends also on s itself and on y corresponding to the feedback regulation.

In this system, F satisfies Formula 4 and Formula 4, since cAMP-CRP (u1) acts as an activator and LacI (u2) acts as a repressor. As a whole system, function F is implicitly affected by both e1 and e2, suggesting that changes in the two carbon sources' concentrations (e1 and e2) are compound environmental changes. The choice of functions F, G, H, and K together with the choice of variables to be included in the model leads to different models for the same system, as will be shown in the next subsection. The general description of the lac regulatory unit in Eqs. 14 provides a unified view for different models, enabling them to be compared.

Comparison of different models
Several mathematical models for the lactose utilization network have been proposed. Here we focus on four of them and clarify their focus in terms of the regulatory unit shown in Fig. 1 by specifying their variables, loops, and detailed expressions for F, G, H, and K. This enables us to illustrate the generality of the regulatory unit and its description, Eqs. 14, proposed above.

Setty et al. model
Setty et al. (17Go) proposed a dynamic model of the lac operon as a functional unit with inputs of cAMP (s1) and allolactose (or IPTG, s2). It does not consider the external environmental changes or the feedback regulation via lactose permease (y2). Fig. 2 shows the regulatory unit in their model, with the parts not considered shown as shaded dotted lines.


Figure 2
View larger version (12K):
[in this window]
[in a new window]
 
FIGURE 2  Setty et al.'s model of lactose utilization network (17Go).

 
The dynamics of their model are described as

Formula 5(5)

Formula 6(6)
corresponding to Eq. 1 and Eq. 3, respectively, while Eq. 2 and Eq. 4 are not considered, as shown in Fig. 2. Function F in Eq. 5 is given as a rational function of u1 and u2 (details omitted here), and the static relationship corresponding to Eq. 6 is

Formula 7(7)
where the subscript s in Hs stands for "steady state."

Yildirim and Mackey model
Yildirim and Mackey (18Go) proposed a detailed model for a lactose utilization network with lactose as the only carbon source. It takes into account the time delay due to the transcription and translation processes and feedback regulation via lactose permease. Fig. 3 shows the regulatory unit in their model, with the parts not considered shown as shaded dotted lines. In addition to the variables introduced in our proposed regulatory unit, it considers the concentration of intracellular lactose (L), although it does not appear explicitly in the general description of the regulatory unit. External glucose (e1), cAMP (s1), and cAMP-CRP (u1) are not considered.


Figure 3
View larger version (13K):
[in this window]
[in a new window]
 
FIGURE 3  Yildirim and Mackey's model of lactose utilization network (18Go).

 
The dynamics of their model are described as

Formula 8(8)

Formula 9(9)

Formula 10(10)

Formula 11(11)
where Formula 11. Note that Eq. 10 leads to a static relationship, u2 = Hs(s2), similar to the previous example, and that Eq. 11 is an extended representation of Eq. 4 for an augmented variable, Formula 11.

The detailed expressions for F, G, Hs, and K are

Formula 12(12)

Formula 13(13)

Formula 14(14)

Formula 15(15)
where a, b1, b2, c, d, e, f, g, and h are constants. The {Gamma}, {Phi}1, and {tau}u in Eq. 12 denote the spontaneous rate of mRNA production, the equilibrium constant for the operator-repressor reactions, and the time delay required to produce the mRNA, respectively. The Formula 15 and Formula 15 values in Eq. 13 denote the time delay for the mRNA translation of ß-galactosidase (y1) and of lactose permease (y2), respectively. The power 4 in Eq. 14 comes from the fact that the repressor LacI is deactivated by the tetramer of allolactose with {Phi}2 being the equilibrium constant for the repressor-allolactose reaction. The terms in Eq. 15 denote, respectively, the ß-galactosidase-mediated conversion of lactose into allolactose, the ß-galactosidase-mediated allolactose loss due to conversion into glucose and galactose, the permease-facilitated transport of external lactose, the intracellular lactose loss due to extracellular fluid because of the reversible nature of the permease-mediated transport, and the ß-galactosidase-mediated conversion of lactose to allolactose. The {Phi}i-values (i = 3, 4, 5, 6, 7) are the corresponding equilibrium constants. The nonlinear dynamics of this model depend on the system parameters and are further analyzed elsewhere (19Go).

Ozbudak et al. model
Ozbudak et al. (20Go) investigated the mechanism creating bistability in the lactose utilization network using a simple mathematical model. Feedback regulation via produced lactose permease for lactose uptake and inducer exclusion by glucose is explicitly taken into account, although the translation, thus y, is not (see Fig. 4).


Figure 4
View larger version (12K):
[in this window]
[in a new window]
 
FIGURE 4  Ozbudak et al.'s model of lactose utilization network (20Go).

 
The dynamics of their model are described as

Formula 16(16)

Formula 17(17)

Formula 18(18)
corresponding to Eqs. 1, 3, and 4, respectively, where a and R0 are constants, and K1 and K2 are functions corresponding to catabolite repression and lactose uptake, respectively (details omitted here). Equation 17 leads to the static relationship Formula 18 with constant RT.

Santillán and Mackey model
Santillán and Mackey (21Go) proposed yet another detailed model of the lactose utilization network with all of the variables in the proposed regulatory unit (see Fig. 5).


Figure 5
View larger version (11K):
[in this window]
[in a new window]
 
FIGURE 5  Santillán and Mackey's model of lactose utilization network (21Go).

 
The dynamics of their model are described as

Formula 19(19)

Formula 20(20)

Formula 21(21)

Formula 22(22)
corresponding to Eqs. 14, respectively. Equation 21 leads to Formula 22, where Formula 22 denotes the vector for the concentrations of free second messengers (subscript f stands for "free"), i.e., free cAMP (not bound to CRP) and free allolactose (not bound to the repressor LacI). The Formula 22 denotes the total (free and bound) concentration of cAMP and allolactose. Detailed expressions for F, G, H, Hf, and K are omitted here. Function F is determined by the configuration of the binding sites on the DNA and by the binding energies of possible binding states of the lac operon.


    MATHEMATICAL DESCRIPTION OF REGULATORY UNITS
 TOP
 ABSTRACT
 INTRODUCTION
 COMPOUND CONTROL
 ILLUSTRATIVE EXAMPLE: LACTOSE...
 MATHEMATICAL DESCRIPTION OF...
 EXAMPLES
 CONCLUSION
 REFERENCES
 
Using the illustrative examples in the previous section, here we present our general framework for describing models of general regulatory units.

Regulatory units
As we did for the lactose utilization network in the previous section, we first define a general regulatory unit for which mathematical descriptions are then proposed. A regulatory unit is defined as a task-oriented module from the compound control viewpoint. As described in Compound Control Section, a task in the compound control scheme is the correspondence between environmental changes and the actions required to realize a behavior, which is a unit of activities with a definite purpose evoked by environmental changes. For a given task, the relevant environmental changes and the actions required to perform the task are specified; a set of the main operons responsible for the actions is then naturally chosen (lacZYA for the lactose utilization network in Illustrative Example: Lactose Utilization Network Section). A regulatory unit is thus defined with the main operons in the core (Fig. 6 for the case with only one operon). The inputs and outputs of an operon are the concentrations of regulatory molecules (transcription factors) and of transcribed mRNA, respectively. The ultimate output of the regulatory unit is the concentration of translated proteins. Since transcription factors couple the expression of the target genes to environmental signals and they must be regulated, we consider an additional variable—the concentration of second messengers that regulate transcription factors. Second messengers include small ligands (such as allolactose in lac system) that regulate DNA-binding affinity of transcription factors, sensor kinases (such as NarX and NarQ) that regulate the activity of response regulators (such as NarL) by covalent modification as in two-component systems, and regulatory proteins that regulate effective concentration of a transcription factor by sequestration (14Go). The main flow of signals in the regulatory unit along these variables is shown by the thick arrows in Fig. 6. Additional regulatory relationships among the variables can be systematically defined to complete the regulatory unit. Fig. 6 shows the general scheme of a regulatory unit from the compound control viewpoint with feedback loops (shown by dotted arrows) from the output as representative regulatory relationships.


Figure 6
View larger version (11K):
[in this window]
[in a new window]
 
FIGURE 6  Regulatory unit from compound-control viewpoint.

 
A view of regulatory units from the control theoretical viewpoint suggests that the operon is a plant to be controlled and the rest of the regulatory unit is a controller, producing control inputs for the plant (Fig. 7). Fig. 8 shows a typical control system treated in control theory, with a feedforward controller and a feedback controller. This view provides a guide to carry out system identification of the total regulatory unit. A standard way of designing or identifying control systems begins with identification of the plant, the operon in this case, and then proceeds to identification or design of the controller, which is responsible for achieving the appropriate functioning of the regulatory unit in response to compound environmental changes. It should be noted that this classification of a plant and a controller is provided within the regulatory unit, which is simply a part of a more complex and larger biological system. Another way to classify the modules of a biological system into a plant and a controller is to consider the regulatory unit defined in this article as a controller and the physical phenomenogical part outside the regulatory unit as a plant (9Go,22Go).


Figure 7
View larger version (9K):
[in this window]
[in a new window]
 
FIGURE 7  Regulatory unit can be decomposed into a plant and a controller, corresponding to the operon and the rest, respectively.

 

Figure 8
View larger version (6K):
[in this window]
[in a new window]
 
FIGURE 8  Typical control system used in control theory, with a plant (P), a feedforward controller (C1), and a feedback controller (C2). Inputs to and outputs from the plant are denoted as u and y, respectively, and the external inputs from the environment are denoted as e.

 
Note that the notion of regulatory units is similar to that of gene circuits (23Go), which sense their environmental context and orchestrate the expression of a set of genes to produce appropriate patterns of cellular response. Similarly to regulatory units, a gene circuit is organized around transcription units, the simplest of which is an operon. Different ways of coordinating the expression of functions in a gene circuit are provided depending on the connectivity of transcription units, in a similar manner for regulatory units to achieve different tasks. Regulatory units put more emphasis than gene circuits do on the compound environmental changes related to a given task and on the general mathematical description that are compatible with classical control theory. Regulatory units are also similar but different from the classic notions of regulons, modulons, and stimulons (24Go) in the sense that the former are determined based on the operons responsible for the given task despite compound and thus (basically) several external changes, whereas the latter designate groups of operons controlled by a common regulator or responding to a given environmental change. Still, the notions of regulons, modulons, and stimulons can help define a regulatory unit for certain systems and, in some cases, they might help elucidate new actions, tasks, and connectivity that have not been considered.

Mathematical description
The mathematical description of the general regulatory unit shown in Fig. 6 is

Formula 23(23)

Formula 24(24)

Formula 25(25)

Formula 26(26)
where Formula 26, Formula 26, Formula 26, Formula 26, and Formula 26 are the concentrations of mRNA for the operons, produced proteins, external changes, second messengers, and regulator molecules, respectively, and Formula 26, Formula 26, Formula 26, and Formula 26 represent the degradation rates (together with the growth rate) for x, y, u, and s, respectively. Functions F, G, H, and K describe the production rates of x, y, u, and s, respectively, and are rational functions of the arguments. Note that F, G, H, and K explicitly depend on u, x, s, and e, respectively, as is obvious from the main signaling flow, and on the output y via feedback regulation. Functions H and K generally depend additionally on u and s themselves, respectively, reflecting the fact that the production rates of u and s are affected by their own concentrations (see, for example, the model of the tryptophan metabolic pathway in Tryptophan Metabolic System Section).

In this general description, a priori knowledge of whether the regulators are acting as activators or repressors is not necessary; however, it can be specified by the sign of the partial derivative of F: ui is a repressor if Formula 26 and is an activator if Formula 26. This formulation of the regulators without a priori knowledge of their activity is advantageous for a unified treatment of regulatory units. It is consistent with the fact that regulation by transcriptional factors is initiated by their interactions with the binding sites on DNA irrespective of their activities and that the transcription rate is determined by the configuration of binding sites on DNA and by the binding energies of the possible binding states (H. Kimura, H. Okano, and R. J. Tanaka, unpublished data), although the resulting actions that affect the functioning of RNAP may completely differ between activators and repressors. The fact that Formula 26 represents compound environmental changes can be verified by determining whether F depends on all of Formula 26.

In the example in Yildirim and Mackey Model (Fig. 3), the internal state (L) between the environmental change (e2) and the second messenger (s2) is considered with its dynamics, and an augmented variable, z, including the internal state is introduced in the model, as in Eq. 11. Although it is possible to augment the variables to include as much of the internal dynamics of the controller as we want, the variables shown in Eqs. 2326 (Fig. 6) are minimally necessary for describing the regulatory unit from the compound-control viewpoint. The only exception occurs when other operons should be included in the controller part, as shown in Fig. 9. Assume the additional operon in the controller part (the dotted black module in Fig. 9) has input uc and output xc. The dynamics of this part of the controller can then be given as

Formula 27(27)

Formula 28(28)

Formula 29(29)


Figure 9
View larger version (9K):
[in this window]
[in a new window]
 
FIGURE 9  Regulatory unit with the dynamic controller.

 
Introducing augmented state Formula 29, we can replace Eq. 25 in the general description with

Formula 30(30)

Formula 31(31)


    EXAMPLES
 TOP
 ABSTRACT
 INTRODUCTION
 COMPOUND CONTROL
 ILLUSTRATIVE EXAMPLE: LACTOSE...
 MATHEMATICAL DESCRIPTION OF...
 EXAMPLES
 CONCLUSION
 REFERENCES
 
To illustrate the generality of the mathematical description we proposed in the previous section, here we provide more examples of regulatory units, in addition to that for the lactose utilization network described above.

Arabinose utilization network
The task of an arabinose utilization network is arabinose catabolism, and is achieved by efficient transcription of the araBAD operon, the required actions in response to changes in the availability of carbon sources, especially glucose and arabinose in the environment. As in the classic lactose-glucose systems, an arabinose utilization network shows diauxie in the presence of both arabinose and glucose because of catabolite repression by glucose (13Go).

The regulatory unit of the arabinose utilization network is shown in Fig. 10. The external availabilities of glucose and arabinose are the environmental changes and they affect the concentrations of second messengers, cAMP and internal arabinose. The regulators for the main operon, araBAD, are cAMP-CRP, AraC, and AraC-arabinose. AraC by itself acts as a repressor, whereas AraC-arabinose acts as an activator (25Go). The araBAD operon is responsible for the catabolism of arabinose, and arabinose uptake requires transcription of two additional operons, araE and araFGH. We include only araFGH in the scheme for simplicity and omit araE. Regulator AraC is produced by an araC operon, which is regulated by the same set of regulators as for araBAD and araFGH, as shown in the shaded box in Fig. 10. The lac operon considered in Illustrative Example: Lactose Utilization Network Section is a special case in which an operon provides the proteins necessary for the catabolism as well as the uptake of lactose, and its repressor LacI is produced constitutively so that there is no need to include the lacI operon in the regulatory unit.


Figure 10
View larger version (20K):
[in this window]
[in a new window]
 
FIGURE 10  Regulatory unit of the arabinose utilization network; xc is internal state of dynamic controller.

 
The dynamics of the regulatory unit are given as

Formula 32(32)

Formula 33(33)

Formula 34(34)

Formula 35(35)
where Formula 35, Formula 35, Formula 35, Formula 35, and Formula 35 are the concentrations of mRNA (x1 for araBAD and x2 for araFGH), produced proteins (y1 for AraB, A, and D and y2 for AraF, G, and H), external carbon sources (e1 for glucose and e2 for external arabinose), second messengers (s1 for cAMP and s2 for internal arabinose), and regulator molecules (u1 for cAMP-CRP, u2 for AraC, and u3 for AraC-arabinose), respectively; and Formula 35 is the augmented variable with xc being the concentration of araC mRNA, L = diag{0, 1, 1, 1}, and Formula 35, Formula 35, Formula 35, and Formula 35 are the degradation rates (together with the growth rate) for x, y, z, and s, respectively.

Tryptophan metabolic system
The tryptophan metabolic system is another classic example of gene regulation systems and has been extensively studied both experimentally and theoretically (26Go). The task is the efficient supply of tryptophan, one of the amino acids that the cell requires; it is achieved by efficient transcription of the trpEDCBA operon, which encodes the genes for the enzymes necessary in the tryptophan biosynthesis pathway in response to the changes in availability of tryptophan in the cell environment.

The regulatory unit for this system can be described as shown in Fig. 11 for the trpEDCBA operon. The regulators for the operon are charged tRNA and active repressor TrpR-tryptophan, which are responsible for transcription attenuation and repression, respectively. In addition to these two mechanisms, this system can use enzyme inhibition for feedback regulation. A produced enzyme, anthranilate synthase, which is the first enzyme to catalyze a reaction in the tryptophan biosynthesis pathway beginning with chorismate, is a heterotetramer consisting of two TrpE and two TrpD polypeptides. The enzymatic activity of anthranilate synthase is inhibited when the TrpE subunits of an anthranilate are bound by two tryptophan molecules. A change in the external concentration of tryptophan in the environment is the only environmental change that affects the actions we consider, and the second messenger is internal tryptophan. We assume here that tRNA is constantly available for charged tRNA and omit its production mechanism, whereas the production of TrpR requires a trpR operon, which is also regulated by TrpR-tryptophan and is shown in the shaded box in Fig. 11.


Figure 11
View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 11  Regulatory unit of the tryptophan metabolic system.

 
The dynamics of the regulatory unit in Fig. 11 are described as

Formula 36(36)

Formula 37(37)

Formula 38(38)

Formula 39(39)
where x, Formula 39, and Formula 39 are the concentrations of mRNA, produced proteins (y1, y2, y3, y4, y5 are for TrpA,B,C,D,E, respectively), external tryptophan, internal tryptophan, and regulatory molecules (u1 for charged tRNA and u2 for TrpR-tryptophan), respectively, Formula 39 is the augmented variable with xc being the concentration of mRNA for inactive transcription factor TrpR, Formula 39, and {alpha}, ß(= diag{ß1, ß2, ß3}), Formula 39, and {delta} are the degradation rates (together with the growth rate) for x, y, z, and s, respectively.

A number of theoretical models for the tryptophan metabolic system have been used to study different system characteristics. However, they have considered neither the gene regulation for TrpR nor the external input e of tryptophan outside the cell. As a result, the general description used for the tryptophan system does not need augmented variable z. For example, Sinha (27Go) proposed a simple mathematical model with transcription repression as the only regulation mechanism (ignoring transcription attenuation and enzyme inhibition) (Fig. 12). Bliss et al. (28Go) proposed a model including both transcription repression and enzyme inhibition but not transcription attenuation (Fig. 13). They also considered time delay for the translation. It is easy to verify that both models can be interpreted in the general framework given by Eqs. 2325.


Figure 12
View larger version (14K):
[in this window]
[in a new window]
 
FIGURE 12  Sinha's model of the tryptophan metabolic system (27Go).

 

Figure 13
View larger version (13K):
[in this window]
[in a new window]
 
FIGURE 13  Bliss et al.'s model of the tryptophan metabolic system (28Go).

 
Santillán and Zeron (29Go) recently proposed a detailed model of the tryptophan metabolic system with all three control mechanisms and the time delay due to translation. The related regulatory unit is shown in Fig. 14, and the dynamics are described as

Formula 40(40)

Formula 41(41)

Formula 42(42)

Formula 43(43)
where sf is the free tryptophan (not bound to anthranilate). The detailed expressions for F, G, H, K, and Hf are

Formula 44(44)

Formula 45(45)

Formula 46(46)

Formula 47(47)

Formula 48(48)

Formula 49(49)
with Formula 49 and Formula 49, where a, b, c, KG, KT, KI, K, KC, KP, P, and GT are constants.


Figure 14
View larger version (13K):
[in this window]
[in a new window]
 
FIGURE 14  Santillán and Zeron's model of the tryptophan metabolic system (29Go).

 
Although we did not include the transporter protein in the regulatory unit shown in Fig. 11, nor is it included in the three models shown above, it is used to take up tryptophan from outside the cell to inside. Tryptophan outside the cell is important from the compound-control viewpoint, so the transporter, the box from e to s in Fig. 11, should be taken into consideration in the regulatory unit. However, mtr gene encoding of the tryptophan-specific transporter is regulated not only by TrpR but also by TyrR-tyrosine, a compound of TyrR and another aromatic amino acid, tyrosine. We thus have to include the tyrosine biosynthesis pathway and the operon for TyrR with their regulators as well if we want to include the transporter proteins. Actually, there are several genes that are co-regulated by transcription factors related to three aromatic amino acids (tryptophan, tyrosine, and phenylalanine). They usually code for the proteins that are used in the common pathway for three aromatic amino acids. Consequently, it is reasonable to consider the whole aromatic amino-acid biosynthesis pathway, and not only the tryptophan biosynthesis pathway, when we discuss the regulatory unit from the compound-control viewpoint with external changes as inputs.

Heat shock response system
As an example of a more complex system, we consider systems for heat shock response (30Go). The task is to counter the effects of heat such as protein unfoldings and malfunctions. It is done by making heat shock proteins (molecular chaperones) in response to environmental changes in temperature. Based on a recent mathematical analysis of the heat shock response system (9Go), we consider a simple description of the system with DnaK/J as representative molecular chaperones and FtsH as a representative protease.

The regulatory unit for this simple heat shock response system is shown in Fig. 15. The main operons are dnaK for DnaK/J and hflB for FtsH. Transcription of both operons is promoted when {sigma}32 factors are bound to RNAP, so that free {sigma}32 factors (which are ready to bind to RNAP and promote transcription and are not bound to chaperones or proteases) act as regulatory molecules, whereas the total {sigma}32 factors are considered as second messengers. The environmental changes are temperature shifts outside the cell. The system uses three different regulatory mechanisms: feedforward by which the temperature upshift promotes translation of the {sigma}32 factors, sequestration feedback by which the {sigma}32 factors are sequestered by chaperones and thus cannot bind to RNAP and activate the transcription of the operons, and degradation feedback by which the {sigma}32 factors are degraded when bound to both DnaK and FtsH. All three regulatory mechanisms are shown in Fig. 15.


Figure 15
View larger version (22K):
[in this window]
[in a new window]
 
FIGURE 15  Regulatory unit for the heat shock response system.

 
The dynamics of the regulatory unit shown in Fig. 15 are described as

Formula 50(50)

Formula 51(51)

Formula 52(52)

Formula 53(53)
where Formula 53, Formula 53, and u are the concentrations of mRNA (x1 and x2 are for dnaK and hflB, respectively), produced proteins (y1 and y2 are for DnaK and FtsH, respectively), external temperature, total {sigma}32, and free {sigma}32, respectively, and {alpha}(= diag{{alpha}1, {alpha}2}), ß(= diag{ß1, ß2}), {gamma}, and {delta} are the degradation rates (together with the growth rate) for x, y, u, and s, respectively.

El-Samad et al. (9Go) proposed a simple mathematical model for the heat shock response system and used it to analyze how well the designed regulatory mechanisms fit the system requirements for robustness, response speed, noise rejection, efficiency, etc., by clarifying the role of each regulatory mechanism. The dynamics of their simpler model (reduced-order model), shown in Fig. 16, are described as

Formula 54(54)

Formula 55(55)

Formula 56(56)
corresponding to the general description, Eqs. 2426. The y1 and y2 stand for the concentrations of DnaK and FtsH, respectively. State x does not explicitly appear in the model based on the assumption that yi = aixi (i = 1, 2), with constants ai. The {alpha} is a constant, and ß and {delta} represent the degradation rates for y1 and s, respectively. A static relation, u = Hs(s, y), is given for Eq. 55. The K(e) in Eq. 56 denotes feedforward regulation in response to temperature upshifts, and the second term represents degradation feedback by which free {sigma}32 factors are degraded when they are bound to both DnaK (y1) and FtsH (y2), so that the resulting term is given by the product of free {sigma}32 factors (Hs), free DnaK (G1(y1)), and free FtsH (G2(y2)).


Figure 16
View larger version (12K):
[in this window]
[in a new window]
 
FIGURE 16  El-Samad et al.'s model of the heat shock response system (9Go).

 
{lambda}-system
As the last example, we consider the bacteriophage {lambda}-system, which has been studied since the 1950s (31Go) as a paradigm for developmental genetic networks (32Go). A {lambda}-infected bacterium follows either of two pathways: lysis, in which {lambda} actively produces viral copies using the bacterial molecular machinery, and lysogeny, in which all but one of the phage genes are turned off, and one phage chromosome integrates itself into the host chromosome so that the viral copies are produced passively as part of the host chromosome. The regulatory mechanism responsible for the lysogeny/lysis decision, the {lambda}-switch, has been extensively studied, and its detailed mechanism has been revealed, although not yet completely (33Go–35Go).

The task of the {lambda}-system is efficient and adaptive switching between lysogeny/lysis pathways in response to environmental changes such as in the nutritional state and in UV irradiation. It is done by appropriate production of related proteins such as CI and Cro: CI, the {lambda}-repressor, turns off all phage genes except its own gene and thus leads to stable lysogenic growth, whereas Cro promotes and is required for lytic growth. The corresponding mRNA for cI and cro are both transcribed from the right operator (OR).

The regulatory unit of the {lambda}-system is shown in Fig. 17, with OR as the central entity responsible for the task. The regulators of OR are CI2 and Cro2 dimers, whereas the outputs of the regulatory unit are CI and Cro monomers. We consider the nutritional state and UV irradiation as environmental changes that affect the regulatory molecules via the second messengers of CII and activated RecA proteins. CII is responsible for kickstarting CI production, and the nutritional state affects the production of CII. For example, the mean and peak of the CII concentration levels are reduced in well-fed cells, so there is less probability of CI production being kickstarted, resulting in the reduced probability of lysogeny. The RecA protein becomes activated when DNA is damaged, for example, by UV irradiation. The activated protein cleaves the CI monomers, which cannot dimerize anymore, and repressors fall off the operator. As a result, the rate of repressor synthesis drops and transcription of cro begins. Although there are more complex mechanisms for regulation of CII and RecA production, we consider a simplified scheme for the regulatory unit that omits other proteins such as N, Q, HflA, HflB, and CIII and terminators such as Formula 56 and TR in order to focus on the effects of environmental changes on the transcription of main mRNAs from the compound control viewpoint.


Figure 17
View larger version (8K):
[in this window]
[in a new window]
 
FIGURE 17  Regulatory unit of the {lambda}-system.

 
The dynamics of the regulatory unit shown in Fig. 17 are described as

Formula 57(57)

Formula 58(58)

Formula 59(59)

Formula 60(60)
where Formula 60, Formula 60, Formula 60, Formula 60, and Formula 60 are the concentrations of mRNA (x1 and x2 are for cI and cro, respectively), produced proteins (y1 and y2 are for CI and Cro monomers, respectively), environmental changes (e1 and e2 are for the nutritional state and UV irradiation, respectively), second messengers (s1 and s2 are for CII and RecA, respectively), transcription factors (u1 and u2 are for CI2 and Cro2 dimers, respectively), and {alpha}(= diag{{alpha}1, {alpha}2}), ß(= diag{ß1, ß2}), {gamma}(= diag{{gamma}1, {gamma}2}), and {delta}(= diag{{delta}1, {delta}2}) are degradation rates (together with the growth rate) for x, y, u, and s, respectively.

Based on the development of experimental studies of {lambda}-systems, several theoretical models have been proposed, e.g., (34Go–37Go). While two of them (34Go,35Go) consider most of the detailed processes related to the {lambda} switch, many others focus only on the transcriptional regulation of OR. Santillán and Mackey (37Go) proposed a mathematical model of the {lambda}-system and used it to investigate the stability of the lysogenic state. They focused on the dynamics of OR, as shown in Fig. 18. The dynamics are described as

Formula 61(61)

Formula 62(62)

Formula 63(63)
corresponding to Eqs. 5759, whereas second messengers s, and thus Eq. 60, are not taken into consideration. Function F(u) for the transcription rate is calculated from the probability of different compatible binding states of OR. The probability of each binding state is calculated under the assumption of thermodynamic equilibrium using the binding energies of corresponding molecules (38Go). The translation rate G(x) is given by

Formula 64(64)
for y1 and y2, respectively, with {tau}1 and {tau}2 being the time delay required for translation to begin. The static relationship for u and y is derived for Eq. 63 under a quasi steady-state assumption for the dimerization reactions.