| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |





* School of Mathematics, The Faculty of Exact Sciences, and
Laser Laboratory for Fast Reactions in Biology, Department of Biochemistry, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
Correspondence: Address reprint requests to M. Gutman, E-mail: me{at}hemi.tau.ac.il.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The pioneering experiments of fast perturbation of systems in chemical equilibria were carried out by Eigen (1964)
. The analysis, however, was based on linearization of an inherently nonlinear process. This shortcoming was amended by Gutman who introduced the laser-induced proton pulse technique (Gutman and Huppert, 1979
), which modeled the relaxation as a nonlinear process (Gutman, 1984
). Reaction systems as complex as proteins in solution were pulsed by free protons that react simultaneously with all proton-binding sites present on the protein, thus initiating a sequence of reactions that relax over a wide range of timescale, from the nanoseconds up to milliseconds. The analysis was carried out by reconstruction of the observed signals by numeric integration of coupled, nonlinear ordinary differential equations with linear and quadratic terms. The coefficients appearing in these equations are a combination of rate constants, equilibrium constants, and the concentrations of the reactants. The system was solved by a search within the parameter space, for any combination of adjustable parameters that reconstruct many, independently measured observations (Checover et al., 1997
, 2001
; Yam et al., 1988
; Marantz et al., 2001
; Nachliel and Gutman, 2001
). In such systems, the search for the value of the unknown parameters is extremely laborious, and the experience and thorough understanding of the chemical nature of the reactants become a crucial element for a successful search. The analysis is also prone to another kind of criticism as summarized by the general belief: "Three parameters are sufficient to draw an elephant and with four it will also dance." This poses a legitimate question: to what extent can the reconstruction of many experimental signals (all gathered under different experimental conditions) by a single set of kinetic parameters be taken as evidence that it consists as the only solution of the kinetics problem? The main goal of this study is to demonstrate that the search process for the rate constants can be fully automated using the Genetic Algorithm. This can lead to the subjection of large multiequilibria systems to detailed kinetic analysis. For this reason, in this study our goal was not to find a system that assures a fast conversion. On the contrary, the algorithm was set to search over a wide range of parameters, to make sure that any local minimum in the parameter space will be detected.
Since the first presentation by Holland (1970)
, the Genetic Algorithm had attracted a lot of interest and was applied to a multitude of scientific fields, including molecular modeling, polymer design, protein folding, etc. (for review see Leardi, 2001
and references therein). Yet, its application to the analysis of complex chemical processes was rather limited. Attempts to utilize the Genetic Algorithm for solving chemical kinetic problems are not numerous (Filipic et al., 2000
; Hongqing et al., 1999
; Houck and Kay, 1995
; Terry and Messina, 1998
), and a system comparable to ours was described by Viappiani et al. (1998)
. However, the mechanism investigated by Viappiani was rather simple and only two rate constants were searched for.
In this study, we investigated a system with inherent complexity made of seven independent variables, and we tested to what extent the Genetic Algorithm can solve the various proton transfer pathways. The system consists of a proton emitter (pyranine), indicator (fluorescein), and the bicarbonate ion that is made by the solvation of the atmospheric CO2. What is more, the indicator has two proton-binding sites: the oxyanion of the xanthene ring and the carboxylate of the benzene as independent sites, which can also exchange protons among themselves. Of the four reactants, only two are directly observed, whereas the other two have no spectral signature and the dynamics should be concluded from their modulation of the protonation dynamics of the observed reactants. The level of complexity of this system is comparable to the case of studying the protonation dynamics of an indicator attached to a protein (Nachliel et al., 1996
; Nachliel and Gutman, 2001
; Shimoni et al., 1993
; Yam et al., 1988
). The rate constants of protonation of these reactants had been measured previously by manual analysis and their pK values are directly measured. To increase the complexity of the system, the concentration of the bicarbonate had to be treated as an adjustable parameter, its concentration varied daily, reflecting the direction that the wind was blowing CO2 from the nearby power station.
The search for the uniqueness was carried out using the Genetic Algorithm under conditions that allowed an intensive search over the parameters' space. The first generation consisted of 100 "phenotypes", each having randomly selected values for the adjustable parameters. The convergence was rather slow, and the calculations proceeded well after the fitness function seemed to reach a constant level, to make sure that the solution was stable and had no tendency to drift into a new set of adjustable parameters. What is more, the same set of experimental signals was subjected to repeat analytic runs and the values of the adjustable parameters were subjected to statistical analysis, as to whether the values of each parameter are members of a single population. The calculations were carried out first with computer-generated signals, made by the integration of the differential rate equations with known values assigned for each adjustable parameter. When the target signals are noiseless, as in the computer-generated signals, the convergence of the Genetic Algorithm was definitely unique and the values derived by the program were practically identical to those used to create the signals. This is an indication that a global minimum for the fitness function in the multidimensional parameters' space can be obtained. Experimental curves are more difficult to analyze due to the electronic noise and experimental uncertainties, and global minimum was not observed. Yet, when more than one signal was subjected to the analysis, the convergence was efficient and each of the adjustable parameters acted as representing a single population of values.
| EXPERIMENTAL SYSTEM |
|---|
|
|
|---|
OH), which ejects a proton when excited by a photon; 2), fluorescein (Flu), a pH indicator that has two proton-binding sites, the first of which is the oxyanion of the xanthene ring. The protonation of this site causes a major spectral shift of the indicator; 3), the second proton-binding site of fluorescein is the carboxylate moiety of the fluorescein molecule, a site that differs from the chromophore proton-binding sites by its pK value and whose protonation does not bear any spectral signature in the visible range; and 4), a buffer molecule (BH) with unknown concentration that reacts with the proton but does not generate a measurable signal. In this study, the buffer is the bicarbonate anion,
which is spontaneously generated by the solvation of the atmospheric CO2. When the experiments are carried out at varying pH values, the bicarbonate concentration increases with the experimental pH. A further experimental complication had to be dealt with in this study; due to the proximity of the laboratory to a power station, the CO2 content in the air varied daily, depending on the wind's direction. For this reason, the bicarbonate concentration had to be introduced as an adjustable parameter.
The measurements were carried out as described in Checover et al. (1997)
. Briefly, a 100-mM NaCl aqueous solution was supplemented with pyranine (20 µM) and fluorescein (10 µM), equilibrated with the air at two pH values (6.8 and 7.3), and subjected to a train of laser pulses (11.5 mJ/pulse; 10 Hz, 355 nm, 3 ns full-width half maximum). The absorption transients were recorded at 458 and 496 nm, where pyranine and fluorescein are, respectively, absorbing. The time resolutions of the measurements were either 30 ns or 300 ns per data point and the readings were converted into concentration units using the differential extinction coefficients 24,000 and 50,000 M1cm2 for pyranine and fluorescein, respectively.
The kinetics measurements were carried out at each pH value at two wavelengths, 458 and 496 nm. During the measurements the pH of the solution was maintained as constant within ±0.05 pH unit. The signal/noise ratio of the signals at the maximal amplitude exceeded 100:1. The two vectors (concentrations versus time) were analyzed in tandem.
| THE MATHEMATICAL MODEL |
|---|
|
|
|---|
![]() |
At the zero time point, a certain fraction of
OH is dissociated by a brief laser pulse, and equal increments of H+ and
O are generated. The incremental concentration of these reactants relaxes by reaction with all other proton-binding sites present in the solution; the two sites on the fluorescein, the
and the pyranine anion itself
O. In parallel, the protonated states of each compound can deliver, by a collisional mechanism, a proton to any other site having a higher pK value.
All these reactions proceed in parallel with velocities that are given by the products of the reactants on the left side of the equation times the rate constant minus the product of the reactant on the right side of the equation times the rate constant. As the concentrations of all reactants vary with progression of time, the velocities will vary with time, in accordance with the temporal concentration of each reactant. All these relations are incorporated into a set of differential rate equations having the following form:
![]() | (1) |
The velocity of the reactions is given by the general expression as in Eq. 1 where Ci and Cj denote the concentrations of the unprotonated state of the reactants, and [CiH] and [CjH] are the concentrations of their protonated forms. ki,j and ki,j are the rate constants of the forward and backward rate constants of the proton transfer reactions with any other reactant (j).
We denote by Vi the deviation in the concentration of reactant i from its equilibrium level. The initial conditions are:
![]() |
X0 denotes the initial increment of the free proton that was released by the laser pulse and its value depends on the laser pulse energy. The integration of the differential rate equations was carried out using the rate constants of the following reactions as adjustable parameters:
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
For the sake of simplicity, the proton transfer between the fluorescein and the
(both having comparable pK values) was ignored.
The proton transfer reaction between the carboxylate and the pyranine was also omitted from the search, the electrostatic repulsion between them is strong and the pK of the carboxylate is low, as a result, the proton dissociation from the carboxylate is sufficiently fast to deplete the COOH state well before an encounter between the COOH and
O ion.
The last adjustable parameters are the concentrations of the bicarbonate present in the reaction mixture, values that varied with the pH of the solution. The concentration of the pyranine and fluorescein were experimentally measured. The pK values of the two dyes were determined by spectrophotometric titrations and that of the bicarbonate was taken as published (Robinson and Stock, 1959
).
Range of unknown parameters
Each adjustable parameter was allowed to vary within a given range depending on its nature. The rate constants in the thermodynamic-favored direction were allowed to vary from the upper limit set by the Debye-Smoluchowski equation (Gutman and Nachliel, 1997
), down to a lower limit, which was set to be four orders of magnitude smaller than the upper limit. The rate constants in the reverse direction were related with the forward ones through the equilibrium constant, ki = ki/10pKi.
The initial magnitude of the perturbation X0 was estimated directly by back extrapolation of the absorbance at 458 nm during the first 0.5 µs to t = 0.
Due to the large variation in CO2, the bicarbonate concentration was allowed to vary within a wide range, 5500 µM.
| THE FITTING PROBLEM |
|---|
|
|
|---|
For any given set of values of the unknown parameters, the ordinary differential equations can be solved using standard numerical subroutines such as Matlab's ODE23s. The level of agreement between the experimental signals and the numerical solutions can be expressed by a fitness function (Ft), which is a weighted average of the squares of the differences between the calculated solutions (
) and the measured signals (
), i.e.,
![]() | (2) |
The fitting problem is not only to find all the parameter combinations that minimize the fitness function. We also have to determine whether the solution is unique. For this reason, the analysis is repeated and the values of each adjustable parameter are subjected to statistical analysis. In the case where the values conform with normal distribution, we can consider them as members of a single population, thus implying that the Genetic Algorithm found the global minimum. Where the values found for one (or more) adjustable parameter appear to be members of more than one population we shall have to conclude that the analysis failed to find a global minimum in the multidimensional parameters' space.
| GENETIC ALGORITHM |
|---|
|
|
|---|
The fitness function was calculated for all new combinations and their fitness function values were evaluated among themselves and in comparison with the fitness function calculated for the previous generation for the unaltered phenotypes of their parent generation.
Our choice of Matlab as the computational platform is motivated by its portability across different platforms and by the availability of the genetic algorithm toolbox GAOT (Houck and Kay, 1995
). To demonstrate the robustness of our methodology, we use the default parameters of GAOT, rather than trying to optimize performance by varying with GAOT's parameters (the selection and termination functions, etc.).
In this study the Genetic Algorithm was searching for the minimum of the fitness function in a seven-dimensional space. The length of each run, lasting 3000 generations varied among the computes used for the calculations, ranging from 2 to 6 h, depending on the processors.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
|
Analysis of experimental signals
In contrast to the zero-noise input signal, the experimental data are noisy and the convergence can, at most, fit into a band defined by the spread of the experimental points. It is possible that the inherent uncertainty, caused by the width of the target signal, will interfere with the convergence of the fitness function, and end up with more than one set of parameters, each of them reconstructing curves that fit within the width of the input signals. The spread of the values can represent a single population, meaning that the convergence had reached a global minimum, which can be either well defined or shallow. Alternatively, the spread may be a combination of local minima, each represented by a different combination of values that all reproduce curves with the shape of the experimental observation. Naturally, the spreading of the results into a family of local minima undermines the utilization of the kinetics analysis of the experimental observations.
Fig. 3 depicts the variation of the adjustable parameters during eight independent runs corresponding to the convergence curves presented in Fig. 2. Although the signal has a relatively low noise level (see the experimental signal in Fig. 1) and the fitness function converged from 103 down to 0.1 ± 0.05 (Fig. 2), the values of the adjustable parameters exhibit large variations. In two runs, the value of intramolecular proton transfer (Fig. 3 C) had drifted to the upper limit that is allowed for the rate constant, with no noticeable effect on the quality of the fit. Such an observation is consistent with the existence of more than one local minimum in the parameters' space.
|
|
We can classify the adjustable parameters into two categories. The first category comprises those associated with the directly observed reactants, the pyranine and the indicator. The concentration of these reactants is accurately measured and precisely recorded at any time. Accordingly, the convergence of the parameter associated with these reactants is very effective (Table 2).
|
It should be stressed that the rate constants derived by the Genetic Algorithm confirm the values that had been determined by the manual search in the parameters' space (see last column in Table 2) (Gutman 1984
; Gutman and Nachliel, 1990
; Gutman et al., 1985
, 2003
).
Statistical evaluation of the solution
The analysis by the Genetic Algorithm is a search process and even when a noiseless signal is recurrently analyzed, the parameters of the best-fitted phenotype of the different runs are not identical and are dispersed (see Table 1). In the case that the target for the fitting is noisy, we needed two signals for tandem analysis. Accordingly the uniqueness of the solution should be derived from statistical analysis.
The normal probability plot (QQplot program of Matlab) is suitable for this kind of analysis. This program displays a plot of the values of the given parameter (on the y axis) versus theoretical normal distribution of the values as calculated from the whole population. If the distribution is normal, the plot will be close to linear.
The analysis was carried out for all adjustable parameters used for the analysis of the experimental signal (Figs. 5 and 6). Fig. 5 depicts the results of the analysis where the rate constants are the analyzed values. As seen in the figure, the values assigned for each of the rate constants appear to belong to a single normal population. Thus, even if the certainty is not high, and the standard deviation is
20% of the mean value, the results are still members of one normal population.
|
|
100 µM, the values are all members of a single normal population. To better ascertain this conclusion, we also analyzed the time constant of protonation of the bicarbonate, namely the product of the rate constant times the bicarbonate concentration. The lower frames in Fig. 6 confirm that the time constants are also members of a normal population. Thus, each of the adjustable parameters needed to reconstruct the observation is the only minimum along its axis, and the solution itself represents the global minimum in the multidimensional parameter space. | CONCLUDING REMARKS |
|---|
|
|
|---|
Finally, we wish to stress that even though this study is an empirical search for the uniqueness, it is versatile and readily applicable for any complex kinetic system, thus it should be considered as a future standard test, incorporated in kinetic analysis of multireactions systems.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Submitted on January 12, 2004; accepted for publication March 9, 2004.
| REFERENCES |
|---|
|
|
|---|
Checover, S., E. Nachliel, N. A. Dencher, and M. Gutman. 1997. Mechanism of proton entry into the cytoplasmic section of the proton-conducting channel of bacteriorhodopsin. Biochemistry. 36:1391913928.[CrossRef][Medline]
Eigen, M. 1964. Proton transfer, acid-base catalysis, and enzyme hydrolysis. Angew Chem. Int. Ed. Engl. 3:119.[Medline]
Filipic, B., I. Zun, and M. Perpar. 2000. Skill-based interpretation of noisy probe signals enhanced with a genetic algorithm. Int. J. Hum. Comput. Stud. 53:517535.[CrossRef]
Gutman, M. 1984. The pH jump: probing of macromolecules and solutions by a laser-induced, ultrashort proton pulse-theory and applications in biochemistry. Methods Biochem. Anal. 30:1103.[CrossRef][Medline]
Gutman, M., and D. Huppert. 1979. Rapid pH and deltamuH+ jump by short laser pulse. J. Biochem. Biophys. Methods. 1:919.[Medline]
Gutman, M., and E. Nachliel. 1990. The dynamic aspects of proton transfer processes. Biochim. Biophys. Acta. 1015:391414.
Gutman, M., and E. Nachliel. 1997. Time-resolved dynamics of proton transfer in proteinous systems. Annu. Rev. Phys. Chem. 48:329356.[CrossRef][Medline]
Gutman, M., E. Nachliel, and E. Gershon. 1985. Effect of buffer on kinetics of proton equilibration with a protonable group. Biochemistry. 24:29372941.[CrossRef][Medline]
Holland, J. H. 1970. Robust algorithms for adaptation set in a general formal framework. Proceedings of the 1970 IEEE Symposium on Adaptive Processes Decision and Control: XVII. ACM PRESS, New York.
Hongqing, C., Y. Jingxian, K. Lishan, C. Yuping, and C. Yongyan. 1999. The kinetic evolutionary modeling of complex systems of chemical reactions. Comput. Chem. 23:143151.[CrossRef]
Houck, C. J. J., and M. A. Kay. 1995. Genetic Algorithm for Function Optimization: A Matlab Implementation NCSU-IE TR 9509.
Leardi, R. 2001. Genetic algorithm in chemometrics and chemistry: a review. ET J. 15:559569.
Marantz, Y., E. Nachliel, and M. Gutman. 2001. Proton-collecting properties of bovine heart cytochrome C oxidase: kinetic and electrostatic analysis. Biochemistry. 40:1508615097.[CrossRef][Medline]
Gutman, M., E. Nachliel, A. Mezer, and O. Noivirt. 2003. Gauging of local micro-environment at protein water interface by time-resolved single-proton transfer reactions. Annals of the European Academy of Sciences. 1:75107.
Nachliel, E., M. Gutman, S. Kiryati, and N. A. Dencher. 1996. Protonation dynamics of the extracellular and cytoplasmic surface of bacteriorhodopsin in the purple membrane. Proc. Natl. Acad. Sci. USA. 93:1074710752.
Nachliel, E., M. Gutman, J. Tittor, and D. Oesterhelt. 2002. Proton transfer dynamics on the surface of the late M state of bacteriorhodopsin. Biophys. J. 83:416426.
Nachliel, E., and M. Gutman. 2001. Probing the substrate binding domain of lactose permease by a proton pulse. Biochim. Biophys. Acta. 1514:3350.[Medline]
Robinson, R. A., and R. H. Stock. 1959. Electrolyte Solutions. Butterworths Publications Ltd., London, UK.
Shimoni, E., E. Nachliel, and M. Gutman. 1993. Gaugement of the inner space of the apomyoglobin's heme binding site by a single free diffusing proton. II. Interaction with a bulk proton. Biophys. J. 64:480483.
Terry, D. B., and M. Messina. 1998. Heuristic search algorithms for the determination of rate constants and reaction mechanisms from limited concentration data. J. Chem. Inf. Comput. Sci. 38:12321238.
Viappiani, C., G. Bonetti, M. Carelli, F. Ferrati, and A. Sternieni. 1998. Study of proton transfer processes in solutions using the laser induced pH jump: a new experimental setup and an improved data analysis based on genetic algorithms. Rev. Sci. Instrum. 69:270276.[CrossRef]
Yam, R., E. Nachliel, and M. Gutman. 1988. Time-resolved proton-protein interaction. Methodology and kinetic analysis. J. Am. Chem. Soc. 110:26362640.[CrossRef]
This article has been cited by other articles:
![]() |
A. Mezer, E. Nachliel, M. Gutman, and U. Ashery A New Platform to Study the Molecular Mechanisms of Exocytosis J. Neurosci., October 6, 2004; 24(40): 8838 - 8846. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |