help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Rosales, R. A.
Right arrow Articles by Hladky, S. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rosales, R. A.
Right arrow Articles by Hladky, S. B.

Biophys J, January 2002, p. 29-35, Vol. 82, No. 1

Kernel Estimates for One- and Two-Dimensional Ion Channel Dwell-Time Densities

Rafael A. Rosales,* William J. Fitzgerald,dagger and Stephen B. HladkyDagger

 *Departamento de Matemáticas, Instituto Venezolano de Invetigaciones Científicas, Caracas 1020-A, Venezuela, and Departments of  dagger Engineering and  Dagger Pharmacology, University of Cambridge, Cambridge, United Kingdom


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

In this paper, we compare nonparametric kernel estimates with smoothed histograms as methods for displaying logarithmically transformed dwell-time distributions. Kernel density plots provide a simpler means for producing estimates of the probability density function (pdf) and they have the advantage of being smoothed in a well-specified, carefully controlled manner. Smoothing is essential for multidimensional plots because, with realistic amounts of data, the number of counts per bin is small. Examples are presented for a 2-dimensional pdf and its associated dependency-difference plot that display the correlations between successive dwell times.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

Dwell-time histograms and the probability density functions (pdfs) estimated from them are an important means for presenting the results of patch-clamp investigations of ion channel currents. Blatz and Magleby (1986), McManus et al. (1987) and Sigworth and Sine (1987) demonstrated that constructing the distribution with a logarithmic rather than a linear time scale can greatly reduce the computations required for subsequent fitting and has a number of advantages for display of the data: it produces a pdf for a single exponential whose peak height and position on the log axis specify the amplitude and time constant of the exponential decay; it allows convenient display of dwell times spanning several orders of magnitude; and it provides a much better visual impression of the number of components in the exponential mixture. A major disadvantage has been that it is not straightforward to construct a histogram with log bins when the original data have been digitized. The mapping of dwell times for data recorded at a fixed sample interval to log bins is highly irregular: some bins for short durations can never receive counts, and significant irregularities between adjacent bins extend beyond the bin corresponding to 100 times the sample interval. Stark and Hladky (2000) have shown that simple approximations based on the nature of the binning process combined with the transformation rules for probability density functions make it possible to adjust the display of the counts to compensate for these irregularities. We report here that construction of kernel density plots (Bowman and Azzalini, 1997) rather than histograms provides a simpler method for displaying estimates of the pdf. These kernel-density plots also have the advantage of producing an estimate that has been smoothed in a well-specified, carefully controlled manner. The advantages of smoothing are most marked in multidimensional plots such as the 2-dimensional histograms used to display the occurrence of pairs of dwell-times (Magleby and Song, 1992) where the number of data counts per bin in a traditional histogram would be small.

In a traditional histogram, each sojourn in a state or aggregate of states with the same conductance, i.e., a conductance class, is represented by a count in the bin corresponding to the dwell time. The contribution to the plot is a small rectangle with unit height and the same width as the bin. The placing of the rectangle is the same regardless of where the dwell time falls within the range of times within the bin. A nonparametric kernel density estimate is constructed by representing each sojourn by a kernel function positioned at the actual value of the sampled dwell time. A common choice of function is a normal with width still to be specified. The kernel density estimate is then the sum of the increments for all sojourns in the class. Because, for a log plot, the increment can be chosen to have unit area, the total area under the resulting curve is equal to the number of counts, and the pdf can be obtained by dividing all values on the curve by the total number of counts.


    DWELL-TIME KERNEL ESTIMATES
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

Single class dwell times

Let t = (ti), i = 1, 2, ... , n be a series of dwell times at a particular class. Also let the kernel function phi  be a normal density on the log axis z = ln(t), with mean located at the value of the log of the dwell time, and scale dependent width (standard deviation), sigma i,
&phgr;<SUB><UP>i</UP></SUB>(z)=<FR><NU>1</NU><DE>&sfgr;<SUB><UP>i</UP></SUB><RAD><RCD>2&pgr;</RCD></RAD></DE></FR><UP> exp</UP><FENCE><UP>−</UP><FR><NU>(z−x<SUB><UP>i</UP></SUB>)<SUP>2</SUP></NU><DE>2&sfgr;<SUP><UP>2</UP></SUP><SUB><UP>i</UP></SUB></DE></FR></FENCE>, (1)
then the kernel estimate for the pdf of x = (xi), i = 1, ... , n, is
h(z)=<FR><NU>1</NU><DE>n</DE></FR> <LIM><OP>∑</OP><LL><UP>i=1</UP></LL><UL><UP>n</UP></UL></LIM>&phgr;<SUB><UP>i</UP></SUB>(z). (2)
For long dwell times, sigma i can be independent of the dwell time. Its value affects the appearance of the plot in a manner analogous to the way the bin width affects a histogram. If the width is much too narrow, the plot consists of many sharp spikes with a visual impression reflecting the accident of the particular samples taken rather than the underlying pdf. In contrast, if the width is much too wide, the plot becomes a single smear with loss of most information about the data. Fortunately, sigma i can be chosen simply by considering the properties of a log dwell-time plot.

When plotted on a logarithmic scale, the width of a exponential component is independent of its amplitude or time constant. Hence, the principles for choosing the kernel for a mixture are the same as those for a single exponential. Estimates constructed with several choices of constant width, sigma i = sigma , are compared in Fig. 1 A with the underlying theoretical pdf for an exponential with time constant 250 µs sampled at 5 µs intervals. Data points for the estimates (1000 per curve) were obtained deterministically in proportion to the value of the pdf (see Stark and Hladky, 2000, for details). For each data point, the kernel increment was evaluated at values of z on a grid with 20 steps per e-fold change in dwell time, which is sufficiently fine to avoid distorting the shape and size of the kernel. (The grid spacing must be less than sigma .) Evaluations were restricted to the range of z for which phi  is significant. Consider first the range xi = ln(ti) > -3. For these values, any sigma  < 0.2 preserves the shape. Larger values of sigma  smear out the distribution, lowering the peak and adding weight for longer and shorter dwell times. The present example is not suitable to demonstrate the effect of making sigma  too small because there are too many data points and they are deterministically spaced.



View larger version (14K):
[in this window]
[in a new window]
 
FIGURE 1   Comparisons of kernel estimates with a theoretical exponential probability density function of tau  = 250 µs. (A) Estimates for different normal kernels with constant width zeta . zeta  values are: solid curve, 0.1; dashed, 0.2; heavy dots, 0.4; light dots, 0.8 and 1.6. The theoretical curve is shown as a heavier line. (B) Estimates with different corrections for the uncertainty for very short dwell-times. tau  = 50 µs. zeta  = 0.1. The m values are dotted line, 0; solid, 1; and dashed, 2. (C) Demonstration of a close match between estimate and theory (thin and heavy solid lines). tau  is 250, zeta  = 0.1 m = 1. ±SD confidence bands for the estimate, computed via Eq. A8, are also shown as dash-dotted lines. In this and the next two figures, "log" means logarithm to the base e.

The series of peaks at short dwell time in the trace for sigma  = 0.1 in Fig. 1 A represent the increments added for dwell times of 1, 2, ... sample intervals. Because the data have been sampled at discrete times, all increments to the kernel estimate for short dwell times are centered at just these precise values. Thus these peaks are an artifact of the sampling, not a feature of the underlying pdf. As sigma  is increased, the increments are smeared out over the possible range, producing a better representation of the underlying signal.

It makes no sense to set the width of the kernel increment to a value smaller than the uncertainty in the dwell times. This uncertainty must be at least of the order of a sample interval. A constant uncertainty on a linear scale transforms to a variable uncertainty on a log scale. The value of sigma i corresponding to an uncertainty of m sample intervals for a dwell time of duration ti is
&sfgr;<SUB><UP>i</UP></SUB>=<UP>ln</UP><FENCE>1+<FR><NU>m&dgr;</NU><DE>2t<SUB><UP>i</UP></SUB></DE></FR></FENCE>, (3)
where delta  is the sample interval. The value of sigma i calculated from this relation is ln(1.5) for m = 1 and a dwell time equal to one sample interval, but it decreases rapidly, falling below 0.1 for dwell times exceeding five sample intervals. Figure 1 B illustrates results obtained with several values of m. For m = 1, the kernel estimate is not completely smooth but the variations occur centered about the underlying pdf. For larger m, the curve is smoother but it lies below the pdf for small dwell times and above it for somewhat larger values. The optimal value of m is close to 1 regardless of the time constant. In the following, we have set m = 1.

Eq. 3 cannot be used on its own as a means of specifying the width, because sigma i becomes much too small for large dwell times. A simple, effective procedure is to use instead
&sfgr;<SUB><UP>i</UP></SUB>=<UP>max</UP><FENCE><UP>ln</UP><FENCE>1+<FR><NU>&dgr;</NU><DE>2t<SUB><UP>i</UP></SUB></DE></FR></FENCE>, &zgr;</FENCE>. (4)
Figure 1 C displays the result of applying this relation with zeta  = 0.1, which produces essentially no distortion of the pdf. Although freedom from distortion favors this small zeta , elimination of the roughness resulting from random sampling favors large zeta . The more data available the smaller the value of zeta  that should be used. However, because zeta  = 0.2 produces negligible distortion, although that produced by 0.8 would normally be unacceptable, in practice, it is the range 0.2-0.4 that will be useful. A procedure for the optimal estimation of zeta  given the number of dwell times available, or, alternatively, an estimate of the minimum number of times required for a chosen value, is outlined in the Appendix. For 900 dwell times and an underlying pdf with two spaced exponential components, the optimal value is between 0.26 and 0.34.

Joint dwell times

Let xi,1 = ln(ti,1) and xi,2 = ln(ti,2) for i = 1, 2, ... , n, denote the log-transformed sojourns of consecutive dwell times in any two classes F and C. Bivariate kernel estimates for the joint density h(z1, z2) of adjacent dwell-time pairs, (xi,1, xi,2), may be defined by extending Eq. 1 as the product of univariate kernels along each direction,
&phgr;<SUB><UP>i</UP></SUB>(z<SUB>1</SUB>, z<SUB>2</SUB>)=&phgr;<SUB><UP>i</UP></SUB>(z<SUB>1</SUB>)&phgr;<SUB><UP>i</UP></SUB>(z<SUB>2</SUB>). (5)
This function is defined for all points (z1, z2) on the real plane, because, in principle, (ln(ti,1), ln(ti,2)) is in  R2. This construction leads to the estimate
h(z<SUB>1</SUB>, z<SUB>2</SUB>)=<FR><NU>1</NU><DE>2n&pgr;</DE></FR> <LIM><OP>∑</OP><LL><UP>i=1</UP></LL><UL><UP>n</UP></UL></LIM> <FR><NU>1</NU><DE>&sfgr;<SUB><UP>i,1</UP></SUB><UP>&sfgr;<SUB>i,2</SUB></UP></DE></FR> <UP>exp</UP>[f(z<SUB>1</SUB>, z<SUB>2</SUB>)] (6)
with
f(z<SUB>1</SUB>, z<SUB>2</SUB>)=−<FENCE><FR><NU>(z<SUB>1</SUB>−x<SUB><UP>i,1</UP></SUB>)<SUP>2</SUP></NU><DE>2&sfgr;<SUP><UP>2</UP></SUP><SUB><UP>i,1</UP></SUB></DE></FR>+<FR><NU>(z<SUB>2</SUB>−x<SUB><UP>i,2</UP></SUB>)<SUP>2</SUP></NU><DE>2&sfgr;<SUP><UP>2</UP></SUP><SUB><UP>i,2</UP></SUB></DE></FR></FENCE>. (7)
Alternatively, it is possible to use a bivariate kernel with correlated coordinates to allow for smoothing along particular directions, see Bowman and Azzalini (1997) or Wand and Jones (1995).

Estimates based on Eq. 5 can also be used to generate plots of the dependency difference function introduced by Magleby and Song (1992),
D(z<SUB>1</SUB>, z<SUB>2</SUB>)=h(z<SUB>1</SUB>, z<SUB>2</SUB>)<SUP>1/2</SUP>−h(z<SUB>1</SUB>)<SUP>1/2</SUP>h(z<SUB>2</SUB>)<SUP>1/2</SUP>, (8)
with h(z1, z2) given by Eq. 6 and h(z1), h(z2), the one-dimensional (1D) density estimates defined in Eq. 2, which can be calculated from h(z1, z2) as the marginals
h(z<SUB>1</SUB>)=<LIM><OP>∑</OP><LL><UP>k=1</UP></LL><UL><UP>K<SUB>2</SUB></UP></UL></LIM>h(z<SUB>1</SUB>, z<SUB><UP>2,k</UP></SUB>), h(z<SUB>2</SUB>)=<LIM><OP>∑</OP><LL><UP>k=1</UP></LL><UL><UP>K<SUB>1</SUB></UP></UL></LIM>h(z<SUB><UP>1,k</UP></SUB>, z<SUB>2</SUB>), (9)
with K1 and K2 the sizes of the discrete grid used to evaluate the joint density along each coordinate. The dependency difference indicates the importance of the discrepancy between the observed distribution of dwell-time pairs and that which would arise if the dwell-times in each class were independent of each other.


    RESULTS
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

It is routinely possible to obtain enough experimental data to construct 1D log dwell-time distributions with both a relatively large number of bins per decade and sufficient counts per bin to obtain a reasonable estimate of the pdf. The principal advantage of the use of kernel estimation in the construction of these 1D estimates is its simplicity. Binning, correction for the binning artefacts (see Stark and Hladky, 2000) and smoothing of the histogram are all replaced with a single, easily implemented summation of increments. The major disadvantage is that the smearing correction for the binning artifacts for dwell-times less than ~5 sample intervals is not as accurate (see for example Fig. 1 B) as the correction factors for log-binned histograms (see Stark and Hladky, 2000).

Two-dimensional (2D) pdfs or difference dependency plots require the square of the number of bins needed for the 1D plot, and, for these, the smoothing becomes critical. To illustrate the use of kernels for the estimation of open-closed dwell-time densities, we have generated synthetic noisy data using the following four-state reversible cyclic mechanism (see Scheme 1).



View larger version (15K):
[in this window]
[in a new window]
 
Scheme 1  

The states were grouped into two classes, F and C, with F = {O1, O2} as the class of "open" states and C = {C1, C2} the class of "closed" states. This decomposition induces the following partition for the Q matrix,
Q=<FENCE><AR><R><C><UP>Q<SUB>𝔉𝔉</SUB></UP></C><C><UP>Q<SUB>𝔉ℭ</SUB></UP></C></R><R><C><UP>Q<SUB>ℭ𝔉</SUB></UP></C><C><UP>Q<SUB>ℭℭ</SUB></UP></C></R></AR></FENCE><UP>,</UP> (10)
with Q𝔉𝔉 as the submatrix of rate constants between the states of the F class, Q𝔉ℭ the rates corresponding to the transitions from the F class to the C class, and so on. The rate constants ki; i = 1, ... , 8, were chosen to ensure states with widely separated life times within each aggregate. These were set to k1 = 6.183, k2 = 0.454, k3 = 2.697, k4 = 1.665, k5 = 0.446, k6 = 13.163, k7 = 0.182, and k8 = 11.812 (all in arbitrary units). In this case, both O1 and C1 are short-lived states (tau O1 = 0.0551, tau C1 = 0.0629), whereas O2 and C2 are long (tau O2 = 1.351, tau C2 = 0.5519). For this setting, long sojourns at each class originate from events of the type O1 right-left-harpoons  O2 or C1 right-left-harpoons  C2, which are then interrupted by brief transitions to the other class by events of the form C2 right-arrow O1 right-arrow C2 or O2 right-arrow C1 right-arrow O2.

To assess the performance of joint histograms and the estimate in Eq. 6, a noisy trace of 900 dwell-time pairs that correspond to the consecutive sojourns in F and C was generated by sampling at delta  = 0.005. These data were idealized by using the methods described in Rosales et al. (2001). Kernel estimates, together with the theoretical joint density, are shown in Fig. 2. The highest peaks, located right foreground, originate from brief sojourns in F followed by long durations in C. In principle, these pairs could arise from events of the type O1 right-arrow C2 or O1 right-arrow C2 right-left-harpoons  C1. The second maxima originates from long sojourns in F followed by short ones in C. These correspond to the transitions O2 right-arrow C1 or O1 right-left-harpoons  O2 right-arrow C1. A third minor peak corresponds to long consecutive sojourns in both classes, which might appear to correspond to transitions O2 right-arrow C2. Figure 2 B presents the kernel estimate obtained from the 900 simulated dwell-time pairs. This estimate was obtained using Eq. 6 with zeta  = 0.35 on a 60 × 60 grid, showing close resemblance to the theoretical density in Fig. 2 A. Figure 2 C shows a smoothed histogram constructed using the same sequence of dwell times by considering a 70 × 70 grid with 10 bins per e-fold change along each axis. The counts were corrected for the log binning artefacts as described by Stark and Hladky (2000). Without smoothing, both the raw histogram and the corrected plot had the appearance of a forest of trees. The histogram shown in Fig. 2 C was obtained by replacing each point with an average of 361 points weighted according to the inverse of their distance to the center (the central point and the four nearest neighbors all have weight 1). Similar, but somewhat more ragged, results were obtained using a uniform average over 289 points instead of the inverse weighted average over 360. In both cases, the size of the averaging region were chosen so that the peak heights would be close to the theoretical pdf and the kernel estimates, but this agreement was achieved at the expense of considerable elevation in the distributions for short open or closed times. Smoother histogram plots could be produced using larger averaging regions, but, in these, there was severe loss of peak height and even larger tails.



View larger version (30K):
[in this window]
[in a new window]
 
FIGURE 2   Joint dwell-time densities for dwell-time pairs for consecutive sojourn in F (right axis) and C (left axis). (A) Theoretical density predicted by the spectral decomposition of Q, (B) kernel density estimate, and (C) histogram smoothed as described in the text. Both (B) and (C) are generated from 900 dwell-time pairs fitted to synthetic data generated by the mechanism described by Q.

Dependence-difference plots are shown in Fig. 3. Figure 3 A presents the theoretical results calculated from Eq. 8 using the theoretical joint and marginal densities obtained from the spectral decomposition of Q (Colquhoun and Hawkes, 1982). Figure 3 B shows the dependence-difference contour obtained using Eq. 8 and the estimate plotted in Fig. 2 B. Figure 3 C presents the corresponding dependence plot computed from the smoothed histogram displayed in Fig. 2 C. It can be seen from Fig. 3 B, that positive dependencies (D(z1, z2) > 0) are related to dwell-time pairs associated with events of the form O1 right-arrow C2 and O2 right-arrow C1, whereas negative dependencies are seen for the regions that correspond to the transitions O1 right-arrow C1 and O2 right-arrow C2. Therefore, the third minor maximum at Fig. 2 B corresponding to long sojourns in both F and C arises from composite transitions from O2 to C2, i.e., O2 right-arrow C1 right-arrow C2 or O2 right-arrow O1 right-arrow C2.



View larger version (57K):
[in this window]
[in a new window]
 
FIGURE 3   Dependence-difference contour plots. (A) is calculated from the theoretical joint and marginals given by Q and the dependency-difference definition in Eq. 8. (B) and (C) correspond to the dependencies calculated via Eq. 8 and Eq. 9 from the kernel estimate in Fig. 2 B and the histogram in Fig. 2 C, respectively. The vertical axis is used for the times in C, whereas the horizontal axis for those in F.


    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

The comparison of the estimate for the open-closed density shown in Fig. 2 B with the theoretical shape in Fig. 2 A, and, even more markedly, the comparison of the dependency differences computed from them in Fig. 3, show clearly the utility of kernel density estimation for log-transformed dwell times. To produce the plot at Fig. 2 C, the histogram was first constructed and then corrected to account for the aberrations induced by log binning (Stark and Hladky, 2000). The corrected counts were finally smoothed. This sequence is accomplished in a single step when evaluating the kernel directly from Eq. 6, making these estimates simpler to compute.

Inspection of Fig. 2 and Fig. 3 may suggest that the corrected and smoothed histogram is closer than the density estimate to the underlying pdf. However, the amount of smoothing for the histogram was chosen to make the highest peak agree with the theoretical distribution to display a result as favorable as possible for this method. In practice, this choice is not available because it depends on the knowledge of the underlying density. By constrast, the kernel method provides an automatic choice for the amount of smoothing found by minimizing the bias introduced by the estimation procedure. It should be emphasized that the results presented show that estimates obtained with zeta  in the range of 0.2-0.4 do not create or eliminate peaks nor do they shift peaks appreciably on the z axis or the (z1, z2) plane.

Construction of kernel estimates and histograms should produce similar results as the number of dwell times increases asymptotically toward infinity. However, the kernel estimator can be shown to be more efficient because its bias (in terms of the mean integrated squared error, see the Appendix) decreases as O(n-4/5) whereas the bias of a histogram follows O(n-2/3) (see Wand and Jones, 1995, p. 23). This property is particularly useful when estimating joint densities for the kinetic analysis of ion channels, since the observations available from a single molecule are usually brief or few when compared to the number of bins that would be required.

The errors incurred when fitting the kernels generated by Eqs. 4 and A5 with exponential mixtures gives similar results as those shown in Fig. 2 of Sigworth and Sine (1987). However, although the smoothing introduced by the kernel estimation can be chosen to minimize the mean integrated squared error, quantitative assessment of the effects of this smoothing on subsequent fitting remains a subtle, open question.


    APPENDIX
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

Optimal Bandwidths

An optimal choice for a fixed scale bandwidth zeta  follows by minimizing a measure for the overall effectiveness of h in estimating the underlying density g. A commonly used distance is provided by the mean integrated squared error. In the univariate case, this is
&egr;(h)=<B><UP>E</UP></B><FENCE><LIM><OP>∫</OP></LIM>(h(x)−g(x))<SUP>2</SUP><UP> d</UP>x</FENCE> (A1)

=<LIM><OP>∫</OP></LIM>[<B><UP>E</UP></B>{h(x)}−g(x)]<SUP>2</SUP><UP> d</UP>x+<LIM><OP>∫</OP></LIM><B><UP>V</UP></B>{h(x)}<UP> d</UP>x,
with E, V denoting expectation and variance (Bowman and Azzalini, 1997, p. 26). Here x is in  R is used to denote an element of the domain of all the functions to be considered. Asymptotically, as n becomes large, Taylor series approximations applied to both the bias and the variance terms lead to
&egr;(h)≈<FR><NU>1</NU><DE>4</DE></FR> &zgr;<SUP>4</SUP><LIM><OP>∫</OP></LIM>[g″(x)]<SUP>2</SUP><UP> d</UP>x+<FR><NU>1</NU><DE>n&zgr;</DE></FR> <LIM><OP>∫</OP></LIM>&phgr;(x)<SUP>2</SUP><UP> d</UP>x. (A2)
An optimal width follows by minimizing epsilon (h) with respect to zeta . The second term in Eq. A2 is obtained by taking phi (x) as defined by Eq. 1 with sigma  = zeta , and integrating over R,
<LIM><OP>∫</OP></LIM>&phgr;(x)<SUP>2</SUP><UP> d</UP>x=<FR><NU>1</NU><DE>2<RAD><RCD>&pgr;</RCD></RAD></DE></FR>. (A3)
An optimal bandwidth is then given by
&zgr;=<FENCE>n2<RAD><RCD>&pgr;</RCD></RAD><LIM><OP>∫</OP></LIM>[g″(x)]<SUP>2</SUP><UP> d</UP>x</FENCE><SUP>−1/5</SUP>. (A4)
By considering an exponential pdf with rate lambda  in the log metric,
g(x)=&lgr;<UP> exp</UP>[x−<UP>exp</UP>(x)&lgr;],
the required integral of [g"(x)]2 in Eq. A2 yields 1/4, and, hence, from Eq. A4, the optimal value is zeta  = 1.02/n1/5.

More generally, by taking an exponential mixture of two components with equal amplitudes and rates lambda 1, lambda 2 = beta lambda 1, and varying beta  from 0 to infinity , the width is bounded as
<FR><NU>1.02</NU><DE>n<SUP>1/5</SUP></DE></FR>≤&zgr;≤<FR><NU>1.34</NU><DE>n<SUP>1/5</SUP></DE></FR>.
The smallest value is obtained for isolated exponentials, i.e., for beta  = 1. The greatest is achieved at beta  = 0.22 (or beta  = 4.5). Because the data usually follows a multiple-component mixture, it seems reasonable to take the upper limit,
&zgr;=1.34n<SUP>−1/5</SUP>. (A5)
The bivariate case follows from a similar argument. Let x = (x1, x2), and g(x) the joint pdf for successive sojourns x1 and x2. By considering Eq. 7 and assuming sigma i,1 = sigma i,2, i.e., zeta 1 zeta 2 triple-bond  xi , a Taylor series approximation for Eq. A1 gives
&egr;(h)≈<FR><NU>1</NU><DE>4</DE></FR> &xgr;<SUP>4</SUP><LIM><OP>∬</OP></LIM>[∇<SUP>2</SUP>g(<A><AC>x</AC><AC>&cjs1171;</AC></A>)]<SUP>2</SUP><UP> d</UP><A><AC>x</AC><AC>&cjs1171;</AC></A>+<FR><NU>1</NU><DE>n&xgr;<SUP>2</SUP></DE></FR><LIM><OP>∬</OP></LIM>&phgr;(<A><AC>x</AC><AC>&cjs1171;</AC></A>)<SUP>2</SUP><UP> d</UP><A><AC>x</AC><AC>&cjs1171;</AC></A>,
where the integrations are taken over R2, and
∇<SUP>2</SUP>g(<A><AC>x</AC><AC>&cjs1171;</AC></A>)=<FR><NU>∂<SUP>2</SUP>g(<A><AC>x</AC><AC>&cjs1171;</AC></A>)</NU><DE>∂x<SUB>1</SUB></DE></FR>+<FR><NU>∂<SUP>2</SUP>g(<A><AC>x</AC><AC>&cjs1171;</AC></A>)</NU><DE>∂x<SUB>2</SUB></DE></FR>.
The integral for phi (x)2 leads to 1/(16pi 2), and, hence, the optimal width is
&xgr;=<FENCE>n2<SUP>3</SUP>&pgr;<SUP>2</SUP><LIM><OP>∬</OP></LIM>∇<SUP>2</SUP>g(<A><AC>x</AC><AC>&cjs1171;</AC></A>)<UP> d</UP><A><AC>x</AC><AC>&cjs1171;</AC></A></FENCE><SUP>−1/6</SUP>.
By considering a bivariate mixture, g(x), with two components along each coordinate with rates (lambda 1, beta 1lambda 1) for x1 and (lambda 2, beta 2lambda 2) for x2,
g(<A><AC>x</AC><AC>&cjs1171;</AC></A>)=<FR><NU>1</NU><DE>4</DE></FR> <LIM><OP>∏</OP><LL><UP>i=1,2</UP></LL></LIM>{&lgr;<SUB><UP>i</UP></SUB><UP>exp</UP>[x<SUB><UP>i</UP></SUB>−<UP>exp</UP>(x<SUB><UP>i</UP></SUB>)&lgr;<SUB><UP>i</UP></SUB>]

+&bgr;<SUB><UP>i</UP></SUB>&lgr;<SUB><UP>i</UP></SUB><UP>exp</UP>[x<SUB><UP>i</UP></SUB>−<UP>exp</UP>(x<SUB><UP>i</UP></SUB>)&bgr;<SUB><UP>i</UP></SUB>&lgr;<SUB><UP>i</UP></SUB>]},
and letting beta 1 and beta 2 vary from 0, up to lim(beta 1, beta 2) right-arrow +infinity , gives
<FR><NU>0.66</NU><DE>n<SUP>1/6</SUP></DE></FR>≤&xgr;≤<FR><NU>0.93</NU><DE>n<SUP>1/6</SUP></DE></FR>.
Again, we propose as an optimal bandwidth the upper limit of this interval, that is,
&xgr;=0.93n<SUP>−1/6</SUP>. (A6)
As one should expect, both widths zeta  and xi  decrease monotonically with n. For the data shown in the Results section where n = 900, Eq. A5 gives zeta  = 0.34 and Eq. A6 xi  = 0.299.

Confidence Intervals

Following Bowman and Azzalini (1997), the scatter associated with the kernel-estimation procedure can be approximated by restricting attention to the kernel variance V{h(x)}. This quantity can be made independent of the underlying density, g, by considering a square root transformation of the kernel estimate. More generally, denote by T a continuous differentiable transformation of h(x). By using a Taylor expansion,
<B><UP>V</UP></B>{T(h(x))}≈<B><UP>V</UP></B>{h(x)}[T′(<B><UP>E</UP></B>{h(x)})]<SUP>2</SUP>.
If T(h(x)) = <RAD><RCD><IT>h(x)</IT></RCD></RAD>, then the principal term of the above expression becomes
<B><UP>V</UP></B><FENCE><RAD><RCD>h(x)</RCD></RAD></FENCE>≈<FR><NU>1</NU><DE>4</DE></FR> <FR><NU>1</NU><DE>n&zgr;</DE></FR><LIM><OP>∫</OP></LIM>&phgr;(u)<SUP>2</SUP><UP> d</UP>u, (A7)
which is independent of the unknown density g. Thus, on the square root scale, the uncertainty associated with the estimate can be expressed by placing a band of width equal to two standard errors (sd) around <RAD><RCD><IT>h(z)</IT></RCD></RAD>. From Eq. A7, this gives
2<UP>sd</UP>=2<RAD><RCD><FR><NU>1</NU><DE>4n&zgr;</DE></FR> <LIM><OP>∫</OP></LIM>&phgr;(u)<SUP>2</SUP><UP> d</UP>u</RCD></RAD>,
and, following Eq. A3,
<UP>sd</UP>=<FR><NU>1</NU><DE>4</DE></FR><RAD><RCD><FR><NU>2</NU><DE>n<RAD><RCD>&pgr;</RCD></RAD>&zgr;</DE></FR></RCD></RAD>. (A8)
These variance-regularizing procedures have also been considered by Sigworth and Sine (1987) in the histogram context.

    FOOTNOTES

Received for publication 21 March 2001 and in final form 20 September 2001.

Address reprint requests to Rafael Rosales, Departamento de Matemáticas, Instituto Venezolano de Invetigaciones Científicas, Apartado. 21827, Caracas 1020-A, Venezuela. Tel.: +58-212-5041412/13; Fax: +58-212-5041416; E-mail: rrosales{at}cauchy.ivic.ve.


    REFERENCES
TOP
ABSTRACT
INTRODUCTION
DWELL-TIME KERNEL ESTIMATES
RESULTS
DISCUSSION
APPENDIX
REFERENCES

Biophys J, January 2002, p. 29-35, Vol. 82, No. 1
© 2002 by the Biophysical Society   0006-3495/02/01/29/07  $2.00



This article has been cited by other articles:


Home page
JGPHome page
R. A. Rosales, M. Fill, and A. L. Escobar
Calcium Regulation of Single Ryanodine Receptor Channel Gating Analyzed Using HMM/MCMC Statistical Methods
J. Gen. Physiol., April 26, 2004; 123(5): 533 - 553.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Rosales, R. A.
Right arrow Articles by Hladky, S. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rosales, R. A.
Right arrow Articles by Hladky, S. B.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2002 by the Biophysical Society.