help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Stark, J. A.
Right arrow Articles by Hladky, S. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stark, J. A.
Right arrow Articles by Hladky, S. B.

Biophys J, February 2000, p. 662-667, Vol. 78, No. 2

Adjustments for the Display of Quantized Ion Channel Dwell Times in Histograms with Logarithmic Bins

J. Alex Stark and Stephen B. Hladky

National Institute of Statistical Sciences, Research Triangle Park, North Carolina 27709-4006 USA, and Department of Pharmacology, University of Cambridge, Cambridge CB2 1QJ, England

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
CHARACTERIZING THE BINNING...
ADJUSTMENTS
DISCUSSION
REFERENCES

Dwell-time histograms are often plotted as part of patch-clamp investigations of ion channel currents. The advantages of plotting these histograms with a logarithmic time axis were demonstrated by Blatz and Magleby (1986, J. Physiol. (Lond.). 378:141-174), McManus et al. (1987, Pflügers Arch. 410:530-553), and Sigworth and Sine (1987, Biophys. J. 52:1047-1054). Sigworth and Sine argued that the interpretation of such histograms is simplified if the counts are presented in a manner similar to that of a probability density function. However, when ion channel records are recorded as a discrete time series, the dwell times are quantized. As a result, the mapping of dwell times to logarithmically spaced bins is highly irregular; bins may be empty, and significant irregularities may extend beyond the duration of 100 samples. Using simple approximations based on the nature of the binning process and the transformation rules for probability density functions, we develop adjustments for the display of the counts to compensate for this effect. Tests with simulated data suggest that this procedure provides a faithful representation of the data.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
CHARACTERIZING THE BINNING...
ADJUSTMENTS
DISCUSSION
REFERENCES

The dwell-time histogram is one of the most common forms of presentation for the results of patch-clamp investigations of ion channel currents. Blatz and Magleby (1986), McManus et al. (1987), and Sigworth and Sine (1987) have argued persuasively that it is most useful to plot such histograms over the logarithms of the dwell times; this makes it possible to clearly present in a single plot dwell times spanning several orders of magnitude and makes it easier to obtain a visual impression of the number of exponential components in the dwell-time distribution. However, the procedures for constructing these histograms are not entirely straightforward, because the recorded data are quantized as multiples of the sampling interval. When linearly quantized data are presented logarithmically the number of different, discrete dwell times mapped to each logarithmic bin varies. Log bins corresponding to the shortest dwell times receive the recorded occurrences or counts for only a few different dwell times or even for none, while log bins corresponding to long dwell times receive the counts for many different dwell times. To account for these variations McManus et al. (1987) chose to plot the ratio of the number of counts to the number of sample intervals mapped to that bin (i.e., the average counts per sample interval). Because the counts per sample interval become very small for long dwell times, they used a log-log presentation. For an exponential distribution of dwell times this gives the appearance of a plateau with a steep roll-off toward longer dwell times starting at about the mean. Sigworth and Sine (1987), who started from interpolated data in which dwell times are continuously distributed, argued that it is clearer to present the counts rather than the counts per sample interval. Attractive features of this style of presentation include the following:

We show here how to construct this type of plot when the dwell times are discrete instead of continuously distributed.

An example of the difficulties encountered when the dwell times are multiples of the sample interval is introduced in the next section. There is a markedly irregular variation in the number of counts in neighboring log bins; some bins remain empty and significant variations in counts extend to dwell times of 100 samples (see Fig. 1). McManus et al. (1987) dealt with these irregularities in their log-log presentation by constructing an empirical decoding file listing the midpoint time and number of mapped dwell times for each log bin. In the third section we build on the ideas they presented, introducing an automatic procedure for characterizing the binning process and extending the correction of their Eq. 24 to a pdf style of plot. We have chosen to plot the estimated pdf as the ordinate so that areas on the transformed plot can be interpreted as probabilities. The square-root transformation suggested by Sigworth and Sine (1987) can easily be applied as a subsequent step if desired. The example is synthetic, but we have found that applying the same corrections to real results yields similar improvements.



View larger version (35K):
[in this window]
[in a new window]
 
FIGURE 1   The observed counts in logarithmic bins of the rounded values from a single exponential distribution of dwell times with mean 70. The dwell times were mapped into bin numbers, using the formula in Eq. 1. The counts are plotted scaled by omega /N, where 1/omega is the width of a log bin and N is the total number of counts. In part a the scaled counts are plotted as points with the equivalent transformed exponential superimposed as a pdf, and in b the scaled counts are plotted in the more familiar form of a traditional histogram. Significant irregularities in the binning extend to bins 39 and 40, to which dwell times around 103 and 116 are mapped. There are also errors in the abscissae, especially for small dwell times: the bin to which dwell time 10 has been mapped is centered between the grid lines for times 9 and 10. Note that the value of the scaled counts is zero in a number of bins corresponding to small dwell times, i.e., no counts are mapped to these bins.

This paper is primarily concerned with the visual presentation of logarithmically binned histograms. McManus et al. (1987) consider in detail the errors introduced into subsequent analysis by logarithmic binning. While simulations suggest that the procedures developed here are visually accurate, they are still approximations, and it will be more accurate if the original rather than the transformed data are used for numerical fitting (as in Qin et al., 1996, for example).

    CHARACTERIZING THE BINNING PROCESS
TOP
ABSTRACT
INTRODUCTION
CHARACTERIZING THE BINNING...
ADJUSTMENTS
DISCUSSION
REFERENCES

Quantized logarithmic bins

When logarithmic binning is used for discrete dwell times, the range of dwell times that map to each bin is very irregular, even for longer values. Throughout this article we will follow an example in which dwell times tau , expressed as multiples of the sample interval, are mapped to bin numbers x by scaling their base 10 logarithms and rounding. This can be represented by the formula
b(&tgr;)=⟨ω <UP>log</UP><SUB>10</SUB> (&tgr;)⟩, (1)
where <  >  denotes rounding to the nearest integer and omega  gives the number of bins per decade; we have chosen 19.385 because this highlights some problems more clearly than an integer value.

An example histogram was produced by generating 30,000 points from an exponential distribution with mean 70. The points were chosen deterministically; values of exp(-tau /70) were spaced uniformly (see Press et al., 1992, Section 7.2). The counts generated using the mapping of Eq. 1 are displayed in Fig. 1, along with the theoretical distribution g(z), given by
g(z)=<UP>ln</UP>(10) <UP>exp</UP><SUB>10</SUB> <FENCE>z−z<SUB><UP>m</UP></SUB>−<FR><NU><UP>exp</UP><SUB>10</SUB> (z−z<SUB><UP>m</UP></SUB>)</NU><DE><UP>ln</UP>(10)</DE></FR></FENCE>. (2)
where z = log10(tau ) and zm = log10(70). The counts are scaled to correspond to a pdf when the abscissae are interpreted as base 10 logarithms. Significant irregularities in the counts extend to high bin numbers. In this case the count for times 97-109 (bin 39) is too high, whereas that for times 110-122 (bin 40) is too low. In addition to the variation in the number of times mapping to each bin, the abscissae can be misleading. In this example the count for time 10, which maps to bin 19, is placed between the scaled logs of times 9 and 10. The results generated with alternative forms of rounding, such as up or down, are generally even worse; these can arise implicitly when data are rebinned. The correction factors developed below can account for any rounding scheme.

Dwell-time ranges for each bin

Table 1, which lists the destination bin numbers given by Eq. 1 for a range of times, illustrates the discontinuous variation in the number of quantized durations that map to each bin. Bins 1-5 will remain empty, and while bin 18 will collect the counts for durations 8 and 9, only that for 10 maps to bin 19. 


                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   The mapping b(tau ) from quantized dwell time tau  to logarithmic bins, x

Characterizing the bin mapping is awkward, especially because numerical effects make a simple inversion of b(tau ) difficult. There are two sources of error: mismatches between the software that gathers the histogram data and that which processes it, and numerical artefacts within one program. An example of the latter is that, in tests, we found it necessary to account for the errors in the computed value of exp(ln(10)). These can be addressed through the use of a decoding file, as suggested in Section I.4(b) of McManus et al. (1987). We have found it most convenient not to attempt to invert b(tau ) directly but rather to summarize the mapping information in a function, a(x), which provides the lowest dwell time that maps to bin number x or higher. An example is shown in Table 2. This could be calculated exactly by lookup from values of b(tau ) in an extended version of Table 1. Alternatively one can employ a generating function and test values on either side. In these tests (with double-precision floating-point arithmetic),
a(x)=⌈10<SUP>(<UP>x−1/2</UP>)<UP>/ω</UP></SUP>−&eegr;⌉, (3)
where left-ceiling  rceil denotes rounding up, produced correct results with eta  = 10-10. The factor eta  ensures that values that should be at or just below integers are not erroneously rounded up.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 2   The shortest dwell time, a(x), which is mapped to bin x or higher

The lower bound on the range of quantized dwell times that map to a bin x is, by definition, a(x). The shortest dwell time to map to a higher bin is a(x + 1), even if bin x + 1 is unmapped. Hence the number of dwell times that map to bin x is
s(x)=a(x+1)−a(x). (4)
Example values are set out in Table 3.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 3   The number of quantized dwell times, s(x), in the range that maps to each bin x

Rebinning

There are situations in which it is possible to collect counts with a higher bin density than would be used for plotting. Bins would then be amalgamated to simplify plots and reduce statistical variation in the counts per bin. In such cases b(tau ) then represents the combination of the original binning and the rebinning processes. Note, however, that simple uniform amalgamation of neighboring bins has a rounding down effect (when indexed from 0 as here). Rounding to the nearest integer can be preserved if the bin density is reduced by an odd factor and the original histogram is padded with empty bins at its ends.

    ADJUSTMENTS
TOP
ABSTRACT
INTRODUCTION
CHARACTERIZING THE BINNING...
ADJUSTMENTS
DISCUSSION
REFERENCES

Abscissa adjustment

Consider the task of displaying the observations in a bin by a point. The scaling and rounding operations affect not only the ordinates, but also the abscissae. The simplest case is where exactly one dwell time maps to a bin. Plotting the point at the midpoint of the logarithmic bin does not correctly represent the logarithm of the dwell time that maps to it because of the rounding operation; e.g., in this example without adjustment, dwell time 10, which maps to bin 19, is displayed between the logarithms of 9 and 10. In this case it makes sense to remap the value of x to the appropriate scaled logarithm, z:
<A><AC>z</AC><AC>ˆ</AC></A>=<UP>log</UP><SUB>10</SUB>(a(x)), (5)
which correctly maps dwell time 10 to 1 in the example. This can be readily generalized for bins that receive counts from more than one dwell time. A point representation of a bin count is most meaningful if it is located centrally in the range of dwell times that map to it, i.e., using a relation such as
<A><AC>z</AC><AC>ˆ</AC></A>=(1/2) [<UP>log</UP><SUB>10</SUB> (a(x))+<UP>log</UP><SUB>10</SUB> (a(x+1)−1)]. (6)
For example, using this rule, the adjusted counts for dwell times 8 and 9, which both map to bin 18, will be displayed at (1/2)log10(8 × 9). Any available knowledge about the quantization process should be taken into account. If the continuous dwell times are known to be rounded to the nearest integer, as in this example, then the range being mapped into bin x starts at a(x) - 1/2 and ends at a(x + 1) - 1/2.

Ordinate adjustment for points

In this section we will combine the use of Eq. 6 with an adjustment to the ordinate to produce a graph of the form shown in Fig. 2. We begin by considering the case of a discrete display. The objective is to estimate the pdf for the logarithmic scale from the observed counts for the dwell times.



View larger version (37K):
[in this window]
[in a new window]
 
FIGURE 2   A plot of the adjusted data values as estimates of the pdf. The theoretical curve is the same as in Fig. 1. The estimated pdf values are the counts multiplied by psi (x), the adjustment factor defined in Eq. 9, using log bin boundaries defined in Eqs. 10 and 11.

The impression conveyed by a graph for the probability of an event being observed with any dwell time range should be independent of the choice of axes used. The probability of an event having a duration between tau 1 and tau 2 can be expressed in terms of pdfs f(tau ) and g(z) for linear and logarithmic scales, respectively:
<LIM><OP>∫</OP><LL>&tgr;<SUB>1</SUB></LL><UL>&tgr;<SUB>2</SUB></UL></LIM> f(&tgr;) <UP>d</UP>t=<LIM><OP>∫</OP><LL><UP>log</UP><SUB>10</SUB> (&tgr;<SUB>1</SUB>)</LL><UL><UP>log</UP><SUB>10</SUB> (&tgr;<SUB>2</SUB>)</UL></LIM> g(z) <UP>d</UP>z. (7)
For small-duration ranges one can accurately approximate the pdfs as linear. Thus for z and tau  in the middle of the range,
g(z) ≈ <FR><NU>f(&tgr;) (&tgr;<SUB>2</SUB>−&tgr;<SUB>1</SUB>)</NU><DE><UP>log</UP><SUB>10</SUB> (&tgr;<SUB>2</SUB>)−<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>1</SUB>)</DE></FR>. (8)
An estimate of f(tau ) can be obtained by dividing the fraction of counts falling in bin x by s(x) (McManus et al., 1987, Eq. 24). Hence, the adjustment factor that should be applied to the counts to obtain an estimate of g(z) is
&psgr;(x)=<FR><NU>(&tgr;<SUB>2</SUB>−&tgr;<SUB>1</SUB>)</NU><DE>N s(x)[<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>2</SUB>)−<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>1</SUB>)]</DE></FR>. (9)
where N is the total number of counts. If the sparsely separated lower bins are deemed to be interpreted in a fashion similar to that of the higher ones, which are virtually uniformly spaced, then we should use a range of half a bin, logarithmically, on either side. In other words,
<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>1</SUB>)=<A><AC>z</AC><AC>ˆ</AC></A>−<FR><NU>1</NU><DE>2ω</DE></FR>, (10)

<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>2</SUB>)=<A><AC>z</AC><AC>ˆ</AC></A>+<FR><NU>1</NU><DE>2ω</DE></FR>. (11)
This is the scheme used to generate Fig. 2.

Traditional histogram display

The adjustments introduced in the preceding section produce an estimate of the probability density function over z = log10(t). It is also possible to generate a histogram in the traditional style---a connected staircase as in Fig. 3. We know that the upper quantized dwell time mapped to a bin is a(x + 1) - 1 and that the lower time for the next bin is a(x + 1). Choosing arbitrarily to draw boundaries half-way between these discrete linear dwell times and thus drawing a bin extending from a lower limit,
&tgr;<SUB>1</SUB>=a(x)−1/2, (12)
to an upper limit,
&tgr;<SUB>2</SUB>=a(x+1)−1/2, (13)
the number of dwell times mapped to the bin is
s(x)=a(x+1)−a(x)=&tgr;<SUB>2</SUB>−&tgr;<SUB>1</SUB>. (14)
The adjustment factor in Eq. 9 becomes
&psgr;(x)=<FR><NU>1</NU><DE>N[<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>2</SUB>)−<UP>log</UP><SUB>10</SUB> (&tgr;<SUB>1</SUB>)]</DE></FR>, (15)
and the area of the bin is simply proportional to the sum of the counts for the corresponding discrete dwell times. McManus et al. (1987) discuss in detail how these binned counts can be used in subsequent analysis. It should be noted that while the bin boundaries do correspond to the correct discrete dwell times, the exact positioning between these dwell times has been chosen arbitrarily, and thus the boundaries do not indicate the precise range of continuous dwell times mapped to each bin. Largely to avoid conveying a false impression, we prefer the pdf style of plot, as in Fig. 2, for the visual presentation of data.



View larger version (35K):
[in this window]
[in a new window]
 
FIGURE 3   A traditional style of histogram plot. For each bin the range of quantized dwell times was a(x) to a(x+1) - 1. The boundaries between bins are the midpoints in linear time between endpoints of neighboring ranges; bins that were unmapped were discarded. The number of counts in each bin is proportional to the area. In this case psi (x) was calculated using Eq. 9 with Eqs. 12 and 13.

Bounds on the correction factor

It is possible to derive an expression that bounds the deviation from unity of the ratio of the observed and adjusted count values:
<FENCE><FR><NU>1</NU><DE>&psgr;(ω <UP>log</UP><SUB>10</SUB> (&tgr;))</DE></FR>−1</FENCE>⪅ <FR><NU>1</NU><DE>&tgr;(10<SUP>1/2ω</SUP>−10<SUP>−1/2ω</SUP>)</DE></FR>. (16)
As can be verified empirically, values occur close to this bound, and hence substantial deviations occur for durations longer than 100 samples.

    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
CHARACTERIZING THE BINNING...
ADJUSTMENTS
DISCUSSION
REFERENCES

The uses and advantages of using a logarithmic time axis for the display of dwell-time data have been set out by by Blatz and Magleby (1986), Sigworth and Sine (1987), and McManus et al. (1987). The principal impediment arises because the experimentally observed dwell times are necessarily multiples of the sample interval. The irregularities in histograms generated from the logarithms of discrete dwell times can be severe for short dwell times and can even be significant for durations as long as 100 samples. Furthermore, the midpoints of the logarithmic bins that display the data for small dwell times do not occur at the log of the duration they represent. The irregularities can be alleviated by interpolation of the experimental record between data points (see Sigworth and Sine, 1987) or by maintaining a table that records for each bin "the actual midtime of the bin and bin width" for use in decoding the histogram (McManus et al., 1987). The method outlined in this paper allows the displayed values to be adjusted so as to remove the irregularities.

The adjustment factors are based on simple approximations, the description of the binning process, and rules for the transformation of pdfs. These allow calculation of point estimates of the distribution for the logarthimic scale, based directly on the frequency of observation of the discrete dwell times. We recommend this method of display. However, if preferred it is possible to present histograms in the more traditional style of a connected staircase in which the bin boundaries indicate the range of the discrete dwell times mapped to the bin, the height is an estimate of the probability density function, and the area is proportional to the number of counts in the bin.

Fig. 2 shows that the adjustments proposed are able to recover, essentially without error, a single exponential distribution of dwell times. A more realistic example is provided in Fig. 4, which plots dwell times drawn at random from a double-exponential distribution. For logarithmic binning defined by Eq. 1, the relatively simple adjustments provided by Eqs. 6 and 9 with Eqs. 10 and 11 provide a visually accurate representation of the data.



View larger version (36K):
[in this window]
[in a new window]
 
FIGURE 4   The observed counts (open circle ) and estimated pdf () for 3000 dwell times tau  drawn at random from the pdf (0.35/4) exp(-tau /4) + (0.65/70)exp(-tau /70), plotted as a line after transformation.

    ACKNOWLEDGMENTS

This research was supported in part by grant 8/E03204 from the Biotechnology and Biological Sciences Research Council, and one author (JAS) was supported in part by Christ's College, Cambridge.

    FOOTNOTES

Received for publication 24 May 1999 and in final form 30 September 1999.

Address reprint requests to Dr. J. Alex Stark, National Institute of Statistical Sciences, Research Triangle Park, NC 27709-4006, USA. E-mail: stark{at}niss.org.

Matlab code for adjusting logarithmic histograms and for the examples presented in this paper is available at http://www.phar.com.ac.uk/RI/sbh/logadjst.zip.

    REFERENCES
TOP
ABSTRACT
INTRODUCTION
CHARACTERIZING THE BINNING...
ADJUSTMENTS
DISCUSSION
REFERENCES

Biophys J, February 2000, p. 662-667, Vol. 78, No. 2
© 2000 by the Biophysical Society   0006-3495/00/02/662/06  $2.00




This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Stark, J. A.
Right arrow Articles by Hladky, S. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stark, J. A.
Right arrow Articles by Hladky, S. B.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2000 by the Biophysical Society.