help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lubensky, D. K.
Right arrow Articles by Nelson, D. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lubensky, D. K.
Right arrow Articles by Nelson, D. R.

Biophys J, October 1999, p. 1824-1838, Vol. 77, No. 4

Driven Polymer Translocation Through a Narrow Pore

David K. Lubensky and David R. Nelson

Department of Physics, Harvard University, Cambridge, Massachusetts 02138

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

Motivated by experiments in which a polynucleotide is driven through a proteinaceous pore by an electric field, we study the diffusive motion of a polymer threaded through a narrow channel with which it may have strong interactions. We show that there is a range of polymer lengths in which the system is approximately translationally invariant, and we develop a coarse-grained description of this regime. From this description, general features of the distribution of times for the polymer to pass through the pore may be deduced. We also introduce a more microscopic model. This model provides a physically reasonable scenario in which, as in experiments, the polymer's speed depends sensitively on its chemical composition, and even on its orientation in the channel. Finally, we point out that the experimental distribution of times for the polymer to pass through the pore is much broader than expected from simple estimates, and speculate on why this might be.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

Modern polymer physics has achieved great success with models in which the polymer is regarded as a flexible, uniform "string" whose conformational entropy dominates the system's behavior (de Gennes, 1979; Doi and Edwards, 1986). Although this is usually an excellent description, in some situations other interactions can become important. One example is the insertion of a polymer into a pore of diameter comparable to the size of the chemical repeat units that make up the polymer. Although perhaps unusual with synthetic polymers, such a situation can easily occur in biological systems. For example, Kasianowicz, Brandin, Branton, and Deamer (hereafter KBBD) have recently detected single strands of RNA (polyuridylic acid) passing through a 1.5-nm pore formed by a membrane-bound protein (Kasianowicz et al., 1996). Szabò and coworkers (Szabò et al., 1997, 1998) and Hanss and coworkers (Hanss et al., 1998) have studied similar systems. In addition to their intrinsic interest, these experiments may eventually lead to a single-molecule RNA and DNA sequencing technique. More generally, most cells must transport macromolecules across membranes to function; in several cases, relatively thick molecules are believed to pass through nanometer-scale channels. The translocation of polynucleotides through proteic pores has been implicated in a variety of processes, including phage infection and bacterial conjugation (Dreiseikelmann, 1994), the uptake of oligonucleotides by certain organs (Hanss et al., 1998), and transport across the nuclear envelope in plants (Citovsky and Zambryski, 1993). It has been speculated that some of these transport pathways could eventually prove important in gene therapy (Szabò et al., 1998; Hanss et al., 1998). Similarly, polypeptide-conducting channels play an important role in protein kinesis (Schatz and Dobberstein, 1996; Simon and Blobel, 1991); in a few instances, the translocation may even be driven by electrophoretic effects (Attardi and Schatz, 1988).

There exists a considerable literature on the confinement of polymers in channels of diameter significantly larger than the polymers' persistence length (de Gennes, 1979, 1999); well-developed scaling techniques can be used in the theoretical treatment of this regime. Recently, theorists have also shown an interest in the opposite limit of a very narrow, almost point-like hole. For example, several groups have studied the diffusion of polymers across idealized, infinitely thin membranes (Carl, 1998; Di Marzio and Mandell, 1997; Yoon and Deutsch, 1995; Lee and Obukhov, 1996; Park and Sung, 1998a,b; Sung and Park, 1996). The pore and the membrane are viewed as hard walls whose only interaction with the polymer is steric, and the emphasis is on how the walls' presence decreases the entropy and slows the dynamics of those parts of the polymer outside of the hole. Possible mechanisms for the active transport of polymers through pores in biological systems have also been studied (Peskin et al., 1993; Simon et al., 1992; Sung and Park, 1996).

Inspired largely by the experiments of KBBD, in this paper we consider a different scenario: we study the motion of a homopolymer threaded through a narrow pore with which it has strong interactions. The pore is taken to be sufficiently small that no more than one polymer diameter can fit in it at a given time; in particular, "hairpin" bends are not allowed to pass through the channel. We also put aside the question of how the polymer first enters the hole, focusing instead on the dynamics once one end has been inserted. We then argue that, in the presence of a force driving the polymer through the pore, there should be a regime in which the polymeric degrees of freedom outside of the pore can be neglected, and the system is effectively one-dimensional (1D). In this limiting case, we propose a two-tiered picture: a coarse-grained macroscopic description of wide validity and a simple microscopic model from which the macroscopic parameters may be calculated. Our approach follows several authors (Peskin et al., 1993; Simon et al., 1992; Park and Sung, 1998a,b; Sung and Park, 1996) in viewing the translocation process as essentially diffusion in one dimension; we differ, however, in emphasizing the role that interactions with the pore itself play in this diffusion process. On the more microscopic level, we include the effects of these interactions through a tilted washboard potential, similar to models of laser mode locking (Haken et al., 1967) or phase dynamics in Josephson junctions (Ambegokar and Halperin, 1969) (see Figs. 5 and 6). The periodic modulation of the potential reflects the periodicity of the polynucleotide's sugar-phosphate backbone. The importance of polymer-pore interactions has previously been emphasized by Bezrukov, Kasianowicz, and coworkers (Bezrukov and Kasianowicz, 1997; Bezrukov et al., 1996; Korchev et al., 1995); our model also bears some similarity to work on gel electrophoresis that examines the importance of local "solid friction" forces between the polyelectrolyte and the gel (Deutsch, 1987; Burlatsky and Deutch, 1993, 1995; Viovy and Duke, 1994; Deutsch and Yoon, 1997). Although the macroscopic parameter values for KBBD's system differ in some respects from those predicted by our microscopic model, we are nonetheless able to make several fairly robust predictions. More importantly, we show how a simple physical mechanism can account for several striking features of the data of KBBD. We thus hope that our work will provide a useful contribution to our understanding of the translocation of polyelectrolytes.

Since our analysis relies heavily on KBBD's results, the next section sketches some salient features of their data. We then introduce a long length-scale "hydrodynamic" description of 1D diffusion and use it to calculate the distribution of passage times for a polymer being driven through a pore. The arguments used to arrive at these results are quite general; in particular, they require few assumptions about the details of the microscopic dynamics of the system. There are, however, circumstances when our approximations break down, and we consider these next. This section also serves to emphasize several aspects of the experiments that will guide our choice of the microscopic model in the succeeding section. After introducing this microscopic model, we use it to calculate a mean drift velocity and an effective diffusion coefficient and compare them to values estimated from KBBD's data. These comparisons will reveal certain features that cannot be accounted for by our model in its simplest form, so we then discuss possible reasons for the discrepancy, as well as touching briefly on several applications of our calculations. We conclude by summarizing our results and highlighting some issues that remain open.

    EXPERIMENTAL BACKGROUND
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

In the experiments of interest to us, KBBD worked with a Staphylococcus aureus alpha -hemolysin ion channel in an artificial lipid bilayer membrane (diphytanoyl-PC). Kasianowicz and Bezrukov have demonstrated that, in concentrated salt solutions, this pore can remain open for periods on the order of tens of seconds (Kasianowicz and Bezrukov, 1995; Bezrukov and Kasianowicz, 1993). The alpha -hemolysin protein has recently been crystallized and an x-ray structure obtained (Song et al., 1996). This reveals a mushroom-shaped complex with a roughly 10-nm-long solvent-filled channel. The channel is 1.5 nm in diameter at its narrowest constriction, barely larger than the diameter of a single polynucleotide strand. After inserting a single pore into a bilayer membrane and applying a transmembrane potential of between 110 and 140 mV, KBBD added homopolymeric single-stranded DNA or RNA to one side of the membrane, designated cis. The samples of polynucleotides had mean lengths on the order of a few hundred nucleotides and were assumed to be close to monodisperse. (Various groups have measured the persistence length of single-stranded DNA in high salt concentrations to be between 0.75 and 1.5 nm [Achter and Felsenfeld, 1971; Smith et al., 1996; Tinland et al., 1997], or roughly 1 to 2 nucleotides, meaning that the polymers used were of order 100 persistence lengths long.) After adding the polynucleotides, KBBD monitored the transmembrane ionic current as a function of time. The time series shows a baseline current, modulated by periods on the order of hundreds of microseconds in which the current decreases almost to zero (Fig. 1, inset). A variety of observations support the interpretation that these blockades were caused by the passage of a polymer through the alpha -hemolysin channel. The data of KBBD can thus be interpreted as giving measurements of the times required for individual polynucleotides to traverse the membrane under the influence of an electric field.



View larger version (37K):
[in this window]
[in a new window]
 
FIGURE 1   Histogram of the number of observed blockade events versus the lifetime of the blockade, for 210 nucleotide poly[U]. Numbers 1-3 label the different peaks. From KBBD (courtesy of J. Kasianowicz, NIST, and D. Branton, Harvard University). (Inset) Typical time series of the current versus time in the experiments of KBBD, showing a transient blockade due to the translocation of a polymer (courtesy of J. Kasianowicz, NIST, and D. Branton, Harvard University).

When these data are displayed as a histogram, with the number of observed events plotted against the length of the blockade (Fig. 1), one sees that the blockade times fall into three distinct peaks. Of these, peak 1 is caused by polymers that enter and retract and thus do not completely cross the membrane, whereas peaks 2 and 3 are both the result of a polymer's actually passing through the channel. The polymers in peak 3 evidently cross the membrane roughly three times faster than those in peak 2. KBBD made the intriguing suggestion that there are two characteristic times associated with translocation because the polynucleotide can enter the pore in two distinct directions: One peak corresponds to polymers that enter the channel with their 3' end first, the other to polymers that enter with their 5' end first. We will show in subsequent sections how such behavior can arise from a simple microscopic model.

A quantity of considerable interest in what follows will be the force F driving the polymer through the pore. One can define F as the mean force required to immobilize a given monomer in the pore, where the average is taken over time and over all of the monomers in a given polymer. Thus, F does not include hydrodynamic drag forces nor forces that vanish when averaged over all the monomers. Equivalently, F can be defined by requiring that exp(Fa/kBT) be the ratio of the probability that the polymer will move forward one base to the probability that it will move backward one base, again appropriately averaged over all monomers. Clearly F is primarily the result of the electric field acting on the polymer. Because a long, narrow channel has a much larger electrical resistance than the macroscopic volumes of solution on either side of the membrane, any voltage V applied to the system should fall almost entirely across the alpha -hemolysin pore. The charge on each nucleotide is just the electron charge e, so the electrostatic energy gained by moving one nucleotide completely through the pore is eV. This suggests that F is roughly
F≈<FR><NU>eV</NU><DE>a</DE></FR>≈5 <FR><NU>k<SUB><UP>B</UP></SUB>T</NU><DE>a</DE></FR>, (1)
where a approx  6 Å is the length of a nucleotide, and the second equality holds for V approx  125 mV. This figure, of course, is a crude estimate and is almost certainly larger than the true force. Nonetheless, it is at least plausible that the driving force is of the order of kBT/a, and thus is quite large when expressed in appropriate units. For most of the rest of the paper, we will explore the consequences of this hypothesis and will use F approx  5kBT/a when numerical estimates are required. The Discussion section will reconsider the value of F in light of what we have learned; some effects that could modify F are also considered in Appendix C.

Before presenting our model, we would finally like to review the experimental evidence that the interactions between the polymer and the alpha -hemolysin pore do indeed play the dominant role in KBBD's experiments. We have already mentioned the existence of two distinct characteristic times for the polymer to cross the membrane. Such a result is easiest to interpret if one believes that the polymer's speed is determined by interactions between the polymer and the narrow channel constriction, where molecular scale asymmetries could be important. Similarly, recent data show that homopolynucleotides of different bases can move at strikingly different speeds (D. Branton, Harvard University, personal communication): poly[U] is of order 20 times faster than poly[dA]. Although chemical differences certainly can lead to variations in polymer properties such as the persistence length, we believe that such strong dependence on molecular details can more easily be explained if we focus on the pore region. Finally, even the fastest polynucleotides pass through the pore far more slowly than simple estimates of hydrodynamic drag would suggest. Model the pore as a cylindrical hole of radius R and the part of the polymer in the pore as a cylinder of radius r. Then, when the polymer moves with speed v, the drag force per length on the part in the pore is roughly 2pi eta rv/(R - r). Electrophoretic effects change this result very little (see Appendix C). For a polynucleotide in an alpha -hemolysin channel, r/(R - r) is somewhat larger than unity, and the total length of the cylinder is roughly 50 Å. According to scaling arguments of Lee and Obukhov (1996), the contribution to the drag force from the ends of the polymer outside the channel is only 2 × 6pi eta bv, where eta  is the solvent viscosity, and the Kuhn length b is between 15 and 30 Å. Even if hydrodynamic interactions are entirely screened by the motion of counterions (as they are for the electrophoresis of an isolated polymer in solution, with screening length of order the monomer size), the drag on those parts of the polymer in solution cannot be larger than roughly 4pi eta Lv. If one substitutes typical parameter values for KBBD's experiments and balances the sum of these drag forces with the naive driving force of 5kBT per nucleotide, one finds that the polymer would be expected to move through the pore at a rate of roughly 108 nucleotides/s, 100 times faster than observed. The three observations of this paragraph, taken together, certainly suggest that we focus on the degrees of freedom in the pore when trying to understand the experiments of KBBD.

    COARSE-GRAINED DESCRIPTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

Motivation and governing equation

This section, and most of the rest of the paper, is concerned with predicting distributions of blockage times of the sort shown in Fig. 1. It is now well established in condensed matter physics that the form of the slow, long length-scale dynamics of a system is often determined by the system's symmetries and conservation laws. All microscopic details are subsumed in phenomenological coupling constants and transport coefficients. In this spirit, we would like to obtain a coarse-grained equation for the probability P(xt) that a contour length x of the polymer's backbone has passed through the pore at time t. (The variable x is defined so that if the polymer backbone has length L, x = 0 when the polymer has just started in the pore and x = L when it has reached the other side). For such a hydrodynamic description to make sense, several conditions must be met. One is that the polymer length L be much larger than the distance a between successive nucleotides. We also demand that the dissolved counterions (as well as the solvent and any other solutes) relax quickly compared to the translocating polymer, so that we may ignore their dynamics. Because the ions are much smaller than a polynucleotide, and consequently diffuse much faster, this condition should not be difficult to satisfy. Finally, our task will be considerably simplified if the microscopic system is (approximately) invariant under translations by an integer multiple of a in either direction. Then, after averaging over variations on the scale of a single nucleotide, we must obtain a translationally invariant equation. We will give this assumption a firmer basis in the next section. Roughly, however, there should be translational invariance when we can neglect the parts of the polymer outside of the channel, and this, in turn, should be possible when the interactions between the polymer and the pore are strong enough.

Under the conditions just outlined, the (probability) density of the polymer is the only conserved variable, and it is relatively straightforward to write down the coarse-grained hydrodynamic equation for P. Because there is only a single polymer (or, equivalently, a "gas" of noninteracting polymers going through the same hole), the probability current j, defined by partial P/partial t + partial j/partial x = 0, must be linear in P. The lowest-order allowed terms are then proportional to P and to partial P/partial x:
j(x, t)=vP(x, t)−D <FR><NU>∂P(x, t)</NU><DE>∂x</DE></FR>. (2)
The first term is permitted because there is an electric field driving the system. P then satisfies the familiar equation for diffusion with drift,
<FR><NU>∂P</NU><DE>∂t</DE></FR>=D <FR><NU>∂<SUP>2</SUP>P</NU><DE>∂x<SUP>2</SUP></DE></FR>−v <FR><NU>∂P</NU><DE>∂x</DE></FR>. (3)
Here v and D are, respectively, an average drift velocity and an effective diffusion coefficient. Their values are determined by more microscopic physics; in particular, they may depend nonlinearly on the applied electric field. Eq. 3 may alternatively be derived from a microscopic master equation that is invariant under translations by a. The coefficients v and D are then related to the lowest-lying eigenvalues of the master equation. This connection will be illustrated in a subsequent section.

On the macroscopic level of Eq. 3, all information on the competition between driving and diffusive spreading is encoded in a parameter that we call the diffusive length ld triple-bond  D/v. Roughly speaking, on length scales less than ld, the polymer's motion is little affected by the presence of the bias from the electric field, whereas, on scales larger than ld, the driving dominates. Indeed, if Eq. 3 described a rigid particle diffusing in 1D under the influence of a uniform force f, an Einstein relation would hold, and we would have v = Df/(kBT), and ld = kBT/f. Thus, in this case, ld is precisely the length over which the driving force does a quantity kBT of work. In the remainder of this section, we will often assume that the length L of the polymer is larger than ld, a condition satisfied by KBBD's data.

Distribution of Passage Times

We now propose to calculate a distribution of passage times of the sort measured by KBBD. This section will show that, for given v and D, the probability psi (t) that the polynucleotide takes a time t to pass through the channel has only one peak. Thus, the presence of two peaks in KBBD's data must be explained by the assumption that different physical situations give rise to different values of v and D. Subsequent sections will argue that a polynucleotide passing through the pore with its 3' end first can indeed have an average velocity that is significantly different from one passing through with its 5' end first. This section, however, is confined to the calculation of the passage times for fixed parameter values. The distribution psi (t) we obtain should thus be compared to a single peak in the data of KBBD.

One can easily estimate the first few cumulants of this distribution. If a polymer of length L moves with average velocity v, one expects that the mean time to pass through the channel should be < t>  approx  L/v. Likewise, the variance in the distance traveled in a time < t> is (Delta x)2 = 2D< t> . It would then seem reasonable that the variance in arrival times should be Delta t2 triple-bond  < (t - < t> )2>  approx  (Delta x)2/v2, or Delta t2 approx  2DL/v3. These conclusions are, in fact, roughly correct for a sufficiently long polymer. One might expect corrections, however, because some fraction of the polymers that enter the pore will leave again from the same side instead of passing all the way through. On average, these will be the slower molecules: those that spend a significant time with only the tip of the polymer inserted in the channel are far more likely to fall back out than are those that are quickly driven through the hole. Thus, only faster chains tend to enter into the calculation of the mean transit time, decreasing < t> . This effect is most pronounced for small L/ld, because only molecules within ld of the cis side have an appreciable chance of backing out instead of exiting on the trans side. Indeed, when L << ld, the driving should be negligible, and we expect < t> to approach its v = 0 value L2/6D. To determine the precise form of this crossover, we must turn to a more detailed calculation.

This calculation can be formulated as one of a well-studied class of problems known as first-passage problems (Risken, 1984; van Kampen, 1992). Essentially, all that is required is to solve Eq. 3 on the interval [0, L] with absorbing boundary conditions P(0) = P(L) = 0. Then, the current density j(L) at L gives the probability per time that the polymer will leave the pore from the far (trans) side, while -j(0) is the probability per time that it will exit from the cis side from which it entered. One must also specify the starting point x0 is in  [0, L] of the polymer; in what follows, we always take the limit x0 right-arrow 0, in keeping with the fact that the polymer starts entirely on the cis side of the membrane. The algebraic details of the solution are summarized in Appendix A; here we include only a discussion of the main results.

Although exact expressions for < t> and Delta t may be obtained, it turns out to be more instructive to consider the distribution psi (t) itself. For arbitrary L/ld, this can only be expressed as an infinite series, but, if terms that become exponentially small as L2/(vtld) right-arrow infinity  are neglected, a comparatively simple analytic expression is obtained:
&psgr;(t)≃<FR><NU>v</NU><DE>2</DE></FR><RAD><RCD><FR><NU>l<SUB><UP>d</UP></SUB></NU><DE>&pgr;</DE></FR></RCD></RAD><FENCE><FR><NU>L<SUP>2</SUP></NU><DE>l<SUB><UP>d</UP></SUB>(vt)<SUP>5/2</SUP></DE></FR>−<FR><NU>2</NU><DE>(vt)<SUP>3/2</SUP></DE></FR></FENCE><UP>exp</UP><FENCE><UP>−</UP><FR><NU>(vt−L)<SUP>2</SUP></NU><DE>4vtl<SUB><UP>d</UP></SUB></DE></FR></FENCE>

×<FENCE><FR><NU>L<SUP>2</SUP></NU><DE>vtl<SUB><UP>d</UP></SUB></DE></FR> &z.Gt; 1</FENCE>. (4)
Note that this expression is not valid for sufficiently large t, and, in particular, not for t so large that it predicts that psi (t) becomes negative. Nonetheless, for values of t near the maximum in psi (t), i.e. those such that vt/L ~ O(1), it is accurate to within a percent for L/ld as small as 4, and correctly reflects the qualitative features of psi (t) for significantly smaller L/ld. Figure 2 plots psi (t) for L/ld = 5; a Gaussian with the same mean and variance is included for comparison. Evidently, psi (t) is quite skewed, and its mean and maximum are correspondingly well separated. Thus, < t> and Delta t are not the best parameters for describing experimental data. Indeed, both cumulants are sensitive to how psi (t) decays for large t, making them very hard to extract accurately from realistic data sets. A more useful choice of parameters to characterize psi (t) are the position tmax of its maximum (which satisfies dpsi /dt|tmax = 0) and the width delta t of the peak. The latter is defined as delta t triple-bond  (tR - tL)/2, where tR and tL satisfy psi (tRtL) = e-1/2psi (tmax); we have chosen a factor of e-1/2 instead of the more conventional 1/2 to facilitate comparison with fits of data to a Gaussian. One expects that, as L/ld right-arrow infinity , tmax and delta t should approach < t> and Delta t, respectively. For example, for large L/ld we have,
t<SUB><UP>max</UP></SUB>=<FR><NU>L</NU><DE>v</DE></FR><FENCE>1−5 <FR><NU>l<SUB><UP>d</UP></SUB></NU><DE>L</DE></FR>+<FR><NU>17</NU><DE>2</DE></FR> <FR><NU>l<SUP><UP>2</UP></SUP><SUB><UP>d</UP></SUB></NU><DE>L<SUP>2</SUP></DE></FR>+32 <FR><NU>l<SUP><UP>3</UP></SUP><SUB><UP>d</UP></SUB></NU><DE>L<SUP>3</SUP></DE></FR>+…</FENCE>. (5)
The rapidly growing coefficients indicate that, although tmax approaches L/v as L approaches infinity, it falls away from its asymptotic form quite rapidly for finite L.



View larger version (15K):
[in this window]
[in a new window]
 
FIGURE 2   The distribution psi (t) of passage times plotted versus t for L/ld = 5. Both quantities are appropriately nondimensionalized, t as vt/L and psi (t) as Lpsi (t)/v. The dashed curve is a Gaussian with the same mean and variance as psi (t).

More generally, one can easily find tmax and delta t by numerically solving the equations that define them. Figure 3 plots delta t/tmax versus the polymer length L. This ratio is especially interesting because it depends only on L/ld, and not on v and D separately; one can thus use it quickly to estimate L/ld. In KBBD's data, delta t/tmax is usually of order 0.5 for an ~200-nucleotide chain, suggesting that L/ld approx  5, or that ld is of order 40 nucleotides. As Fig. 4 indicates, in this range tmax already deviates significantly from the naive guess tmax approx  L/v. In particular, tmax/L varies by a factor of 2 as L/ld increases from 5 to 25. With sufficiently good data, this deviation from a strict proportionality to L might well be observable, providing strong confirmation of our quasi 1D picture.



View larger version (13K):
[in this window]
[in a new window]
 
FIGURE 3   Plot of the relative width delta t/tmax of the peak in the distribution of passage times, versus ld/L. This curve may be used to obtain the quick estimate ld approx  40 nucleotides for the system studied by KBBD. The dashed curve gives the L right-arrow infinity asymptotic behavior, delta t/tmax ~ <RAD><RCD><IT>2l</IT><SUB>d</SUB>/<IT>L</IT></RCD></RAD>. We have chosen to put ld/L instead of L/ld along the ordinate to allow smooth contact with this large L behavior.



View larger version (19K):
[in this window]
[in a new window]
 
FIGURE 4   vtmax/L plotted versus L/ld. Note that vtmax/L varies significantly over the range of L/ld relevant to the experiments of KBBD, and, in particular, that it does not reach its asymptotic value of unity until well outside the range of this plot. (Inset) Plot of tmax (nondimensionalized by ld/v) versus L (nondimensionalized by ld). The dashed line gives the large L limiting form L/v, the solid line the exact value. Note that, although tmax appears to the eye to depend linearly on L over much of the range of the plot, it still differs significantly from L/v.

    REGIME OF VALIDITY
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

In the previous section, we argued that a requirement for the validity of a 1D diffusion model was that the system be (approximately) unchanged if the polymer moves an integer number of monomers forward or backward in the pore. This section discusses when this condition is satisfied. We begin by dividing the polymer into three parts: the roughly 10-nucleotide-long piece that is actually inside the channel, and the two ends, comprising the majority of the nucleotides, outside the channel. The pore always contains the same number of bases, so, for the homopolymers, this part of the polymer always satisfies the requirement of translational symmetry. The length of each end "dangling" outside the pore, in contrast, changes with the translocation parameter x, destroying translational invariance. In what follows, we shall argue that, under certain conditions, this variation may be neglected. Our arguments assume that the parts of the polynucleotide outside the pore may be described by the theories usually applied to long, flexible polymers (de Gennes, 1979; Doi and Edwards, 1986); we thus ignore, for example, hydrogen-bonding and other specific interactions (Cantor and Schimmel, 1980). We also assume that the ion channel is sufficiently long and narrow that any voltage drop falls almost entirely across the channel (see Appendix C). The electric field and the solvent flow velocity outside of the channel can then be ignored.

There are two criteria for ignoring the ends of the polymer outside of the pore. First, they should have a characteristic relaxation time that is much faster than the characteristic time for the motion of a monomer through the channel. In the absence of interactions between the polymer and the pore, one would expect diffusion on the scale of a few monomers to be much faster than the relaxation of a long polymer coil, and this inequality could never be satisfied. However, because the nucleotides in the pore can be expected to interact strongly with the confining protein, the requirement is not implausible. The longest time scale of an isolated polymer in solution is the Zimm time tZ approx  0.4eta RG3/(kBTapprox  0.4eta N3nu b3/(kBT), where nu  is the Flory exponent (In principle, nu  approx  0.6 for a long polymer in a good solvent. However, even with the longest available chains, nu  is never observed experimentally to be larger than 0.55 [Doi and Edwards, 1986], so we use this value for specific numerical calculations.), b is the Kuhn segment length (equal to twice the persistence length), eta  is the solvent viscosity, and N = L/b. Substituting numerical values for a single-stranded polynucleotide in water, one finds that tZ approx  N3nu (3.2 × 10-4 µs). If we imagine that the polymer moves a monomer through the channel by hopping over an energetic barrier (an idea to be considered in more detail when we introduce our microscopic model), then, in the limit of strong driving, the translocation speed is simply v = a/tpore, where tpore is the longest relaxation time of the part of the polymer in the pore. Substituting numerical values for poly[U], we find tpore = a/v approx  1.5 µs. Comparing this figure to tZ, we see that the two become of the same order when N is of order 150, corresponding to a length of polymer of roughly 300 nucleotides protruding from each side of the pore. Of course, for polymers that traverse the membrane more slowly, as is the case for poly[dA], the value of N above, which tZ gsim  tpore can be significantly larger.

As long as the dynamics of the polymer outside of the pore are fast compared to the dynamics in the pore, one need not treat the external degrees of freedom explicitly. Instead, they affect the motion of the polymer only through a contribution F (x) to its free energy and through the increased drag they contribute. (Here, we assume that v is sufficiently small that the parts of the polymer outside the pore are essentially in equilibrium. On purely dimensional grounds, this must be true when tZ << Ny(b/v) for some nonnegative exponent y, a requirement that is met in KBBD's experiments.) Lee and Obukhov's (1996) scaling argument implies that their effect on the drag is independent of the length of polymer on a given side of the membrane. In contrast, for us to be able to neglect F , dF /dx must be small compared to the force F driving translocation. Denote the free energy of the coil on the cis side of the membrane by F C(x) and that of the coil on the trans side by F T(x); their sum is F (x). Sung and Park (1996) pointed out that F C and F T are simply the free energies of a polymer grafted by one end to a planar surface. For a polymer of length x, this entropic free energy is known to be proportional to kBT ln(x/b), with a coefficient of order unity that depends on whether excluded volume effects are important (Binder, 1983). Ignoring the few monomers actually in the channel, the lengths of polymer on the cis and trans sides of the barrier are x and L - x, respectively, so
ℱ(x) ∝ k<SUB><UP>B</UP></SUB>T<FENCE><UP>ln</UP><FENCE><FR><NU>x</NU><DE>b</DE></FR></FENCE>+<UP>ln</UP><FENCE><FR><NU>L−x</NU><DE>b</DE></FR></FENCE></FENCE>. (6)
For a chain that is a fixed fraction of the way through the hole (i.e., for fixed x/L), dF /dx vanishes like 1/L. Further, it makes little sense to consider x < a, where a is the length of a single monomer, so we must always have dF /dx ~<  kBT/a. Typical values will be much smaller than this bound. The driving force F approx  5kBT/a. thus greatly exceeds dF /dx; indeed, because the polymers used by KBBD are several hundred nucleotides long, F is more than a factor of 100 larger than a typical value of dF /dx. In sum, we have shown that, in the window of polymer lengths,
<FR><NU>k<SUB><UP>B</UP></SUB>T</NU><DE>Fa</DE></FR> &z.Lt; N &z.Lt; <FENCE><FR><NU>k<SUB><UP>B</UP></SUB>Ta</NU><DE>&eegr;b<SUP>3</SUP>v</DE></FR></FENCE><SUP>1/3&ngr;</SUP>, (7)
the polymer is short enough to relax quickly, but long enough that the entropic barrier to crossing the membrane is not too steep. For lengths in this window, the ends of the chain hanging outside of the pore can be neglected compared to the monomers inside the pore. Since the system studied by KBBD falls within this window, we are justified in using simple 1D models to describe it.

    MICROSCOPIC MODEL OF THE PORE
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

Until now, we have avoided specifying the physics of the interactions within the pore. In this section, we present a simple phenomenological model of these interactions. Our main goal is to understand physically how the parameters v and D can vary sufficiently to explain experimental facts like the difference in velocities between polymers moving forward and backward.

Description of the model

We begin by focusing on the polymer backbone, whose coordinate x tells us what fraction of the polymer chain has passed through the channel. If the motion of the backbone is sufficiently slow compared to all the other degrees of freedom in the pore, then we can take x to be the only dynamical variable in the problem. The remaining degrees of freedom are then described by a free energy Phi (x) that depends on the polymer translocation parameter x. The potential Phi (x) can, for example, be expected to have contributions from electrostatic interactions between the polymer and the alpha -hemolysin heptamer. Two unit charges separated by 1 Å in water have an energy of about 6kBT at room temperature; because both polynucleotide and protein have completely ionized groups in physiological pH, it is thus plausible that typical values of Phi  should be at least on the order of several kBT. We split Phi  into a mean slope F determined by the applied voltage drop and a part U(x) that captures the details of the polymer's interactions with the pore: Phi (x) = U(x) - Fx. (In principle, U could depend on the applied voltage and hence on F. We ignore this effect; many of our conclusions will, in any case, turn out to be insensitive to it.) For homopolymers (provided we continue to neglect the degrees of freedom outside the pore), U(x) is periodic, with period a = 1 nucleotide. F is precisely the mean force introduced in Eq. 1. It is equal to eV/a in the simplest picture, but will, in general, be less than this value in the presence of an nonzero ionic current.

Our problem is now formally no different from that of a point particle diffusing in a periodic potential U and driven by a constant force F. The probability P(x) of finding such a particle at a point x is governed by a Smoluchowski equation,
<FR><NU>∂P</NU><DE>∂t</DE></FR>=D<SUB>0</SUB> <FR><NU>∂</NU><DE>∂x</DE></FR><FENCE><FR><NU>∂P</NU><DE>∂x</DE></FR>+<FR><NU>U′(x)−F</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR> P</FENCE> (8)

≡ℒP.
The bare diffusion constant D0 is related through an Einstein relation to some suitable hydrodynamic drag force on the polymer in the channel. It is not to be confused with the effective diffusion constant D that includes the effects of U and describes the polymer's motion on length scales much larger than a. As is common in theories of electrophoresis, we assume that D0 is unaffected by the counterion flow.

It is helpful both for numerical work and for intuition building to have a concrete idea of the simplest form U(x) could take. In particular, such a simplified cartoon will give us an idea of the minimum number of parameters needed to describe the gross features of the potential. A natural choice for such a U(x) is a sawtooth potential of the sort sketched in Fig. 5. It is described by two dimensionless parameters, the peak height U0/kBT and the asymmetry parameter alpha . When alpha  = 1/2, the potential is perfectly symmetrical, whereas alpha  = 0 or 1 corresponds to maximal asymmetry. In addition to U(x), the full potential Phi  contains a term proportional to the driving force F, which figures in the dimensionless group Fa/kBT. Thus, to specify our potential fully, we require the three dimensionless parameters U0/kBT, alpha , and Fa/kBT, as well as D0 and the repeat distance a, which set a time and a length scale. More generally, we expect that any form of U(x) with only one peak per period will be roughly characterized by a peak height U0 (equal to the difference between the minimum and the maximum values of U(x)), and an asymmetry alpha  (defined as the distance between a minimum in U(x) and the next maximum to the right, divided by a). Although we have no a priori information about alpha , we have suggested that U0 should be of order several kBT, and have argued Fa/kBT approx  5 for KBBD's experiments.



View larger version (5K):
[in this window]
[in a new window]
 
FIGURE 5   Sketch of the sawtooth cartoon potential discussed in the text. The potential has period a, and alpha a is the distance from one minimum to the next maximum. The parameter U0 gives the energy difference between minimum and maximum.

Because the time required to diffuse over a barrier depends exponentially on the barrier height, small differences in U0 can lead to significant changes in translocation speed, consistent with KBBD's observations. Further, if U(x) is asymmetrical, forces F and -F will lead to different barrier heights, and thus to different mean drift speeds for the diffusing polymer. Figure 6 illustrates this point. Unfortunately, a change in the sign of F does not correspond directly to changing the polymer orientation from 3' end first to 5' end first. As shown in Fig. 7, three different vector quantities can be oriented relative to the membrane: the applied electric field, the alpha -hemolysin pore, and the DNA. Each can point toward the cis or the trans chamber. With, say, the electric field held fixed, there are four possible situations. The two that have been realized in the experiments of KBBD (corresponding to B and D in Fig. 7) are related by a flip of the polymer, whereas transforming U(x) maps to  U(-x) (or equivalently F maps to  -F) in our model amounts to changing the direction of the pore. Thus, the two situations probed by KBBD correspond, in our simple model, to two different potentials U(x); they may have different translocation speeds even as the applied voltage V tends to zero. In contrast, two orientations related by F maps to  -F (AB and CD in Fig. 6) must have the same linear response to an applied field, and thus the same translocation speed for small enough V. Because the polymer is asymmetric, however, they may have different translocation speeds outside the linear response regime for large enough V. After all four possible situations have been explored experimentally, it should be possible to observe that four different translocation speeds at finite V collapse onto two speeds as V decreases, and, thereby, to estimate the value of alpha  or even of Fa/kBT by comparing data for the appropriate pairs of orientations.



View larger version (12K):
[in this window]
[in a new window]
 
FIGURE 6   Sketch showing how asymmetry in the potential can lead to different speeds for forward and backward motion. A bias is applied to the unperturbed potential (A) so that it has the same average gradient in the two bottom pictures. The potential at the right (B), however, has been reflected through the vertical axis before the gradient is applied. It thus has smaller barriers to hopping from one minimum to the next than the potential at left (C), leading to slower dynamics.



View larger version (24K):
[in this window]
[in a new window]
 
FIGURE 7   The four possible relative orientations of polymer, pore, and applied electric field. In KBBD's experiments, (B, D) the relative orientation of the pore and field is fixed and the orientation of the polymer is allowed to vary. In our microscopic model, A is related to B and C is related to D by the transformation F maps to  -F. More generally, the coefficients giving the linear response to sufficiently small voltages should be the same for the two orientations in each of the pairs (AB) and (CD), but should differ between pairs.

Effective mobility and diffusion coefficient

We now turn to the task of calculating the parameters v and D that describe the behavior of Eq. 8 on long length scales. Several approaches are available; in this section, we will describe the results of an analysis based on ideas of Risken (1984). Details of the calculation, which relies on an eigenfunction expansion, are given in Appendix B. In the most general case, v and D have fairly complicated forms, but relatively simple limiting cases capture most of the relevant behavior. For example, one finds (le Doussal and Vinokur, 1995; Scheidl, 1995)
<FR><NU>1</NU><DE>v</DE></FR>=<FR><NU>1</NU><DE>D<SUB>0</SUB></DE></FR> <LIM><OP>∫</OP><LL>0</LL><UL>∞</UL></LIM> <UP>d</UP>z <UP>exp</UP><FENCE><UP>−</UP><FR><NU>Fz</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE> (9)

<LIM><OP>∫</OP><LL>0</LL><UL><UP>a</UP></UL></LIM> <FR><NU><UP>d</UP>x</NU><DE>a</DE></FR> <UP>exp</UP><FENCE><FR><NU>U(x+z)−U(x)</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE>,
from which a number of limiting behaviors can be extracted. Several equivalent expressions for v, as well as a similar, but more involved, expression for D, can also be obtained.

Figure 8 plots the velocity v versus F for polymers traveling in two different directions in the same (asymmetric) potential. At typical values of F, differences in velocity between forward and backward motion of a factor of 3 or more are easily obtained. Likewise, the calculated velocities are much slower than they would have been in the absence of a potential.



View larger version (21K):
[in this window]
[in a new window]
 
FIGURE 8   Plot of the (nondimensionalized) average velocity v from Eq. 9 versus the driving force Fa/kBT; v is calculated using our microscopic model with a sawtooth potential. The parameter values are U0/kBT = 10; alpha  = 0.7 for the upper curve and alpha  = 0.3 for the lower curve. The potentials for the two curves are thus related by U(x) maps to  U(-x). (Inset) The diffusive length ld versus the barrier height U0 of the sawtooth potential, for fixed driving force Fa = 5kBT and asymmetry alpha  = 0.7. Note that over the entire range of U0, ld ~<  a.

One can gain more quantitative insight into both of these observations by studying how v and D behave in various limiting cases. Relegating the derivations to Appendix B, we next consider several such expressions. Three cases are particularly of interest: large and small driving force F, and large potential barriers U0 (the case of small U0 corresponds to the absence of a potential and was discussed earlier). For small F, v and D must satisfy an Einstein relation. Indeed, in this limit one finds,
v=<FR><NU>D<SUB>0</SUB>F</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR> <FR><NU>1</NU><DE>I<SUP>(0)</SUP><SUB>1</SUB>I<SUP>(0)</SUP><SUB>2</SUB></DE></FR><FENCE>1+𝒪<FENCE><FR><NU>Fa</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE></FENCE>

<UP>and</UP> (10)

D=D<SUB>0</SUB> <FR><NU>1</NU><DE>I<SUP>(0)</SUP><SUB>1</SUB>I<SUP>(0)</SUP><SUB>2</SUB></DE></FR><FENCE>1+𝒪<FENCE><FR><NU>Fa</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE></FENCE>,
where
I<SUP>(0)</SUP><SUB>1</SUB>=<LIM><OP>∫</OP><LL>0</LL><UL><UP>a</UP></UL></LIM> <FR><NU><UP>d</UP>x</NU><DE>a</DE></FR> <UP>exp</UP><FENCE><FR><NU>U(x)</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE>

<UP>and</UP> (11)

I<SUP>(0)</SUP><SUB>2</SUB>=<LIM><OP>∫</OP><LL>0</LL><UL><UP>a</UP></UL></LIM> <FR><NU><UP>d</UP>x</NU><DE>a</DE></FR> <UP>exp</UP><FENCE><FR><NU><UP>−</UP>U(x)</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE>.
Thus, v/D = F/kBT, as the fluctuation-dissipation theorem requires, but the effective diffusion coefficient D is reduced from its bare value D0 by a factor that grows exponentially with the characteristic height of the potential. Perhaps more surprising is the fact that a linear response-like regime is also reached for sufficiently large F. As F right-arrow infinity ,
v=D<SUB>0</SUB>F<FENCE>1+𝒪<FENCE><FR><NU>U<SUB>0</SUB></NU><DE>Fa</DE></FR></FENCE><SUP>2</SUP></FENCE>

<UP>and</UP> (12)

D=D<SUB>0</SUB><FENCE>1+𝒪<FENCE><FR><NU>U<SUB>0</SUB></NU><DE>Fa</DE></FR></FENCE><SUP>2</SUP></FENCE>.
The physical content of this result is that, when F is much larger than a typical force derived from U(x), Phi '(xapprox  -F, and contributions from U may be neglected entirely. In the opposite limit of large U0, one might expect that the diffusion process can essentially be described as hopping from one potential minimum to the next. Approximate formulas based on the Kramers escape rate (van Kampen, 1992) should then apply. In fact, for large U0 one finds
  v≃<FR><NU>D<SUB>0</SUB></NU><DE>aI<SUP>(0)</SUP><SUB>1</SUB>I<SUP>(0)</SUP><SUB>2</SUB></DE></FR><FENCE><UP>exp</UP><FENCE><FR><NU>&agr;Fa</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE>−<UP>exp</UP><FENCE><UP>−</UP><FR><NU>(1−&agr;)Fa</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE></FENCE> (13)
and
  D≃<FR><NU>D<SUB>0</SUB></NU><DE>2I<SUP>(0)</SUP><SUB>1</SUB>I<SUP>(0)</SUP><SUB>2</SUB></DE></FR><FENCE><UP>exp</UP><FENCE><FR><NU>&agr;Fa</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE>+<UP>exp</UP><FENCE><UP>−</UP><FR><NU>(1−&agr;)Fa</NU><DE>k<SUB><UP>B</UP></SUB>T</DE></FR></FENCE></FENCE>. (14)
As before, we select the origin of U(x) so that its maximum and minimum in each period occur at points xmax > xmin, with xmax - xmin = alpha a.

We have already estimated from KBBD's data that ld triple-bond  D/v approx  40a. A striking feature of the asymptotic forms Eqs. 10-14 just obtained is that all three imply a much smaller value. As we noted when we introduced the parameter ld, the linear response results both yield ld = kBT/F; given our naive estimate Fa/kBT approx  5, we find ld approx  a/5 << 40a. For U0 large enough that the hopping approximation of Eq. 14 applies, this order of magnitude is little changed even as F right-arrow infinity . Indeed, in this limit, Eq. 14 gives ld = a/2. It is, of course, possible that some particular form of U(x) with finite U0 and F might lead to a value of ld of order 40a. It seems more likely, however, that ld interpolates reasonably smoothly among its various limiting values. The inset to Fig. 8 illustrates this point for the sawtooth potential introduced earlier. Although v and D each separately can depend strongly on the shape of U(x), their ratio is far less sensitive. We are thus led to one of the central conclusions of this paper: while many aspects of KBBD's results can be qualitatively explained by a model of diffusion in a 1D periodic potential, the observed width of their peaks is inconsistent with this model if one takes Fa approx  5kBT.

    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL BACKGROUND
COARSE-GRAINED DESCRIPTION
REGIME OF VALIDITY
MICROSCOPIC MODEL OF THE...
DISCUSSION
CONCLUSION
APPENDIX A
APPENDIX B
APPENDIX C
REFERENCES

In the previous section, we argued that the peaks in KBBD's distribution of first-passage times are much wider than is consistent with our minimal 1D model. It is not difficult to suggest reasons why this might be the case. Perhaps the most obvious is that Fa/kBT could differ significantly from 5. Not only would a decrease of a factor of 100 in F bring our prediction for ld into line with experimental observations, it would also explain the polymer's unexpectedly slow translocation speed. At least two effects might decrease F. First, unless the pore has infinite resistance, not all of the applied voltage drop V will be across the pore. Although the large resistance of the alpha -hemolysin channel makes it unlikely that this mechanism could diminish F by orders of magnitude, it certainly leads to some decrease. Second, the fact that there is a nonzero ionic current flowing through the pore while the polymer is translocating means that the motion of the polymer itself need not satisfy detailed balance. That is, the error rate, or ratio of the probabilities of moving forward one base to moving backward one base, is no longer required to be equal to exp(eV/kBT). To use a somewhat different language, as the counterions are forced through the pore by the electric field, they entrain some of the solvent along with them. This solvent flow exerts an additional drag force on the polymer, and this drag contributes to the mean force F. As a result, the electrophoretic mobility of the polymer in the channel is not, in general, equal to its hydrodynamic mobility multiplied by its charge. Appendix C presents simple estimates based on continuum mechanics that suggest that both of these effects are small. These estimates, however, make a number of simplifications; indeed, even the validity of the continuum equations is not assured on the nanometer scale. Given the importance of a large value of Fa/kBT to any attempts to sequence polynucleotides using the alpha -hemolysin pore, it thus seems desirable to verify experimentally that it is indeed of order 5.

Although a smaller than expected driving force is certainly one mechanism that would generate wider peaks, others exist that do not require a large error rate. In many ways, our most poorly justified assumption is that the motion of the polymer backbone through the pore is much slower than the relaxation of every other degree of freedom in the system, so we begin by considering what might happen if this assumption were to break down. For example, the protonation state of the open alpha -hemolysin channel is known to fluctuate on a much slower time scale than the characteristic polymer time a/v ~ 1 µs (Kasianowicz and Bezrukov, 1995; Bezrukov and Kasianowicz, 1993), and the energy barrier to moving a base through the pore might change significantly when the protonation state changes. It is instructive to consider a naive extension of our 1D model meant crudely to describe such a situation. Suppose that the pore + polymer system can be in one of two states, state 1, in which the polymer backbone can diffuse freely, and state 2, in which the backbone is trapped and cannot move. Let there be a transition rate (per time)