help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Schwartz, R.
Right arrow Articles by Berger, B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Schwartz, R.
Right arrow Articles by Berger, B.

Biophys J, December 1998, p. 2626-2636, Vol. 75, No. 6

Local Rules Simulation of the Kinetics of Virus Capsid Self-Assembly

Russell Schwartz,* Peter W. Shor,# Peter E. Prevelige Jr.,§ and Bonnie Berger*

 *Laboratory for Computer Science and  Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139;  #AT&T Labs, Florham Park, New Jersey 07932, and  §Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA

    ABSTRACT
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

A computer model is described for studying the kinetics of the self-assembly of icosahedral viral capsids. Solution of this problem is crucial to an understanding of the viral life cycle, which currently cannot be adequately addressed through laboratory techniques. The abstract simulation model employed to address this is based on the local rules theory of Berger et al. (1994. Proc. Natl. Acad. Sci. USA. 91:7732-7736). It is shown that the principle of local rules, generalized with a model of kinetics and other extensions, can be used to simulate complicated problems in self-assembly. This approach allows for a computationally tractable molecular dynamics-like simulation of coat protein interactions while retaining many relevant features of capsid self-assembly. Three simple simulation experiments are presented to illustrate the use of this model. These show the dependence of growth and malformation rates on the energetics of binding interactions, the tolerance of errors in binding positions, and the concentration of subunits in the examples. These experiments demonstrate a tradeoff within the model between growth rate and fidelity of assembly for the three parameters. A detailed discussion of the computational model is also provided.

    INTRODUCTION
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

The pathway by which icosahedral virus protein coats assemble from their subunits is an important and incompletely described phenomenon, consisting of the self-assembly of many proteins into a complicated but regular structure. Capsid structure has been described by the quasiequivalence theory of Caspar and Klug (1962), which provides an explanation for the final assembled state of a capsid. However, quasiequivalence offers little predictive power regarding the process of assembly. Although the subunits in a capsid are usually chemically identical, they can adopt distinct conformations in the assembled capsid, with the conformation of any given subunit attained in a pathway-dependent fashion (Rossman, 1984; Johnson and Speir, 1997). It is not currently known what constraints, if any, there are on assembly pathways or the distribution of intermediates. In addition, it is unknown what physical properties of coat proteins would enforce these constraints. Other open questions regarding the assembly process include the points at which conformational switching occurs and the source of the observed nucleation-limited behavior (Prevelige et al., 1993). All icosahedral viruses must have some means of addressing these questions, although the mechanistic answers may differ from one virus to another. Answers to these and other questions may provide insight into viral assembly and have the potential to assist in developing novel approaches to interfering with viral infection.

Experimental techniques for examining these pathway-dependent processes have thus far been hampered by the complexity of the system. However, modeling and simulation-based approaches have the potential to assist in understanding such processes. Prior modeling and simulation work has provided insight into many aspects of virus capsid assembly but has not been directed at answering the specific questions regarding assembly kinetics addressed by the present work. Horton and Lewis (1992) developed a method of using association energies of viral coat proteins to predict assembly intermediates. Reddy et al. (1998) extended this method, applying the CHARMM energy calculation program (Brooks et al., 1983) to the crystallographic structures of three icosahedral viruses to estimate the interaction energies that would be applied to the prediction of intermediates. The method, however, relies on the assumption that the stability and distribution of intermediates can be reliably predicted by examination of the final assembled structure. Given the nature and extent of conformational changes accompanying capsid assembly (Steven et al., 1992; Galisteo and King, 1993; Prevelige et al., 1994), this assumption may not be reasonable for examining some aspects of capsid assembly. Furthermore, as the present work attempts to demonstrate, unexpected behaviors may appear during assembly, even when the energetics of interactions are understood, suggesting that the sort of information provided by the work of Horton and Lewis and Reddy et al. might be supplemented by additional modeling work. Marzec and Day (1993) modeled capsomeres as generic "morphological units" and showed that allowing them to move freely over the surface of a sphere, minimizing a potential energy function, could give rise to observed quasiequivalent patterns. Tarnai et al. (1995) used a similar approach, searching for maximally dense packings of pentagons on a sphere, to model viruses such as SV40 and papilloma virus, which are composed entirely of pentamers. Although the approaches of Marzec and Day and Tarnai et al. are useful for studying symmetries of the final shell, they provide little insight into its assembly process. Zlotnick (1994) developed a model based on the assumption that fully formed and partially formed capsids exist at equilibrium with free coat proteins, giving rise to kinetics that appear to be nucleation-limited; this approach allowed for a more dynamic simulation of the growth process than other methods. Zlotnick's approach, though, examined interactions at the scale of population distributions of large numbers of intermediates over time, rather than providing detailed information about a small number of capsids, as the present work attempts to do. Furthermore, Zlotnick's simulations were specific to his equilibrium model. It can also be noted that none of the aforementioned approaches can model irregular or malformed structures, an important goal of the present work.

More general simulation techniques are also unable to address the concerns of the present work, because of the computational cost of the problem. Low-level molecular models that work at atomic or even amino acid resolution cannot handle more than a few proteins the size of a typical viral coat protein. Simulations of capsid assembly kinetics must consider hundreds or thousands of coat proteins; the molecular dynamics techniques currently used for protein structure determinations or protein docking simulations are therefore unusable for studying many questions of capsid assembly kinetics. Simpler lattice models that rely on combinatorial optimization would also be unsuitable for this problem, because of their focus on equilibrium values rather than kinetics.

Because the problems examined by the present work cannot currently be adequately addressed experimentally or by prior modeling and simulation approaches, we have attempted to explore them through more abstract modeling techniques based on the principle of local rules (Berger et al., 1994). The local rules theory proposes that coat proteins take on distinct conformations determined by sets of "local rules," in which a protein chooses its conformation and its relative position based on the conformations of its immediate neighbors. The local rules theory makes it possible to describe the high-level symmetries of a completed capsid, using only information available to individual subunits. Fig. 1 shows the simplest local rules set, which describes a T = 1 geometry. The rule indicates that a protein in conformation 1 is connected to three other proteins, each of conformation 1. The angles between successive binding sites are ~120°, ~120°, and ~108°. This rule provides an abstract mechanism by which a T = 1 capsid could form; each subunit, as it connects to any other subunit, need only ensure that its positioning relative to its neighbors is consistent with the rule to ensure that the completed capsid will have T = 1 symmetry. Fig. 2 shows a more complicated T = 7 rules set with seven conformations. (One also exists with four conformations.) Local rules can also describe such complicated structures, although that capability is not applied in the present work. The local rules theory has offered explanations for several puzzling observations about capsid assembly. Berger and Shor (1995) showed how only a minor change in a rule set could select between T = 4 and T = 7 geometries, providing a possible explanation for observed links between the two geometries (Dokland et al., 1992; Earnshaw and King, 1978; Katsura 1983; Thuman-Commike et al., 1998). In addition, Berger et al. (1994) used an early computer model of local rules to explore the robustness of rules sets to small deviations from their ideal binding angles and to demonstrate a possible source of observed shell malformations resulting from an error in applying local rules.


View larger version (11K):
[in this window]
[in a new window]
 
FIGURE 1   Local rule for a T = 1 capsid. Angles specify the angle between binding interactions. Arrows distinguish different edge types, so that any edge with an outward arrow on one subunit must connect to an edge with an inward arrow on another subunit, whereas an edge with no arrow on one subunit must connect to an edge with no arrow on another subunit. Assembling subunits consistently with these rules can only produce a T = 1 shell or a subset thereof.


View larger version (16K):
[in this window]
[in a new window]
 
FIGURE 2   One set of local rules for a T = 7 capsid. As with Fig. 1, angles specify angles between binding interactions and arrows distinguish types of edges. This rule set specifies seven distinct conformations, each with its own set of neighbors and binding interactions. Assembling subunits according to these rules can only produce partial or complete T = 7 shells.

In this paper, we apply the local rules theory to the development of an abstract model for subunit-subunit interactions that allows for computationally feasible simulations of the overall self-assembly reaction. By assuming that assembly proceeds according to local rules, we can remove many details that then become irrelevant to answering many of the questions that concern us. For example, by assuming that interprotein binding always follows sets of local rules, we can disregard the complex combination of electrostatics, hydrophobic interactions, and other processes that may actually be involved in binding. Other simplifications include the use of probabilistic Boltzmann distributions for handling binding and conformational shifting and the use of a model of Brownian motion for maintaining realistic thermodynamic behavior. By relying on theoretical and statistical models to remove details that do not appear to be relevant to the questions we are asking, we have been able to make the problem computationally tractable. This approach has the additional benefit of allowing for a model that is general with respect to the specific details of assembly and can be adapted to many different assembly models. It remains to be shown that we have retained sufficient detail to use our model as a predictive tool for studying self-assembly behavior.

The purpose of this paper is to describe our approach to the computer modeling of capsid assembly dynamics. To illustrate what can be accomplished with this approach, the paper presents the results of a series of simulation experiments conducted using a simplified version of the simulation model. Although the central focus of this paper is the simulator itself, rather than the results of the particular simulation experiments presented, the simulations nonetheless raise some interesting results relating growth rates and malformation rates of simulated T = 1 capsids, which the paper describes. It then draws some conclusions about our results and the applicability of our techniques to future work in biochemical simulation. Finally, the paper describes the simulation model and provides details of how it was implemented for the present work.

    MATERIALS AND METHODS
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

Computer model

Defining a simulated system under the local-rules model requires specifying some basic assembly subunit. The subunit may be an individual coat protein, as in the simulation experiments. However, the model also allows for larger subunits, such as dimers or capsomers, which can themselves assemble according to local-rules binding patterns. It should also be noted that even if a larger subunit is used by the system being modeled, the simulation model described here can capture correct assembly in terms of monomers by biasing monomer kinetics to form the correct subunit more readily than other subsets of a capsid. Simulated subunits are built from unions of spheres, each with a characteristic mass, radius, and binding configuration. In the simplest form, a subunit can be modeled as a single sphere with a set of bonds, each of which represents a potential binding interaction with another subunit. One such subunit is shown in Fig. 7 A. Subunits of different shapes can be made from unions of these simple spheres, modeled as if they were rigidly connected; such a subunit is shown in Fig. 7 B. (This representation is not used in the experiments presented here.)

Binding interactions between subunits are modeled through bonds that form between separate subunits. Each bond has a characteristic length, direction, rotational vector, pair of activation energies for the association and dissociation reactions, set of spring constants used to model the forces exerted by binding interactions, set of tolerances, and set of allowable neighbors. A bond can attach to or detach from another bond probabilistically according to its characteristic energies, provided the two bonds are allowed to be neighbors and their locations are within their permitted tolerances. For a bond with activation energy of association Er and activation energy of dissociation -Ed, the probability of binding to another bond of an allowed type within the correct tolerance is eEr/kbT/(1 + eEr/kbT), and the probability of breaking an existing bond is (1 + eEd/kbT)-1, where kb is Boltzmann's constant and T is the temperature. The forces bound subunits exert on each other are described in the next section.

Conformational shifting is implemented by allowing subunits to take on several shapes, each with a characteristic energy. A subunit shifts between its allowed shapes probabilistically according to the Boltzmann distribution of the energies. The probability of a subunit shifting to a conformation k on a particular time step is given by
P(k)=<FR><NU>e<SUP><UP>E<SUB>k</SUB>/k<SUB>b</SUB>T</UP></SUP></NU><DE>∑<SUB><UP>i</UP></SUB>e<SUP><UP>E<SUB>i</SUB>/k<SUB>b</SUB>T</UP></SUP></DE></FR>
where Ei is the energy of conformation i, T is the temperature, and kb is Boltzmann's constant. If the subunit has formed any bonds, then the energies of those bonds are added to the energy of the current conformation, as they must be broken to shift conformations. Multiple-conformation subunits can be used as subsets of other subunits, allowing separate domains to shift conformations independently, although this functionality is not used in the experiments presented here.

Subunit-subunit and subunit-solvent interactions

The most important contributions to the force on a bound subunit are the bond forces exerted by its neighbors. Each bond is forced toward a characteristic angle and length. The forces pushing a bond toward its ideal value are modeled as three springs, representing a translational force, Ft, a rotational torque around the bond, Tr, and a bending torque, Ts, that straightens bonds. These forces are calculated by the following equations:
F<SUB><UP>t</UP></SUB>=k<SUB><UP>t</UP></SUB>(<A><AC>e</AC><AC>&cjs1164;</AC></A><SUB>2</SUB>−<A><AC>e</AC><AC>&cjs1164;</AC></A><SUB>1</SUB>)
T<SUB><UP>r</UP></SUB>=k<SUB><UP>r</UP></SUB>[(<A><AC>u</AC><AC>&cjs1164;</AC></A><SUB>1</SUB>×<A><AC>u</AC><AC>&cjs1164;</AC></A><SUB>2</SUB>) · <A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>1</SUB>]<A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>1</SUB>
T<SUB><UP>s</UP></SUB>=k<SUB><UP>s</UP></SUB><RAD><RCD>0.5+0.5(<A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>1</SUB> · <A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>2</SUB>)</RCD></RAD> <FR><NU><A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>2</SUB>×<A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>1</SUB></NU><DE>∥<A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>2</SUB>×<A><AC>d</AC><AC>&cjs1164;</AC></A><SUB>1</SUB>∥</DE></FR>
Here, kt, kr, and ks are user-supplied spring constants; <A><AC>e</AC><AC>&cjs1164;</AC></A>1 and <A><AC>e</AC><AC>&cjs1164;</AC></A>2 are the end points of bonds one and two; <A><AC>u</AC><AC>&cjs1164;</AC></A>1 and <A><AC>u</AC><AC>&cjs1164;</AC></A>2 are rotational vectors defining the desired relative rotations of the two subunits around the bonds; and <A><AC>d</AC><AC>&cjs1164;</AC></A>1 and <A><AC>d</AC><AC>&cjs1164;</AC></A>2 are the directions of the two bonds. The end points of the bonds define the optimal relative positions of the two bound subunits and need not have any relationship to the sizes of the two particles.

Collisions between two subunits or between a subunit and an artificial boundary around the simulated solution create a second class of forces. When two spheres collide, they exert a force on each other, given by
<UP>min</UP><FENCE>C <FR><NU>(r<SUB>1</SUB>+r<SUB>2</SUB>)<SUP>2</SUP>−(r<SUB>1</SUB>+r<SUB>2</SUB>)∥<A><AC>d</AC><AC>&cjs1164;</AC></A>∥</NU><DE>∥<A><AC>d</AC><AC>&cjs1164;</AC></A>∥<SUP>2</SUP></DE></FR>, 2C</FENCE>
where C is a constant set at 50 kDa · nm/µs2, <A><AC>d</AC><AC>&cjs1164;</AC></A> is a vector between the centers of the spheres, and r1 and r2 are their radii. Likewise, when a sphere collides with the boundary of the simulation, the boundary exerts a force on the sphere, given by
<UP>min</UP><FENCE>Cm <FR><NU>1−o/r</NU><DE>(o/r)<SUP>2</SUP></DE></FR>, 12CM</FENCE>
where M is the mass of the subunit, r is the radius of the sphere, o is the overlap distance between the edge of the sphere and the edge of the simulation, and C is defined as above. These forces were selected by trial and error to give reasonable collision behavior while being inexpensive to compute and allowing quick convergence of the numerical methods.

The final class of forces results from a model of Brownian motion, intended to keep the average kinetic energy of the particles close to a constant over long periods of time. This model consists of two forces: a damping force and a randomized force of adaptive magnitude designed to increase kinetic energy. The damping force on each time step or partial time step is implemented by the assignments
<A><AC>v</AC><AC>&cjs1164;</AC></A>←<A><AC>v</AC><AC>&cjs1164;</AC></A>−d<A><AC>v</AC><AC>&cjs1164;</AC></A>M<SUP>2/3</SUP>
<A><AC>o</AC><AC>&cjs1164;</AC></A>←<A><AC>o</AC><AC>&cjs1164;</AC></A>−d<A><AC>o</AC><AC>&cjs1164;</AC></A>I<SUP>2/3</SUP>
where <A><AC>v</AC><AC>&cjs1164;</AC></A> is the velocity, <A><AC>o</AC><AC>&cjs1164;</AC></A> is the angular velocity, M is the subunit mass, I is its moment of inertia about its axis of rotation, and d is a user-supplied damping constant. This damping force alone would tend to reduce the energy of a simulation. The randomized force adds small perturbations to the subunit velocity, whose distributions are adaptively chosen to tend to increase subunit energy. The two forces together prevent small numerical errors from gradually increasing or decreasing the average kinetic energy over the course of a simulation but avoid drastically altering subunit paths over short distances.

Numerical methods

The central computational problem of the simulator is the integration of the equations of motion based on the forces described above. The equations of motion can be described in terms of the vectors of velocities, <A><AC>v</AC><AC>&cjs1164;</AC></A>, and positions, x, as two differential equations:
<FR><NU><UP>d</UP><A><AC>x</AC><AC>&cjs1164;</AC></A></NU><DE><UP>d</UP>t</DE></FR>=<A><AC>v</AC><AC>&cjs1164;</AC></A>
<FR><NU><UP>d</UP><A><AC>v</AC><AC>&cjs1164;</AC></A></NU><DE><UP>d</UP>t</DE></FR>=f(<A><AC>x</AC><AC>&cjs1164;</AC></A>, <A><AC>v</AC><AC>&cjs1164;</AC></A>)
where f computes the accelerations based on the current positions and velocities. The overall approach employed is an adaptive Euler method with Richardson extrapolation. This method is run in parallel, using the Cilk multithreading system (Blumofe et al., 1995).

The Euler method is the simplest numerical integration scheme, computing the simulation state at a future time based on the state at the current time. A forward Euler for integrating velocity is combined with a backward Euler for integrating position, giving the following pair of iterations:
<A><AC>v</AC><AC>&cjs1164;</AC></A><SUB><UP>n+1</UP></SUB>=<A><AC>v</AC><AC>&cjs1164;</AC></A><SUB><UP>n</UP></SUB>+&Dgr;tf(<A><AC>x</AC><AC>&cjs1164;</AC></A><SUB><UP>n</UP></SUB>, <A><AC>v</AC><AC>&cjs1164;</AC></A><SUB><UP>n</UP></SUB>)
<A><AC>x</AC><AC>&cjs1164;</AC></A><SUB><UP>n+1</UP></SUB>=<A><AC>x</AC><AC>&cjs1164;</AC></A><SUB><UP>n</UP></SUB>+&Dgr;t<A><AC>v</AC><AC>&cjs1164;</AC></A><SUB><UP>n+1</UP></SUB>
This gives a first-order accurate expression that requires one function evaluation for each time step.

Because of the irregular nature of the problem, this integration scheme is modified with an adaptive step size. When the simulator attempts to advance the time by one unit of time, it first evaluates the problem with a step size of one unit. It then repeats the evaluation for all subunits with step sizes of 0.5 units. It then uses an approximate test of convergence, testing whether the results for these two times are within a user-specified tolerance. If the two values are not within that tolerance, then the evaluation is repeated with those subunits, using half of the previous step size, and all other subunits, using the same step size used on the previous round. This process continues until all parameters have converged. Although this process does not provide any absolute guarantee of bounded errors, it does set an approximate bound on the values of numerical errors.

One further complication in the method is the use of Richardson extrapolation. Richardson extrapolation is a technique for combining lower order representations to generate a higher order representation. This is combined with the adaptive step size, so that the values from different step sizes are extrapolated to produce an nth-order approximation when n different step sizes are used. Because of the possibility of discontinuous forces interfering with the extrapolation, the simulator tries extrapolating with the k smallest step sizes available, for each value of k, and uses any that have converged, favoring higher order values if more than one extrapolated value converges on a given step.

The numerical methods described here are implemented in a parallel, thread-based program. Each time one step of the Euler method is performed, a separate thread is spawned for each subunit, allowing force calculations to proceed in parallel.

User interface

The simulator uses a graphical user interface running as a front end to an interpreted control language. The graphical interface allows users to enter most common commands through a simple, button-based system. Under this system, users can easily manipulate the graphics display, add or remove subunits, advance the simulation time, or load and save simulations, using only a few button and key presses. Thus most commands can be handled quickly and with little knowledge of simulator details.

At a lower level, all user interactions are handled by the interpreted control language. User commands entered through the graphical interface are translated into text commands, which are passed to the interpreter. In addition, users can type commands directly to the interpreter. The interpreter is also accessible by loading data files, allowing users to create loadable modules that can redefine many aspects of simulator behavior. Users design simulations by creating "rules files"; these are files of commands written in the control language that define the simulation parameters, add the simulated subunits, and create any other data structures needed by a particular simulation. The use of a powerful control language ensures that the simulator is sufficiently versatile to adapt to a wide range of user needs, and the graphical interface maintains ease of use for common commands.

Computational issues

The simulator is written primarily in C. However, the force calculations and most numerical routines are implemented with the Cilk multithreading system (Blumofe et al., 1995), a parallel extension to C portable to several serial and parallel architectures. For the tests described here, the parallel routines ran on an eight-processor Sun Ultra Enterprise 5000 symmetric multiprocessor. The user interface ran separately on a Silicon Graphics Indigo workstation, which generated the graphics contained in this paper.

    RESULTS
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

Experimental design

Each simulation involves a number of subunits, representing coat proteins, which move freely throughout a simulated solution. Subunits are capable of forming binding interactions with each other in accordance with local rules specifying allowed binding patterns, provided they are within specified angular and distance tolerances of the ideal values given by the local rules. Although the ideal T = 1 rules can only produce correctly formed T = 1 capsids, the angular and distance tolerances and the flexibility of existing binding interactions allow for nonoptimal binding, which can lead to malformed capsids. Simulations can be halted at various times to record data on the current state of the population, allowing measurements of the numbers of on- and off-pathway subunits at regular intervals. The reader is referred to Materials and Methods for details of the simulation model.

For these experiments, we did not attempt to model any particular virus, but rather to create a generic model of T = 1 capsid assembly. Parameters were initially chosen to be reasonably close to what would be expected for actual virus capsids and then were adjusted empirically, based on simulation runs, to give rapid growth. Subunits were designed to polymerize according to the autostery model of Caspar (1980); all subunits therefore shift between two conformations, a more stable, nonbinding conformation with a potential energy of -2 kcal/mol, and an unstable binding conformation with zero potential energy. Bond angles for the binding conformation were chosen to be ideal for a T = 1 capsid. A single sphere was used to represent each coat protein. For this sphere, a mass of 100 kDa and a diameter of 3 nm in the binding conformation and 1 nm in the nonbinding conformation were employed. The different sizes were chosen to make the two conformations visually distinct, rather than for any anticipated effect on the simulation results. Parameters associated with binding interactions were selected empirically to allow a rapid rate of capsid growth while keeping the rate of malformations low. The base binding energies, which control the probability of bonds forming when they are within allowed tolerances and their probability of subsequently breaking, were set at -8500 kcal/mol. An angular tolerance of 30° and a base distance tolerance of 10 nm were chosen. Each test used 300 subunits. The base work space size was 125 nm in each dimension, giving a concentration of 2.47 × 10-4 mol/liter, or 24.7 mg/ml. The unusually high concentration was meant to give a rapid growth rate, a constraint made necessary by the high computational costs of simulations. However, the authors believe qualitatively similar results could be achieved at substantially lower concentrations, given sufficient time, by adjusting binding energies or tolerances to compensate for the reduced thermodynamic probabilities of assembly a low concentration would produce. The binding energy, tolerance, and work space size were modified for some simulations, as will be described below. The meanings of the various simulation parameters are described in more detail in Materials and Methods.

Growth and malformation rates were measured by examining, at five multiples of 5000 time steps, the numbers of nucleated on- and off-pathway subunits. For the purposes of the experiment, a cluster of subunits was defined to be nucleated if it was a connected cluster of five or more subunits. A subunit was defined to be on-pathway if it was attached only to a subset of a correctly formed T = 1 capsid and off-pathway if it was attached to a cluster, any part of which was inconsistent with a correctly formed T = 1 capsid. Thus even a single incorrect bond in an otherwise correctly formed cluster resulted in our classifying all subunits in the cluster as off-pathway. The fixed cutoff time of 25,000 time steps was chosen to allow a significant amount of growth. It has not been possible to determine the exact mapping between time steps and physical time in terms of time-dependent phenomena such as the diffusion rate. However, it is expected that the assembly times of the simulated capsids are substantially shorter than the assembly time of an actual capsid, largely because computational limits required biasing the binding parameters toward very rapid growth to perform the experiments in a reasonable time. Each simulation required ~5 h of computer time, so the time required to run them substantially longer would have been prohibitive.

Binding energy

The first experiment compares results of the base simulation to one in which binding energies were altered by -1 kcal/mol and by -2 kcal/mol. The results are shown in Fig. 3.


View larger version (37K):
[in this window]
[in a new window]
 
FIGURE 3   Plot of growth as binding energy is varied. These graphs show the numbers of subunits involved in nucleated clusters, in (A) total growth, (B) off-pathway growth, and (C) on-pathway growth. Each plot shows values for the base simulation and for simulations in which binding energies were altered by -1 kcal/mol and -2 kcal/mol. Data are presented at multiples of 5000 time steps.

Fig. 3 A shows the amount of total growth for each of the three simulations, including both on- and off-pathway growth. The graph shows growth in all three plots increasing at each time step, with higher rates of growth resulting from stronger binding interactions.

Fig. 3 B shows the amount of off-pathway growth for the three simulations. With the exception of time step 10,000, stronger binding interactions again consistently produced more growth. The exception at time step 10,000 occurred because at that time the base simulation happened to experience a temporary increase in malformations that were corrected soon afterward, whereas the -1 kcal/mol simulation experienced a temporary reduction, as several malformations were corrected shortly before that time step. One feature to note is that the amount of off-pathway growth for a particular simulation sometimes decreases between two time steps; this is a result of malformations self-correcting during the simulations.

Fig. 3 C shows on-pathway growth for the three simulations. (This is equivalent to the difference between the values of the two previous plots.) As with off-pathway growth, values occasionally fell between two time points, typically as a result of a correctly formed cluster producing an incorrect bond. However, unlike the previous two graphs, this one does not show the same simple relationship between binding energies and growth rates. At various times, each of the three simulations had the highest total amount of growth. However, by time step 25,000 (the end of the simulation) the intermediate binding energy produced the greatest total amount of on-pathway growth. The other two plots reveal that although the -2 kcal/mol simulation had the most total growth, this was more than offset by its greatly elevated level of malformed growth relative to the other two simulations, giving it ultimately the lowest amount of on-pathway growth. Whereas the base simulation had the lowest amount of off-pathway growth, it also had the least total growth, yielding an intermediate amount of on-pathway growth. The -1 kcal/mol simulation, although it had neither the most total growth nor the least malformed growth, had the highest final amount of on-pathway growth. These plots thus suggest a trade-off between the competing factors of growth rate and malformation rate as functions of binding energy and indicate that producing the most on-pathway growth requires finding a balance between the two.

Binding tolerance

For the second experiment, binding tolerance, which roughly corresponds to the configurational entropy of binding, was increased by two and four times relative to the base simulation. The results are illustrated in Fig. 4. The results show a pattern similar to that described in the previous section, although with some differences.


View larger version (37K):
[in this window]
[in a new window]
 
FIGURE 4   Plot of growth as binding tolerance is varied. These graphs show the numbers of subunits involved in nucleated clusters, in (A) total growth, (B) off-pathway growth, and (C) on-pathway growth. Each plot shows values for the base simulation and for simulations in which binding tolerances were increased by two times and four times.

Fig. 4 A shows the total amount of on- and off-pathway growth. Qualitatively, it shows the same relationship as that seen in Fig. 3 A, with monotonically increasing growth over time for each simulation. Increasingly lenient tolerances yielded increasingly rapid growth.

Fig. 4 B shows the amount of off-pathway growth for the three simulations. It also shows the same qualitative pattern as described in the previous section. Except for time step 10,000, increasingly lenient tolerances led to increasing off-pathway growth. As with Fig. 3 B, the exception at time step 10,000 was caused by the temporary increase in malformation in the base simulation combined with a temporary drop in the 2× tolerance simulation. One contrast to Fig. 3 B is that here, the amount of off-pathway growth for the intermediate simulation continues to grow in the final two time steps, yielding an intermediate value very close to the highest value rather than the lowest.

Fig. 4 C shows the amount of on-pathway growth for the three simulations. It is notable that despite the similarities between Figs. 3 A and 4 A and between Figs. 3 B and 4 B, Fig. 4 C is qualitatively very different from Fig. 3 C. Again, although all three simulations produce the highest total amounts of on-pathway growth at various times, here the base simulation ultimately yielded the highest amount of on-pathway growth. The double-tolerance simulation, despite its high total growth, also had high off-pathway growth; it therefore finished with the lowest amount of on-pathway growth, although it had neither the least amount of total growth nor the greatest amount of off-pathway growth. Although the quadruple-tolerance simulation had the highest amount of total growth, it also had the highest amount of off-pathway growth and thus yielded an intermediate amount of on-pathway growth. The base simulation, despite having the least total growth, had considerably less off-pathway growth than the other simulations and thus finished with the most on-pathway growth of the three. This illustrates a trade-off similar to that in the previous section, in which there is a competition between growth and malformation rates as functions of binding tolerance, although the outcome of the competition was different from the outcome described in the previous section.

Concentration

The final simulations explored the effects of doubling and quadrupling the concentration relative to the base simulation, by reducing the volume of the simulated solution. Results are plotted in Fig. 5. The results show many qualitative similarities to the previous two experiments.


View larger version (36K):
[in this window]
[in a new window]
 
FIGURE 5   Plot of growth as concentration is varied. These graphs show the numbers of subunits involved in nucleated clusters, in (A) total growth, (B) off-pathway growth, and (C) on-pathway growth. Each plot shows values for the base simulation and for simulations in which concentrations were increased by two times and four times.

Fig. 5 A shows the total growth for the three simulations. Again, each simulation displays monotonic growth. In addition, increasing concentration corresponds to increased growth between simulations.

Fig. 5 B shows the amounts of off-pathway growth for the three simulations. At each point in time, the amount of off-pathway growth in the double-concentration simulation is at least as high as that in the base simulation. Furthermore, with the exception of time step 5000, the amount of off-pathway growth in the quadruple-concentration simulation is at least as high as that in either of the other simulations. The exception at time step 5000 occurred because of a large number of malformations in the double-concentration simulation, most of which were corrected by time step 10,000.

Fig. 5 C shows the total amounts of growth for the three simulations. As with the previous two experiments, each of the three simulations has the most total growth at some point in time. By time step 25,000, the intermediate simulation yielded the greatest amount of on-pathway growth. As with varying the binding energies, the simulation with the greatest total growth also had the greatest malformed growth, resulting in the least on-pathway growth for the quadruple-concentration simulation. Furthermore, the base simulation had the smallest amount of off-pathway growth but also the least total growth, yielding an intermediate amount of on-pathway growth. The double-concentration simulation, although it had neither the most total growth nor the least malformed growth, finished with the most on-pathway growth of the three simulations. This again shows a trade-off between growth and malformation rates, in which maximizing on-pathway growth requires finding a parameter value that strikes a balance between the two.

Mechanisms of malformation rates

Although it is not always possible to determine the source of a malformation without saving and analyzing data for many time steps, sources can be determined for many malformations observed in the simulations. These data may be helpful in interpreting other results. Overall, three mechanisms were found to affect the number of malformations observed in a simulation.

One source of observed malformations was interactions of incomplete capsids. Occasionally, the growing edges of two partially formed capsids came sufficiently close to each other that it was possible to form a binding interaction between subunits in the two capsids. The two capsids thus become connected into a single malformed structure. Malformed structures of this type can continue to add subunits after they have formed. An example of such a malformation is shown in Fig. 6 A.


View larger version (73K):
[in this window]
[in a new window]
 
FIGURE 6   Demonstration of sources of increased malformations in simulations with elevated growth rates. (A) A simulated malformation produced when a binding interaction formed between the growing edges of two incomplete capsids. (B) A malformed capsid produced when a group of subunits incorrectly formed a tetrameric structure where they should have formed a pentamer. The capsid was able to continue growing to produce a closed particle with an incorrect size. (C) A malformation produced by local a binding error that is later corrected. The capsidcontains a tetramer in place of a pentamer, as in (B). However, the incorrect tetramer breaks apart later in the simulation and is corrected into a pentamer before the mistake can lead to larger structural errors. The partial shell goes on to form a correct, complete capsid.


View larger version (84K):
[in this window]
[in a new window]
 
FIGURE 7   Two examples of simulated proteins. (A) A simple subunit model consisting of a single sphere with three edges projecting from it, representing binding sites. (B) A more complicated subunit built from a union of spheres. The separate spheres are treated as if they are rigidly connected for the purpose of calculating the effects on the entire subunit of forces acting on any part of it.

Another source of malformations is the development of small local errors in a growing capsid, which can lead to significant structural deformities. An example is shown in Fig. 6 B, in which a group of subunits that should have formed a pentamer have instead incorrectly bonded into a tetrameric structure. The capsid continued to grow after the malformation appeared, producing a closed structure in which all other binding interactions were locally consistent with a correct capsid, but in which the overall capsid was abnormally small.

A final factor affecting malformation rate is the ability of malformed capsids to correct themselves after a malformation has appeared. Fig. 6 C shows a local malformation similar to that shown in Fig. 6 B, in which subunits have formed a tetrameric structure where a pentamer would be expected. However, unlike in Fig. 6 B, the malformation in Fig. 6 C broke apart later in the simulation, allowing the insertion of another subunit to yield a correct pentamer. The cluster later went on to produce a correct, fully formed capsid. Malformations were frequently corrected in the simulations through similar local rearrangements of binding patterns. This was true not only for local errors, but also for errors resulting from interactions of partially formed capsids, which occasionally broke apart to again yield two on-pathway partial capsids.

    DISCUSSION
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

We have presented a new technique for simulating self-assembly systems, described an implementation of that technique, and discussed the results of three sample simulation experiments. The technique allows computationally tractable simulations of the complex process of self-assembly of virus capsids from individual subunits. These simulations are configurable to different growth models and different values for a variety of subunit parameters. In addition, they allow the gathering of qualitative and quantitative information on the behavior of the simulations under different models.

The specific experiments described above were designed to illustrate some of the value of simulation models in general and of local-rules simulation in particular. The experiments involved changing various parameters of a physical system. Whereas some parameters, such as concentration, may be easily adjusted in a laboratory setting, others, such as binding energy or the entropic penalty of binding, may be difficult or impossible to alter with precision and without affecting other binding properties. However, in a simulation, parameters such as these can be easily and precisely adjusted in isolation from any other factors. Furthermore, the experiments involved examining low-level details of the experimental state at specific time intervals. Many aspects of the state of a laboratory experiment cannot be precisely examined by existing experimental techniques; these include the data on distributions of on- and off-pathway subunits examined in the present work. However, in a computer simulation, the complete state of the system is available to the user. In addition, it may not be possible to quench an actual biochemical reaction at a specific point in time without possibly altering its state. A simulation, though, can easily be frozen, examined, and resumed at any time. Purely theoretical work is unlikely to be a valid replacement for simulation work for applications such as this, which involve the interplay of many parameters in a complicated and incompletely understood system. For the reasons given above, the simulation experiments presented illustrate how simulation work may provide insights that cannot be found either through laboratory work or through purely theoretical analysis. Such insights, in turn, can suggest hypotheses that might themselves be independently confirmed in the laboratory.

Our particular experience suggests that the abstract local-rules framework can be useful for simplifying otherwise intractable problems involving the assembly of repeating or symmetrical structures. Virus capsids represent an excellent application of local-rules simulation due to the relative complexity of the problem and the consequent difficulty of creating predictive mathematical models. However, local-rules simulation may also be used for simulating other self-assembly systems. An earlier local-rules simulator has already been adapted to the study of carbon-silicon compounds (Hobbs et al., 1998). A particularly useful aspect of local-rules-based simulations that was demonstrated by the sample experiments is their ability to model malformations, which have proved difficult to analyze by both experimental techniques and purely mathematical models.

Although our central result is the simulator itself rather than our specific simulation experiments, we can also consider possible implications of those experiments. The experimental results suggest a trade-off between growth and malformation rates as functions of the parameters we examined. For all three parameters, modifications that resulted in increased growth rates also led to increased incidence of malformations. This implies that achieving a maximum number of on-pathway subunits required striking a balance between these competing factors. If these results accurately reflect the biological system they model, then they may have several implications. First, if the observed competition between growth and malformation rates is found in actual viruses, then this may provide an evolutionary argument for malformations observed in real virus growth. Specifically, if viruses can only suppress their malformation rate by sacrificing growth rate, then it may be that the optimum evolutionary strategy for a virus is to allow some small number of malformations in exchange for an accelerated growth rate over a similar virus that produces no malformed capsids. Such a trade-off might also have implications for the design of capsid assembly-targeted antiviral drugs. If virus capsids have evolved to find a balance between competing growth and malformation rates, then this suggests the possibly counterintuitive strategy of introducing agents that will promote capsid growth as a means of attacking that growth. For example, an agent that catalyzes capsid nucleation or stabilizes binding interactions might increase malformation rates more than enough to offset the increased growth rates it would produce and thus could be an effective antiviral.

The nature of malformations observed in the simulations suggests possible mechanisms by which changes in binding parameters of actual viruses may affect growth and malformation rates in ways similar to those described above. Although it was not possible to reliably determine the sources of all malformations observed in the simulations, the available data make it possible to propose reasons why changing parameters that influence growth rate might also affect rates of malformation. Increased binding energies might increase the probability of binding interactions occurring when two coat proteins are in close proximity, even if they are both already bound to separate partial capsids or to nonadjacent points on the same partial capsid. Furthermore, increased binding energies will stabilize such errors when they do occur. More lenient binding tolerances would allow coat proteins to bind in positions farther from optimal, potentially promoting local errors, and could yield a greater probability that a collision between partial capsids would cause coat proteins on the two capsids to move within their binding tolerances of each other. More lenient binding tolerances might also stabilize errors, by ensuring that if an incorrect binding interaction is broken, the proteins involved will have a greater opportunity to reestablish that interaction. Finally, greater concentrations could increase the probability of collisions and the probability that a subunit bound in a slightly nonoptimal position will be able to form a new binding interaction that stabilizes that position before it can be corrected. Thus, if actual viruses exhibit mechanisms for producing and correcting errors similar to those observed in local-rules simulations, then there are theoretical arguments for proposing that the relationship between growth and malformation rates observed with local-rules simulations might hold for actual viruses as well.

Avenues for continuing this work include several areas in which our techniques can be applied and our model refined. Exploring models of kinetics for more complicated rule sets may allow examination of a wider range of virus growth properties. Simulations could also be improved by developing more realistic models of simulated proteins that better approximate known structures and measured physical properties as the relevant biochemical data become available and as increasing computer speeds reduce the run times of our simulations; more detailed models may assist in determining the dependence of assembly properties on those details. The approach described by Reddy et al. (1998) offers a possible means of deriving some of this data and may therefore provide a valuable complement to the simulation approach described in this paper. A potentially significant focus for future work is exploring the feasibility of different avenues for blocking or misdirecting capsid growth to assist in the development of capsid assembly targeted antivirals. This work will require studying the sources of naturally occurring malformations, the circumstances under which a malformed capsid can recover, and the dependence of these events on different binding properties, to gain clues to the aspects of binding an antiviral agent should affect to achieve maximum effectiveness.

    ACKNOWLEDGMENTS

We thank Jonathan King for encouraging us to apply computational techniques to exploring the kinetics of virus capsid assembly. We also thank Charles Leiserson and the Cilk group at MIT for their suggestions on using Cilk efficiently and for allowing us use of their computers. Finally, we thank Seth Teller for allowing us the use of his workstations.

This material is based upon work supported under a National Science Foundation Graduate Research Fellowship. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the author and do not necessarily reflect the views of the National Science Foundation. BB is supported by National Science Foundation Career Award 95019997-CCR and Department of Energy grant DEFG0295ER25253. BB and PP are supported by National Science Foundation grants DBI9711234 and DBI9726698.

    FOOTNOTES

Received for publication 20 April 1998 and in final form 18 August 1998.

Address reprint requests to Dr. Peter E. Prevelige, Department of Microbiology, University of Alabama at Birmingham, Birmingham, AL 35294. Tel.: 713-798-6989; Fax: 713-796-9483; E-mail: prevelig{at}uab.edu; or to Dr. Bonnie Berger, Department of Mathematics 2-389, Massachusetts Institute of Technology, Cambridge, MA 02139. Tel: 617-253-4986; Fax: 617-253-3480; E-mail: bab{at}mit.edu.

    REFERENCES
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

Biophys J, December 1998, p. 2626-2636, Vol. 75, No. 6
© 1998 by the Biophysical Society   0006-3495/98/12/2626/11  $2.00



This article has been cited by other articles:


Home page
Biophys. JHome page
B. Sweeney, T. Zhang, and R. Schwartz
Exploring the Parameter Space of Complex Self-Assembly through Virus Capsid Models
Biophys. J., February 1, 2008; 94(3): 772 - 783.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
M. F. Hagan and D. Chandler
Dynamic Pathways for Viral Capsid Assembly
Biophys. J., July 1, 2006; 91(1): 42 - 54.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
M. Hemberg, S. N. Yaliraki, and M. Barahona
Stochastic Kinetics of Viral Capsid Assembly Based on Detailed Protein Structures
Biophys. J., May 1, 2006; 90(9): 3029 - 3042.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
R. Zandi, P. van der Schoot, D. Reguera, W. Kegel, and H. Reiss
Classical Nucleation Theory of Virus Capsids
Biophys. J., March 15, 2006; 90(6): 1939 - 1948.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
T. Zhang and R. Schwartz
Simulation Study of the Contribution of Oligomer/Oligomer Binding to Capsid Assembly Kinetics
Biophys. J., January 1, 2006; 90(1): 57 - 64.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
D. Endres, M. Miyahara, P. Moisant, and A. Zlotnick
A reaction landscape identifies the intermediates critical for self-assembly of virus capsids and other polyhedral structures
Protein Sci., June 1, 2005; 14(6): 1518 - 1525.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Zlotnick
Viruses and the physics of soft condensed matter
PNAS, November 2, 2004; 101(44): 15549 - 15550.
[Full Text] [PDF]


Home page
Biophys. JHome page
D. Endres and A. Zlotnick
Model-Based Analysis of Assembly Kinetics for Virus Capsids or Other Spherical Polymers
Biophys. J., August 1, 2002; 83(2): 1217 - 1230.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. D. Moore and P. E. Prevelige Jr.
Structural Transformations Accompanying the Assembly of Bacteriophage P22 Portal Protein Rings in Vitro
J. Biol. Chem., February 23, 2001; 276(9): 6779 - 6788.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal