- •Foreword
- •Preface
- •Contents
- •Introduction
- •Oren M. Becker
- •Alexander D. MacKerell, Jr.
- •Masakatsu Watanabe*
- •III. SCOPE OF THE BOOK
- •IV. TOWARD A NEW ERA
- •REFERENCES
- •Atomistic Models and Force Fields
- •Alexander D. MacKerell, Jr.
- •II. POTENTIAL ENERGY FUNCTIONS
- •D. Alternatives to the Potential Energy Function
- •III. EMPIRICAL FORCE FIELDS
- •A. From Potential Energy Functions to Force Fields
- •B. Overview of Available Force Fields
- •C. Free Energy Force Fields
- •D. Applicability of Force Fields
- •IV. DEVELOPMENT OF EMPIRICAL FORCE FIELDS
- •B. Optimization Procedures Used in Empirical Force Fields
- •D. Use of Quantum Mechanical Results as Target Data
- •VI. CONCLUSION
- •REFERENCES
- •Dynamics Methods
- •Oren M. Becker
- •Masakatsu Watanabe*
- •II. TYPES OF MOTIONS
- •IV. NEWTONIAN MOLECULAR DYNAMICS
- •A. Newton’s Equation of Motion
- •C. Molecular Dynamics: Computational Algorithms
- •A. Assigning Initial Values
- •B. Selecting the Integration Time Step
- •C. Stability of Integration
- •VI. ANALYSIS OF DYNAMIC TRAJECTORIES
- •B. Averages and Fluctuations
- •C. Correlation Functions
- •D. Potential of Mean Force
- •VII. OTHER MD SIMULATION APPROACHES
- •A. Stochastic Dynamics
- •B. Brownian Dynamics
- •VIII. ADVANCED SIMULATION TECHNIQUES
- •A. Constrained Dynamics
- •C. Other Approaches and Future Direction
- •REFERENCES
- •Conformational Analysis
- •Oren M. Becker
- •II. CONFORMATION SAMPLING
- •A. High Temperature Molecular Dynamics
- •B. Monte Carlo Simulations
- •C. Genetic Algorithms
- •D. Other Search Methods
- •III. CONFORMATION OPTIMIZATION
- •A. Minimization
- •B. Simulated Annealing
- •IV. CONFORMATIONAL ANALYSIS
- •A. Similarity Measures
- •B. Cluster Analysis
- •C. Principal Component Analysis
- •REFERENCES
- •Thomas A. Darden
- •II. CONTINUUM BOUNDARY CONDITIONS
- •III. FINITE BOUNDARY CONDITIONS
- •IV. PERIODIC BOUNDARY CONDITIONS
- •REFERENCES
- •Internal Coordinate Simulation Method
- •Alexey K. Mazur
- •II. INTERNAL AND CARTESIAN COORDINATES
- •III. PRINCIPLES OF MODELING WITH INTERNAL COORDINATES
- •B. Energy Gradients
- •IV. INTERNAL COORDINATE MOLECULAR DYNAMICS
- •A. Main Problems and Historical Perspective
- •B. Dynamics of Molecular Trees
- •C. Simulation of Flexible Rings
- •A. Time Step Limitations
- •B. Standard Geometry Versus Unconstrained Simulations
- •VI. CONCLUDING REMARKS
- •REFERENCES
- •Implicit Solvent Models
- •II. BASIC FORMULATION OF IMPLICIT SOLVENT
- •A. The Potential of Mean Force
- •III. DECOMPOSITION OF THE FREE ENERGY
- •A. Nonpolar Free Energy Contribution
- •B. Electrostatic Free Energy Contribution
- •IV. CLASSICAL CONTINUUM ELECTROSTATICS
- •A. The Poisson Equation for Macroscopic Media
- •B. Electrostatic Forces and Analytic Gradients
- •C. Treatment of Ionic Strength
- •A. Statistical Mechanical Integral Equations
- •VI. SUMMARY
- •REFERENCES
- •Steven Hayward
- •II. NORMAL MODE ANALYSIS IN CARTESIAN COORDINATE SPACE
- •B. Normal Mode Analysis in Dihedral Angle Space
- •C. Approximate Methods
- •IV. NORMAL MODE REFINEMENT
- •C. Validity of the Concept of a Normal Mode Important Subspace
- •A. The Solvent Effect
- •B. Anharmonicity and Normal Mode Analysis
- •VI. CONCLUSIONS
- •ACKNOWLEDGMENT
- •REFERENCES
- •Free Energy Calculations
- •Thomas Simonson
- •II. GENERAL BACKGROUND
- •A. Thermodynamic Cycles for Solvation and Binding
- •B. Thermodynamic Perturbation Theory
- •D. Other Thermodynamic Functions
- •E. Free Energy Component Analysis
- •III. STANDARD BINDING FREE ENERGIES
- •IV. CONFORMATIONAL FREE ENERGIES
- •A. Conformational Restraints or Umbrella Sampling
- •B. Weighted Histogram Analysis Method
- •C. Conformational Constraints
- •A. Dielectric Reaction Field Approaches
- •B. Lattice Summation Methods
- •VI. IMPROVING SAMPLING
- •A. Multisubstate Approaches
- •B. Umbrella Sampling
- •C. Moving Along
- •VII. PERSPECTIVES
- •REFERENCES
- •John E. Straub
- •B. Phenomenological Rate Equations
- •II. TRANSITION STATE THEORY
- •A. Building the TST Rate Constant
- •B. Some Details
- •C. Computing the TST Rate Constant
- •III. CORRECTIONS TO TRANSITION STATE THEORY
- •A. Computing Using the Reactive Flux Method
- •B. How Dynamic Recrossings Lower the Rate Constant
- •IV. FINDING GOOD REACTION COORDINATES
- •A. Variational Methods for Computing Reaction Paths
- •B. Choice of a Differential Cost Function
- •C. Diffusional Paths
- •VI. HOW TO CONSTRUCT A REACTION PATH
- •A. The Use of Constraints and Restraints
- •B. Variationally Optimizing the Cost Function
- •VII. FOCAL METHODS FOR REFINING TRANSITION STATES
- •VIII. HEURISTIC METHODS
- •IX. SUMMARY
- •ACKNOWLEDGMENT
- •REFERENCES
- •Paul D. Lyne
- •Owen A. Walsh
- •II. BACKGROUND
- •III. APPLICATIONS
- •A. Triosephosphate Isomerase
- •B. Bovine Protein Tyrosine Phosphate
- •C. Citrate Synthase
- •IV. CONCLUSIONS
- •ACKNOWLEDGMENT
- •REFERENCES
- •Jeremy C. Smith
- •III. SCATTERING BY CRYSTALS
- •IV. NEUTRON SCATTERING
- •A. Coherent Inelastic Neutron Scattering
- •B. Incoherent Neutron Scattering
- •REFERENCES
- •Michael Nilges
- •II. EXPERIMENTAL DATA
- •A. Deriving Conformational Restraints from NMR Data
- •B. Distance Restraints
- •C. The Hybrid Energy Approach
- •III. MINIMIZATION PROCEDURES
- •A. Metric Matrix Distance Geometry
- •B. Molecular Dynamics Simulated Annealing
- •C. Folding Random Structures by Simulated Annealing
- •IV. AUTOMATED INTERPRETATION OF NOE SPECTRA
- •B. Automated Assignment of Ambiguities in the NOE Data
- •C. Iterative Explicit NOE Assignment
- •D. Symmetrical Oligomers
- •VI. INFLUENCE OF INTERNAL DYNAMICS ON THE
- •EXPERIMENTAL DATA
- •VII. STRUCTURE QUALITY AND ENERGY PARAMETERS
- •VIII. RECENT APPLICATIONS
- •REFERENCES
- •II. STEPS IN COMPARATIVE MODELING
- •C. Model Building
- •D. Loop Modeling
- •E. Side Chain Modeling
- •III. AB INITIO PROTEIN STRUCTURE MODELING METHODS
- •IV. ERRORS IN COMPARATIVE MODELS
- •VI. APPLICATIONS OF COMPARATIVE MODELING
- •VII. COMPARATIVE MODELING IN STRUCTURAL GENOMICS
- •VIII. CONCLUSION
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Roland L. Dunbrack, Jr.
- •II. BAYESIAN STATISTICS
- •A. Bayesian Probability Theory
- •B. Bayesian Parameter Estimation
- •C. Frequentist Probability Theory
- •D. Bayesian Methods Are Superior to Frequentist Methods
- •F. Simulation via Markov Chain Monte Carlo Methods
- •III. APPLICATIONS IN MOLECULAR BIOLOGY
- •B. Bayesian Sequence Alignment
- •IV. APPLICATIONS IN STRUCTURAL BIOLOGY
- •A. Secondary Structure and Surface Accessibility
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Computer Aided Drug Design
- •Alexander Tropsha and Weifan Zheng
- •IV. SUMMARY AND CONCLUSIONS
- •REFERENCES
- •Oren M. Becker
- •II. SIMPLE MODELS
- •III. LATTICE MODELS
- •B. Mapping Atomistic Energy Landscapes
- •C. Mapping Atomistic Free Energy Landscapes
- •VI. SUMMARY
- •REFERENCES
- •Toshiko Ichiye
- •II. ELECTRON TRANSFER PROPERTIES
- •B. Potential Energy Parameters
- •IV. REDOX POTENTIALS
- •A. Calculation of the Energy Change of the Redox Site
- •B. Calculation of the Energy Changes of the Protein
- •B. Calculation of Differences in the Energy Change of the Protein
- •VI. ELECTRON TRANSFER RATES
- •A. Theory
- •B. Application
- •REFERENCES
- •Fumio Hirata and Hirofumi Sato
- •Shigeki Kato
- •A. Continuum Model
- •B. Simulations
- •C. Reference Interaction Site Model
- •A. Molecular Polarization in Neat Water*
- •B. Autoionization of Water*
- •C. Solvatochromism*
- •F. Tautomerization in Formamide*
- •IV. SUMMARY AND PROSPECTS
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Nucleic Acid Simulations
- •Alexander D. MacKerell, Jr.
- •Lennart Nilsson
- •D. DNA Phase Transitions
- •III. METHODOLOGICAL CONSIDERATIONS
- •A. Atomistic Models
- •B. Alternative Models
- •IV. PRACTICAL CONSIDERATIONS
- •A. Starting Structures
- •C. Production MD Simulation
- •D. Convergence of MD Simulations
- •WEB SITES OF INTEREST
- •REFERENCES
- •Membrane Simulations
- •Douglas J. Tobias
- •II. MOLECULAR DYNAMICS SIMULATIONS OF MEMBRANES
- •B. Force Fields
- •C. Ensembles
- •D. Time Scales
- •III. LIPID BILAYER STRUCTURE
- •A. Overall Bilayer Structure
- •C. Solvation of the Lipid Polar Groups
- •IV. MOLECULAR DYNAMICS IN MEMBRANES
- •A. Overview of Dynamic Processes in Membranes
- •B. Qualitative Picture on the 100 ps Time Scale
- •C. Incoherent Neutron Scattering Measurements of Lipid Dynamics
- •F. Hydrocarbon Chain Dynamics
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Appendix: Useful Internet Resources
- •B. Molecular Modeling and Simulation Packages
- •Index
7
Implicit Solvent Models
Benoıˆt Roux
Weill Medical College of Cornell University, New York, New York
I.INTRODUCTION
An understanding of a wide variety of phenomena concerning conformational stabilities and molecule–molecule association (protein–protein, protein–ligand, and protein–nucleic acid) requires consideration of solvation effects. In particular, a quantitative assessment of the relative contribution of hydrophobic and electrostatic interactions in macromolecular recognition is a problem of central importance in biology.
There is no doubt that molecular dynamics simulations in which a large number of solvent molecules are treated explicitly represent one of the most detailed approaches to the study of the influence of solvation on complex biomolecules [1]. The approach, which is illustrated schematically in Figure 1, consists in constructing detailed atomic models of the solvated macromolecular system and, having described the microscopic forces with a potential function, applying Newton’s classical equation F ma to literally ‘‘simulate’’ the dynamic motions of all the atoms as a function of time [1,2]. The calculated classical trajectory, though an approximation to the real world, provides ultimate detailed information about the time course of the atomic motions, which is difficult to access experimentally. However, statistical convergence is an important issue because the net influence of solvation results from an averaging over a large number of configurations. In addition, a large number of solvent molecules are required to realistically model a dense system. Thus, in practical situations a significant fraction of the computer time is used to calculate the detailed trajectory of the solvent molecules even though it is often the solute that is of interest.
An alternative approach, illustrated schematically in Figure 2, consists in incorporating the influence of the solvent implicitly. Such approximate schemes can provide useful quantitative estimates of solvation free energies while remaining computationally tractable. Implicit solvent approaches avoid the statistical errors associated with averages extracted from simulations with a large number of solvent molecules. Furthermore, implicit solvent models are sometimes better suited for particularly complex situations. For example, an explicit representation of the cellular membrane potential would require prohibitively large atomic simulation systems and is currently impractical. Finally, implicit solvent representations can be very useful conceptual tools for analyzing the results of
133
134 |
Roux |
Figure 1 Schematic representation of an atomic model of a biomolecular solute surrounded by explicit water molecules.
simulations generated with explicit solvent molecules and to better understand the nature of solvation phenomena in general. The complexity of the environment in which biomolecules must perform their functions is such that information extracted from simple theoretical models may be helpful to further our understanding of these systems.
In this chapter we provide an introductory overview of the implicit solvent models commonly used in biomolecular simulations. A number of questions concerning the formulation and development of implicit solvent models are addressed. In Section II, we begin by providing a rigorous formulation of implicit solvent from statistical mechanics. In addition, the fundamental concept of the potential of mean force (PMF) is introduced. In Section III, a decomposition of the PMF in terms of nonpolar and electrostatic contributions is elaborated. Owing to its importance in biophysics, Section IV is devoted entirely to classical continuum electrostatics. For the sake of completeness, other computational
Implicit Solvent Models |
135 |
Figure 2 Schematic representation of a biomolecular solute in a solvent environment that is taken into account implicitly.
schemes such as statistical mechanical integral equations, implicit/explicit solvent boundary potential, solvent-accessible surface area (SASA), and knowledge-based potentials are briefly reviewed in Section V. Finally, the chapter is concluded in Section VI with a short summary of the principal ideas.
II. BASIC FORMULATION OF IMPLICIT SOLVENT
A. The Potential of Mean Force
As a first step, it is important to establish implicit solvent models on fundamental principles. For the sake of concreteness, let us consider a solute u immersed in a bulk solution v. The configuration of the solute is represented by the vector X {x1, x2, . . .}. All other degrees of freedom of the bulk solution surrounding the solute, which may include solvent
136 |
Roux |
molecules as well as mobile counterions, are represented by the vector Y. It is expected that the system is fluctuating over a large number of configurations. It is therefore necessary to consider the problem from a statistical point of view. For a system in equilibrium with a thermal bath at temperature T, the probability of a given configuration (X, Y) is given by the function P(X, Y) [3],
P(X, Y) |
exp{ U(X,Y)/kBT} |
(1) |
∫ dX dY exp{ U(X,Y)/kBT} |
where U(X, Y) is the total potential energy of the system. For the sake of simplicity, we neglect nonadditive interactions and assume that the total potential energy can be written as
U(X, Y) Uu(X) Uvv(Y) Uuv(X, Y) |
(2) |
where Uu(X) is the intramolecular potential of the solute, Uvv(Y) is the solvent–solvent potential, and Uuv(X, Y) is the solute–solvent potential. All observable properties of the system are fundamentally related to averages weighted by the probability function P(X, Y). For example, the average of any quantity Q(X) depending on the solute configuration is given by
Q ∫ dX dY Q(X)P(X, Y) |
(3) |
An important question is whether one can rigorously express such an average without referring explicitly to the solvent degrees of freedom. In other words, Is it possible to avoid explicit reference to the solvent in the mathematical description of the molecular system and still obtain rigorously correct properties? The answer to this question is yes. A reduced probability distribution P(X) that depends only on the solute configuration can be defined as
P |
(X) ∫dY P(X, Y) |
(4) |
The reduced probability distribution does not depend explicitly on the solvent coordinates Y, although it incorporates the average influence of the solvent on the solute. The operation symbolized by Eq. (4) is commonly described by saying that the solvent coordinates Y have been ‘‘integrated out.’’ In a system at temperature T, the reduced probability has the form
P(X) |
∫ dY exp{ [Uu(X) Uvv(Y) Uuv(X, Y)]/kBT} |
|
∫ dX dY exp{ [Uu(X) Uvv(Y) Uuv(X, Y)]/kBT} |
(5) |
exp{ W(X)/kBT}
∫dX exp{ W(X)/kBT}
The function W(X) is called the potential of mean force (PMF). The fundamental concept of the PMF was first introduced by Kirkwood [4] to describe the average structure of liquids. It is a simple matter to show that the gradient of W(X) in Cartesian coordinates is related to the average force,
∂W(X) |
∂U |
|
|
∂xi |
∂xi (X) |
Fi (X) |
(6) |
Implicit Solvent Models |
137 |
where the symbol (X) represents an average over all coordinates of the solvent, with the solute in the fixed configuration specified by X. All solvent effects are included in W(X) and consequently in the reduced distribution function P(X). The PMF is an effective configuration-dependent free energy potential W(X) that makes no explicit reference to the solvent degrees of freedom, such that no information about the influence of solvent on equilibrium properties is lost.
As long as the normalization condition given by Eq. (5) is satisfied, an arbitrary offset constant may be added to W(X) without affecting averages in Eq. (3). The absolute value of the PMF is thus unimportant. For convenience, it is possible to choose the value of the free energy W(X) relative to a reference system from which the solute–solvent interactions are absent. The free energy W(X) may thus be expressed as
exp{ W(X)/kBT} |
∫dY exp{ [Uu(X) Uvv(Y) Uuv(X, Y)]/kBT} |
(7) |
|
∫dY exp{ Uvv(Y)/kBT} |
|
It is customary to write W(X) Uu(X) ∆W(X), where Uu(X) is the intramolecular solute potential and ∆W(X) is the solvent-induced influence. In practice, ∆W depends on X, the configuration of the solute, as well as on thermodynamic variables such as the temperature T and the pressure p.
B.Relative and Absolute Values: Reversible Work
As shown by Eq. (6) the PMF is the reversible work done by the average force. It is possible to express relative values of the PMF between different solute configurations X1 and X2 using Eq. (6) and the reversible work theorem [4]:
X2 |
|
∂W(X) |
|
W(X2) W(X1) X1 |
dxi |
|
|
∂xi |
(8) |
||
|
i |
|
|
X |
|
|
|
W(X1) X12 |
dxi Fi (X) |
|
|
|
i |
|
|
This relationship makes it clear that the PMF is not equal to an average potential energy because one needs to compute a reversible work against an average force to get W(X). It is also possible to express the free energy in terms of a thermodynamic integral. Introducing the thermodynamic solute–solvent coupling parameter λ [4], we write the potential energy as
U(X, Y; λ) Uu(X) Uvv(Y) Uuv(X, Y; λ) |
(9) |
constructed such that λ 0 corresponds to a noninteracting reference system with Uuv(X, Y; 0) 0 and λ 1 corresponds to the fully interacting system. As long as the end points are respected, any form of thermodynamic coupling is correct. Therefore, we have
∆W(X) 0 |
1dλ ∂∂λUuv (X,λ) |
(10) |
where the symbol (X,λ) represents an average over all coordinates of the solvent for a solute in the fixed configuration X with thermodynamic coupling λ. It may be noted