Matta, Boyd. The quantum theory of atoms in molecules
.pdf
12.4 Invariom Modeling 333
Fig. 12.10 Series of amides with low fragment transferability. From left to right: formamide, acetamide, N-methylformamide, and N-methylacetamide.
If one wishes to reconstruct the electron density based on the sum of atomic QTAIM fragments as successfully attempted earlier [56], the carbon atom attached to the OH group cannot be regarded as transferable and the assumption of carbon transferability would introduce a systematic error in a scattering model, whereas the oxygen and hydrogen atoms have very similar or identical charges.
We have also calculated atomic QTAIM charges of a number of amides (Fig. 12.10 and Table 12.4). For these molecules transferability is only found for the oxygen atoms that are partly double-bonded to the carbon atom of the peptide bond. The nitrogen atoms of the amino group are similar in formamide and acetamide, and in N-methylformamide and N-methylacetamide, but di er in the two groups. The situation is analogous with the carbon atom – charges di er between substituted and nonsubstituted atoms, but are in agreement when the neighbors are similar. We conclude that, to a good approximation, atoms have similar QTAIM charges when their nearest neighbored atoms are identical [57]; this approach is termed the nearest-neighbor approximation (NNA).
We can also observe ‘‘compensatory transferability’’ in the charges, to use the term introduced by Bader [53]. When two possible mesomeric Lewis formulae can be written for a molecule, charge compensation can be assumed to be more pronounced than for a molecule consisting of single-bonded atoms only.
These observations illustrate the reasons for the choice of model compounds used to predict the electron density for invariom modeling as mentioned above.
Table 12.4 Atomic charges for a series of amides.
Molecule |
Atom |
Charge (e) |
Atom |
Charge (e) |
Atom |
Charge (e) |
|
|
|
|
|
|
|
Formamide |
O |
1.16 |
N |
1.16 |
C |
1.50 |
Acetamide |
O |
1.18 |
N |
1.16 |
C |
1.44 |
N-methylformamide |
O |
1.16 |
N |
1.15 |
C |
1.49 |
N-methylacetamide |
O |
1.17 |
N |
1.15 |
C |
1.43 |
33412 Fragment Transferability Studied Theoretically and Experimentally with QTAIM
12.5
Applications of Aspherical Invariom Scattering Factors
12.5.1
Molecular Geometry and Anisotropic Displacement Properties
Invariom modeling improves the accuracy of molecular geometry from X-ray single crystal di raction, especially for hydrogen atoms, for which bond distances become comparable with results from neutron di raction or theoretical calculations. Modeling with aspherical scattering factors has been shown to be more appropriate than application of the IAM [58]. Asphericity shifts [59] are bond-length aberrations because of the use of the IAM. They occur, for example, in longer CaO bonds, because of spherical averaging over the oxygen valence density in the IAM at low resolution; CbC bonds can also be a ected. Asphericity shifts disappear in invariom model refinements, although such di erences are often within the error range of the least squares.
Improvement of molecular geometry for dl-serine by using invarioms is shown in Table 12.5 by comparison of bond distances of the 298 K X-ray invariom data with room-temperature neutron study results for di erent resolution [60]. In essence, invariom modeling provides an improved scattering factor model and, as a result, the least-squares fit of aspherical scatterers to experimental structure factors usually leads to substantial improvement of the crystallographic R-factors and the goodness of fit, and to a reduction of the remaining residual electron density.
Table 12.5 Neutron (N) bond lengths of dl-serine compared with X-ray (X) invariom and the IAM model at room temperature.
Bond |
Distance (N) |
|
Distance (X, invarioms) |
|
Distance (X, IAM) |
||
|
|
|
|
|
|
|
|
|
|
|
sin y/l F0:98 |
sin y/l F0:6 |
sin y/l F0:98 |
sin y/l F0:6 |
|
|
|
|
|
|
|
||
O(1)aC(1) |
1.248(1) |
1.2487(5) |
1.248(2) |
1.2479(7) |
1.246(2) |
||
O(2)aC(1) |
1.257(1) |
1.2585(5) |
1.257(2) |
1.2586(8) |
1.256(2) |
||
O(3)aC(3) |
1.414(1) |
1.4150(5) |
1.414(2) |
1.4172(8) |
1.417(2) |
||
O(3)aH(4) |
0.981(1) |
0.95(1) |
0.97(2) |
0.92(2) |
0.92(3) |
||
N(1)aC(2) |
1.487(1) |
1.4884(5) |
1.488(2) |
1.4883(7) |
1.490(2) |
||
N(1)aH(11) |
1.037(1) |
1.048(9) |
1.05(2) |
0.96(2) |
0.94(2) |
||
N(1)aH(12) |
1.045(1) |
1.03(1) |
1.03(2) |
0.95(2) |
0.96(2) |
||
N(1)aH(13) |
1.041(1) |
1.03(1) |
1.02(2) |
0.94(2) |
0.94(2) |
||
C(1)aC(2) |
1.531(1) |
1.5316(5) |
1.532(2) |
1.5291(7) |
1.528(2) |
||
C(2)aC(3) |
1.518(1) |
1.5201(5) |
1.519(2) |
1.5165(8) |
1.514(2) |
||
C(2)aH(2) |
1.101(1) |
1.080(7) |
1.07(2) |
0.956(9) |
0.93(2) |
||
C(3)aH(31) |
1.095(1) |
1.096(9) |
1.10(2) |
0.97(2) |
0.97(2) |
||
C(3)aH(32) |
1.095(1) |
1.121(9) |
1.12(2) |
1.00(2) |
0.98(2) |
||
|
|
|
|
|
|
|
|
12.5 Applications of Aspherical Invariom Scattering Factors 335
Fig. 12.11 Mean value for bonds between C, N, and O for the Hirshfeld test plotted against resolution. Above 10 10 4 A˚ 2 (dotted line) the test is not regarded as fulfilled, as suggested by Hirshfeld [58] Copyright 2005 and Reproduction with Permission from IUCr.
Another result of the modeling process is the increased physical significance of the anisotropic displacement parameters that describe thermal motion (and disorder) in a structure, as can be proven by the results of the Hirshfeld test [5]. Figure 12.11 shows the e ect of the inclusion of the aspherical density in the anisotropic temperature parameters with regard to data resolution for dl-serine. Temperature data for carbon, nitrogen and oxygen are not regarded as including bonding e ects if the di erence of the mean-square displacement amplitudes (DMSDA) is smaller than 0.001 A˚ 2. A mean DMSDA value for the six nonhydrogen bonds was used in Fig. 12.11 in the study of dl-serine [58] and this value was plotted against resolution for invariom scattering factors obtained with the basis set B3LYP/6-311þþG(3df,3pd). Whereas for the IAM at 100 K deconvolution of electron density and thermal e ects was not achieved and the test was not fulfilled at a resolution of 0.55 A˚ 1 in sin y=l (or d ¼ 0:9 A˚ ), by using the invariom model the Hirshfeld test was fulfilled for all three temperatures (20 K, 100 K, 298 K) investigated. It was concluded that this, or higher, resolution is recommended for invariom modeling.
12.5.2
Using the Enhanced Multipole Model Anomalous Dispersion Signal
Absolute configuration of light atom structures is of crucial importance in the pharmaceutical industry. Although the phenomenon of chirality was discovered many years ago, the relevance of the absolute structure of a drug applied to the
336 12 Fragment Transferability Studied Theoretically and Experimentally with QTAIM
human organism was not recognized until the Contergan/Thalidomide scandal in the early sixties. These events made clear that absolute structure can be of outmost importance and that it should be carefully analyzed for every drug candidate before it is registered as a drug, even though it was later found that Thalidomide racemizes in vivo. The proportion of single-enantiomer drugs among new drugs introduced into the market is rapidly growing, and reached @70% in 2002 [61]. There are stringent legal requirements to determine the absolute configuration of drug molecules. A related aspect is the possibility of extending or sidestepping patents by invoking chirality. Single-crystal X-ray structure analysis enables determination of the absolute structure from the intensity di erence between Friedel pairs, because of anomalous dispersion. The objective of such an analysis is, usually, to assign absolute configuration to chiral molecules in a crystal structure and considerable e ort has been devoted to this research topic. Introduction of the x parameter by Flack [62] enabled unambiguous assignment of enantiomorph polarity and the x parameter is widely used and implemented in many least-squares refinement software, for example crystals [63], shelxl [64] or gfmlx [65]. Absolute determination of structure requires that the anomalous dispersion signal is su ciently pronounced. We have recently shown that the multipole model provides additional inversion distinguishing power [66]. Invariom modeling with fixed density terms provides a means of exploiting this to improve absolute structure determination with CuKa, and it has been found that high-res- olution MoKa datasets can provide almost similar information when evaluated with the invariom approach.
12.5.3
Modeling the Electron Density of Oligopeptide and Protein Molecules
For the purpose of structural refinement of amino acid, oligopeptide and protein molecules the naturally occurring amino acids were analyzed in terms of their invariom fragments, and a database with 73 entries that covers this class of compound was generated from 37 model compounds. This invariom database has been validated on 42 experimental small-molecule crystal structures (from IUCr journals), of di erent quality and resolution, covering not only the naturally occurring amino acids, but some of their derivatives, their protonated/unprotonated states, and most common solvents (details of the structures studied have been published elsewhere [67]).
Figure 12.12a shows the crystallographic R-factor for IAM and invariom models for these trial structures. R(F) is always equal to or better for the invariom model, when compared with the IAM. The di erence RðFIAMÞ RðFinvÞ depends on the resolution and temperature of an experiment. Improvements are more apparent at lower temperatures, because of better deconvolution of thermal and electronic e ects. Because low-resolution datasets do not contain as much information on the aspherical part of r(r), improvement of the R-factor is smaller than when more extended datasets are used. Figure 12.12b compares the positive residual electron density Dr(r) for the IAM and invariom models. Here, the most interesting feature can be observed for a high resolution structure containing sulfur,
12.5 Applications of Aspherical Invariom Scattering Factors 337
Fig. 12.12 (a) Comparison of the crystallographic R(F) factor for the IAM and invariom models. (b) Comparison of the residual density Dr(r) for the IAM and invariom models [67] Copyright 2006 and Reproduction with Permission from IUCr.
where the residual density for the invariom model is substantially reduced and becomes comparable with that of other structures that do not contain sulfur.
In disordered structures analogous behavior is seen for Dr(r) and reduction of the R-factor. For such structures the residual density is similar or even increased
338 12 Fragment Transferability Studied Theoretically and Experimentally with QTAIM
by use of the invariom model. One can conclude that the remaining, unmodeled, density is the reason for only small improvements of the R-factor for disordered structures.
An important conclusion for protein refinement can therefore be drawn from disordered structures – it is the modeling of disorder and the completeness of the structural model, rather than the aspherical electron density contribution, that limit the fit of calculated and experimental structure factors and, therefore, the quality of the results. As already remarked, invariom modeling requires data resolution of d a0:9 A˚ or sin y=l b0:55 A˚ 1. This resolution is also recommended for protein data.
12.6 Conclusion
From systematic study of amino acids and oligopeptides by experimental electron density analysis using Bader’s QTAIM an understanding of fragment transferability emerged. Use of appropriate model compounds now enables reproduction of molecular electron density r(r) from fragments; the result is a good approximation to the total density when the invariom pseudoatoms density description is employed. A pseudoatomic fragmentation of a molecular electron density retaining the local chemical environment of an atom enables convenient extension of the IAM scattering model. Such invariom modeling improves the inversion distinguishing power in absolute structure determination and yields ‘‘charge-density quality’’ geometries from low-resolution standard X-ray datasets. Anisotropic displacement parameters become physically meaningful and all properties that can be derived from a model electron density can be rapidly calculated for larger molecules so that a convenient approach to electron densities of macromolecules (proteins, polynucleotides) is feasible.
To apply invariom modeling for standard small-molecule structures no further calculations nor extra experimental procedures are necessary, making it a rapid, readily accessible, and useful tool for standard crystallographic work. In this way high-throughput techniques, which have been established in a variety of fields of X-ray di raction-based structure research, can also be applied in experimental electron-density work. This would be of special importance in the biological sciences. In medical chemistry, where it is a fundamental challenge to understand drug–target recognition processes, knowledge of electron-density distribution is a valuable completion of structure information and serves as a basis for a better understanding of such interactions than consideration of steric properties only. Since rapid screening of a large number of chemical compounds is indispensable in structure-guided drug discovery, the generation of electronic information of entire classes of chemically or pharmacologically related compounds would be highly desirable. This can become a routine task to be performed in time periods comparable to those currently needed for conventional X-ray analyses.
|
|
|
References |
339 |
|
References |
|
|
|
|
|
|
|
|
1 |
Debye, P. Annalen der Physik 46 |
18 |
Matta, C. F., and Bader, R. F. W. |
|
|
(1915), 809. |
|
Proteins: Structure, Function and |
|
2 |
Coppens, P. Angew. Chem. Int. Ed. 44 |
|
Genetics 52 (2003), 360. |
|
|
(2005), 6810. |
19 |
Mebs, S., Messerschmidt, M., |
|
3 |
Bader, R. F. W. Atoms in Molecules: A |
|
Luger, P. Z. Kristallogr. 221 (2006), |
|
|
Quantum Theory, Clarendon Press, |
|
656. |
|
|
Oxford, 1990. |
20 |
Legros, J.-P.; Kvick, A. Acta Cryst. B |
|
4 |
Hansen, N. K., and Coppens, P. Acta |
|
36 (1980), 3052. |
|
|
Cryst. A 34 (1978), 909. |
21 |
Destro, R., Roversi, P., Barzaghi, M., |
|
5 |
Hirshfeld, F. L. Acta Cryst. A 32 |
|
and Marsh, R. E. J. Phys. Chem. A 104 |
|
|
(1976), 239. |
|
(2000), 1047. |
|
6 |
Koritsa´nszky, T., Flaig, R., Zobel, D., |
22 |
Destro, R., Marsh, R. E., and |
|
|
Krane, H.-G., Morgenroth, W., |
|
Bianchi, R. J. Phys. Chem. 92, 4 |
|
|
and Luger, P. Science 279 (1998), |
|
(1988), 966. |
|
|
356. |
23 |
Gatti, C., Bianchi, R., Destro, R., and |
|
7 |
Iversen, B. B., Larsen, F. K., |
|
Merati, F. J. Mol. Struct. (Theochem) |
|
|
Pinkerton, A. A., Martin, A., |
|
255 (1992), 409. |
|
|
Darovsky, A., and Reynolds, P. A. |
24 |
Flaig, R., Koritsanszky, T., Dittrich, |
|
|
Acta Cryst. B 55 (1999), 363. |
|
B., Wagner, A., and Luger, P. J. Am. |
|
8 |
Dahaoui, S., Jelsch, C., Howard, |
|
Chem. Soc. 124 (2002), 3407. |
|
|
J. A. K., and Lecomte, C. Acta Cryst. B |
25 |
Flaig, R. Neue experimentelle Methoden |
|
|
55 (1999), 226. |
|
der Ladungsdichtebestimmung. PhD |
|
9 |
Wagner, A., Flaig, R., Zobel, D., |
|
thesis, Freie Universita¨t Berlin, |
|
|
Dittrich, B., Bombicz, P., Stru¨mpel, |
|
2000. |
|
|
M., Luger, P., Koritsa´nszky, T., and |
26 |
Flaig, R., Koritsa´nszky, T., Janczak, J., |
|
|
Krane, H.-G. J. Phys. Chem. A 106 |
|
Krane, H.-G., Morgenroth, W., and |
|
|
(2002), 6581. |
|
Luger, P. Angew. Chem. Int. Ed. 38, 10 |
|
10 |
Lamparth, I., Maichle-Mo¨ssmer, C., |
|
(1999), 1397. |
|
|
and Hirsch, A. Angew. Chem. Int. Ed. |
27 |
Arnold, W. D., Sanders, L. K., |
|
|
34 (1995), 1607. |
|
McMahon, M. T., Volkov, A. V., Wu, |
|
11 |
Luger, P., Messerschmidt, M., |
|
G., Coppens, P., Wilson, S. R., |
|
|
Scheins, S., and Wagner, A. Acta |
|
Godbout, N., and Oldfield, E. J. Am. |
|
|
Cryst. A 60 (2004), 390. |
|
Chem. Soc. 122 (2000), 4708. |
|
12 |
Dittrich, B., Koritsa´nszky, T., Volkov, |
28 |
Wagner, A., and Luger, P. J. Mol. |
|
|
A., Mebs, S., Luger, P., Angew. Chem. |
|
Struct. 595 (2001), 39. |
|
|
Int. Ed. 46 (2007), in press. |
29 |
Flaig, R., Koritsa´nszky, T., Zobel, D., |
|
13 |
Zobel, D., Luger, P., Dreissig, W., and |
|
and Luger, P. J. Am. Chem. Soc. 120, |
|
|
Koritsa´nszky, T. Acta Cryst. B 48 |
|
10 (1998), 2227. |
|
|
(1992), 837. |
30 |
Kingsford-Adaboh, R., Dittrich, B., |
|
14 |
Messerschmidt, M., Meyer, M., and |
|
Wagner, A., Messerschmidt, M., Flaig, |
|
|
Luger, P. J. Appl. Cryst. 36 (2003), |
|
R., and Luger, P. Z. Krist. 217 (2002), |
|
|
1452. |
|
168. |
|
15 |
Luger, P., Wagner, A., Hu¨bschle, C. B., |
31 |
Coppens, P., Abramov, Y., Carducci, |
|
|
and Troyanov, S. I. J. Phys. Chem. A |
|
M., Korjov, B., Novozhilova, I., |
|
|
109 (2005), 10177. |
|
Alhambra, C., and Pressprich, M. R. |
|
16 |
Matta, C. F., and Bader, R. F. W. |
|
J. Am. Chem. Soc. 121, 11 (1999), |
|
|
Proteins: Structure, Function and |
|
2585. |
|
|
Genetics 40 (2000), 310. |
32 |
Scheins, S., Dittrich, B., |
|
17 |
Matta, C. F., and Bader, R. F. W. |
|
Messerschmidt, M., Paulmann, C., |
|
|
Proteins: Structure, Function and |
|
and Luger, P. Acta Cryst. B 60 (2004), |
|
|
Genetics 48 (2002), 519. |
|
184. |
|
|
|
|
References |
341 |
|
|
|
|
|
59 |
Coppens, P., Sabine, T. M., |
64 |
Sheldrick, G. M. SHELXL-97. A |
|
|
Delaplane, R. G., and Ibers, J. A. Acta |
|
Program for Refinement of Crystal |
|
|
Cryst. B 25 (1969), 2451. |
|
Structures. University of Go¨ttingen, |
|
60 |
Frey, M. N., Lehmann, M. S., Koetzle, |
|
1997. |
|
|
T. F., and Hamilton, W. C. Acta Cryst. |
65 |
Bernardinelli, G., and Flack, H. D. |
|
|
B 29 (1973), 876. |
|
Acta Cryst. A 41 (1985), 500. |
|
61 |
Agranat, I., Caner, H., and Caldwell, |
66 |
Dittrich, B., Stru¨mpel, M., |
|
|
J. Nature Rev. Drug Discov. 1 (2002), |
|
Koritsa´nszky, T., Scha¨fer, M., and |
|
|
753. |
|
Spackman, M. A. Acta Cryst. A 62 |
|
62 |
Flack, H. D. Acta Cryst. A 39 (1983), |
|
(2006), 217. |
|
|
876. |
67 |
Dittrich, B., Hu¨bschle, C. B., |
|
63 |
Betteridge, P. W., Carruthers, J. R., |
|
Luger, P., and Spackman, M. A. |
|
|
Cooper, R. I., Prout, K., and Watkin, |
|
Acta Cryst. D 62 (2006), 1325. |
|
|
D. J. J. Appl. Cryst. 36 (2003), 1487. |
|
|
|
