solution and solid state studies on the interactions of protonated cytosine salts. iv. asymmetric...

Advances in Molecular Relaxation and Interaction Processes, 17 (1980) 113-134 Elsevier Scientific Publishing Company, Amsterdam -Printed in Belgium

113

Solution and Solid State Studies on the Interactions of Protonated Cytosine Salts.

IV. Asymmetric Interbase Hydrogen Bonding and Interpyrimidine Base Stacking in

Triply Hydrogen-Bonded Cytosine Complexes. Crystal and Molecular Structure of

Bis[l-Methylcytosine, 1-Methylcytosinium] Hexafluorosilicate Dihydrate

THOMAS J. KISTENMACHER, MIRIAM ROSSI, CHIAN C. CHIANG, JOHN P. CARADONNA AND

LUIGI G. MARZILLI

Department of Chemistry, The Johns Hopkins University, Baltimore,

Maryland 21218, U. S. A.

(Received 30 January 1980 )

ABSTRACT

The crystal and molecular structuresof the complex bis(l-methylcytosine,

1-methylcytosinium) hexafluorosilicate dihydrate are reported. In the solid, a

1:l triply hydrogen-bonded complex consisting of one protonated and one neutral

1-methylcytosine base is observed. The hydrogen bonding in the complex cation is

asymmetric, and the asymmetry in the interbase hydrogen bonding is stimulated to a

large part by base stacking. The observed infinite, helical array of stacked

complexes, in which the base stacking is strong [mean stacking distance = 3.36A]

is such that the molecular overlap is between protonated and neutral l-methylcyto-

sine bases. The asymmetric interbase hydrogen bonding and the protonated/neutral

base stacking mode coexist in a synergistic interrelationship which maximizes the

molecular and crystal forces. The results found in this study are compared with a

variety of other systems in which asymmetrically hydrogen-bonded cytosine complexes

are observed, and it is found that although base stacking is also a predominant

feature of these complexes in the solid, the base stacking found in this study is

different.

0165.1919/80/0000~000/$0225 D 1980 Elsevier Scientific Publishing Company

114

INTRODUCTION

Protonation or deprotonation of nucleic acid bases has a critical effect on

their hydrogen bonding capabilities and often leads to non-Watson-Crick[l] base

pairing schemes. A particularly interesting example is the pyrimidine ribonucleo-

side cytidine. At neutral pH, the ring nitrogen atom N(3) of cytidine has an

available lone pair and acts as a hydrogen-bond acceptor, notably in the triply

hydrogen-bonded guanosine-cytidine base pairs in duplex regions of DNA and RNA.

At low pH, the N(3) site of cytidine bears a proton which acts as a hydrogen-bond

donor and the formation of guanosine-cytidine base pairing is precluded. However,

N(3)-protonated cytidine can form a triply hydrogen-bonded pair with a neutral

cytidine residue in a fashion exactly analogous to the guanosine-cytidine pair.

Such a cytidine-cytidineH+complex involves a base and its conjugate acid and, in

the absence of outside factors, one expects that the proton will be on either of

the N(3) sites in the pair with random probability[2].

Outside stimuli which can lead to the localization of the proton on one base

(and to the formation of an asymmetric N(3)-He **N(3) hydrogen bond) are most easily

studied in crystalline solids. Qualitatively, there are two attractive interaction

potentials which can be most easily utilized. First, advantage may be taken of the

difference in the hydrogen bonding affinities of the protonated and neutral

cytidine molecules. Second, it is conceivable that interacting dimers or higher

oligomers in which base-base stacking of the hydrogen-bonded pairs takes place

could provide an asymmetric environment conducive to proton localization.

We have previously reported an example of a triply hydrogen-bonded l-methyl-

cytosine, 1-methylcytosineH+complex in (1-methylcytosine, 1-methylcytosinium)

iodide monohydrate [3] . Association into dimers via a base stacking mode in which

a protonated base stacked with a neutral base and vice versa was taken to be an --

integral part of the potential which led to the localization of the proton and the

asymmetric interbase hydrogen bonding. The hemiprotonated form could be isolated

from a wide range of pH conditions and could be avoided only by removing the iodide

115

counterion by ion-exchange chromatography. In this paper, we wish to present an

analysis of the new complex bis(l-methylcytosine, 1-methylcytosinium) hexafluoro-

silicate dihydrate. This complex allows a further evaluation of the roles of

hydrogen bonding and base stacking in stimulating and stabilizing the localization

of the acidic proton on one of the hydrogen bonded bases.

EXPERIMENTAL

Crystals of the title compound, first erroneously assumed to be a hydrate of

I-methylcytosine.HF, were obtained by the slow evaporation of a solution of highly

purified l-methylcytosine(hereinafter l-MeC)[3] in 1M aqueous HF. Our X-ray

structure analysis, vide infra, yields the correct identity of the crystalline

material as bis(l-MeC, l-MeCH)SiF6.2H20, [C1002N6H15]2(SiF2).2H20. Chemical

analysis of a sample of the crystals is in quantitative agreement with the X-ray

formulation: Calcd.: C, 35.29; H, 5.04; N, 24.69. Found: C, 35.43; H, 4.42; N,

24.81. As all our manipulations were carried out in plastic ware, the hexafluoro-

silicate dianion was apparently already available in sufficient quantity in the

reagent grade HF utilized or generated from silica in the distilled water supply.

Whatever the source, we are confident of the presence of SiF6 2- in the crystalline

sample.

The crystals are unstable in humid air and to prevent decomposition were

coated with a thin layer of grease and placed in capillary tubes. The crystals

form as elongated prisms with [OlO] as the prism axis. The space group and

approximate cell dimensions were obtained from preliminary Weissenberg photographs.

The crystal system is monoclinic, with systematic absences (h + k = 2n + 1)

consistent with a C-centered cell and the space groups C2, Cm and C2/m. We have

employed the latter space group in all of our computations. The observed density,

measured by flotation in a mixture of cyclohexane and carbon tetrachloride,

indicated two formula units per cell. Precise unit-cell dimensions were obtained

from the setting angles for 15 reflections on a Syntex Pi automated diffractometer.

Complete crystal data are given in Table 1.

116

Table 1. Crystal data for [l-MeC, l-MeCH]2(SiF6).2HZ0

a = 23.754(5)A [C1602N6H15]2(SiF6)'2H20.

b = 6.719(2)A Mol wt. = 680.65

c = 9.258(2)A D = 1.543(S) g cm -3 - measd

B = 97.16(2)" D = 1.542 g cm -3 calcd

V = 1466.1 A3 z=2

X(MoKa) = 0.71069A Space Group C2/m

~(MoK~) = 1.9 cm -1

.~-____~ ~___---

The 4256 reflections in the +h hemisphere to 26 = 55" were surveyed on the

diffractometer, employing graphite monochromatized MoKa radiation. The crystal

used in data collection had the following well-developed faces and mean separations:

(loo)-(100) 0.07 mm, (001)-(001) 0.10 mm, (OlO)-(Oi?l) 0.18 mm. The crystal was

mounted with the b axis slightly tilted relative to the #I axis of the spectrometer

in order to reduce the possibility of multiple scattering. Intensity data were

collected in the 0-28 mode; individual scan speeds were determined from the number

of counts collected in a rapid scan at the calculated Bragg peak and the rate of

scanning (28) varied linearly from 1.5' min -L (less than 100 counts in the rapid

scan) to 12' min -1 (more than 1000 counts in the rapid scan). The intensities of

three standards were monitored after every 100 reflections and showed only

statistical variations over the course of the experiment. All reflection intensi-

ties were assigned observational variances based on the equation

02(I) = S + (BL + B,)(TS/ZT,)' + (PU2, where S, B1, and B2 are the scan and

extremum background counts, TS and TB are the scan and individual background

counting times (TN = TS/4 for all reflections), and p was taken to be 0.03 and

represents our estimate of the error proportional to the diffracted intensity[4].

Net intensities and their associated standard deviations were corrected for

Lorentz and polarization effects. No correction for the effects of absorption was

117

deemed necessary, as the neglect of such effects was expected to introduce errors

of less than 1%. An approximate absolute scale was determined by the method of

Wilson [S] , and a set of normalized structure factors, /E/Is, was obtained by a

standard procedure. Symmetry averaging (R = 0.028) of the data led to a set of

1835 independent intensities, of which 1488 had I > o(I) and have been employed in

the refinement of the structural parameters.

As indicated above, our preliminary formulation of the crystalline specimens

was as a hydrate of l-MeC.HF, and all of our initial trial solutions were deduced

and analysed with that stoichiometry in mind. The presence of a very strong 020

reflection suggested high atomic occupancy in planes normal to the b axis and -

separated by b/2. Such a situation is readily consistent with space groups Cm and

C2/m, with the eight l-MeC residues per cell lying on the crystallographic mirror

planes normal to the b axis. Statistical analysis of the normalized structure . _

factors as well as the zero moment test of Howells, Phillips, and Rogers[6]

encouraged the employment of the centrosymmetric space group C2/m. Our early

attempts at deducing the structure focused on the utilization of direct methods;

while several tantalizing solutions were offered by various signed E-maps, we were

unable to extend any of these to a satisfactory model. The failure of direct

methods (in our hands, at least) is not particularly surprising in view of the

tremendous vector overlap observed in the Patterson synthesis.

We were, however, eventually able to utilize coherent vectors in an EL - 1

sharpened Patterson map and to effect a structural solution. Our analysis

hinged on three outstanding features of this well resolved synthesis: (1) as

expected, most of the vector density lies in the (U, 0 W) plane, and we confined

our attention there; (2) in this plane, recognition of two strong coherent

transforms was an essential aspect of the solution. One of these vectors, of

length 5.5A, was concluded to be the intracomplex cross vector between coplanar

molecules (one l-MeC and one 1-MeCH+) hydrogen bonded together. A second cross

vector, between hydrogen-bonded base pairs, was then deduced, and finally two

118

coherent symmetry vectors were identified and allowed the position and orientation

of each of the two independent l-MeC residues to be obtained. A structure

factor-difference Fourier sequence (R = Cl IF,I-IF,1 I/ZIFol = 0.63) revealed a

strong peak (-7 e/A3) at the 2/m symmetry site (0, 0, l/2); about this central peak

were six smaller peaks (-3 e/A3) at the vertices of a regular octahedron. We

immediately recognized and identified this set of peaks as resulting from an SiF6 2-

anion, consistent with the solution chemistry which yielded the crystalline

specimens, vide supra. Extending the model in this fashion lowered the R value to

0.48, and a difference-Fourier map showed a residual peak due to the now expected

water of crystallization. At a second glance, the Patterson synthesis had vector

density clearly in accord with the presence of the SiF6 2- group.

Refinement of the now complete (except for the location of the hydrogen atoms)

model was considerably more straightforward than the structural solution. Several

cycles of isotropic refinement, minimizing the quantity ZW(~F~/-IF~~)~ where

w = 4F02 / (F,*), quickly converged and lowered the R value [R = gIjFol-/Fc/l/gl~,I] o2

to 0.12. Two further cycles of refinement, employing anisotropic thermal parameters

for the nonhydrogen atoms, gave an R value of 0.09. A subsequent difference-Fourier

synthesis allowed the positioning of the hydrogen atoms, including the hydrogen atom

involved in the N(3)-11...N(3) hydrogen bond. This latter hydrogen atom is

asymmetrically positioned between the two independent N(3) atoms, yielding a base

pair formally made up of one protonated and one neutral l-MeC molecule. In the

remaining cycles of refinement, the positional and isotropic thermal parameters of

the hydrogen atoms were also allowed to vary. Furthermore, the 020 reflection was

given zero weight as its observed structure factor amplitude was -10% smaller than

its calculated amplitude; we attribute this disparity to a combination of secondary

extinction effects and counting losses. Three cycles in such a mode produced

convergence (all shift/error less than 0.5 for the nonhydrogen atom parameters and

less than 0.7 for the hydrogen atom parameters) and a final R value of 0.071. The

final weighted R value [R w = (sw(~F~]-/F~~)~/~w]F~~~)~'~] and goodness of fit

119

Table 2. Final atomic coordinates with their estimated standard deviations in

parenthesesa

Atom x Y z Atom x Y Z

(a) [SiF6]*-

Si Ob

F(1) 85(l)

F(2) 503(l)

(b) [l-M&H]+

O(*)A

N(llA

N(3lA

N(4lA

C(l)A

C(*lA

C(4)A

C(5)A

C(6)A

H(N3)A

H(5)A

H(6)A

H(1l)A

H(12)A

H(13)A

H(41)A

H(42)A

2496(l)

1539(l)

1966(l)

1463(l)

1583(3)

2030(l)

1458(l)

962(l)

1017(2)

234(2)

59(2)

70(l)

123(2)

193(2)

161(3)

118(2)

187(2)

0

0

1731(3)

0

0

0

0

0

0

0

0

0

0

0

0

64(8)

93(7)

-lll(ll)

0

0

l/2

3268(2)

5334(2)

1870(2)

1791(3)

-338(3)

-2603(3)

3392(4)

1167(3)

-1199(3)

-495(4)

956(4)

-82(4)

-116(4)

157(3)

377(4)

382(4)

385(8)

-318(4)

-306(4)

(c) [l-MeC]

OG’)B

N(l)B

N(3)B

N(4lB

C(l)B

C(*)B

C(4)B

C(5)B

C(6)8

H(5)B

H(6)B

H(11)B

H(12)B

H(13)B

H(41)B

H(42)B

(d) H20

W(1)

HWl)

2483(l)

3432(l)

3019(l)

3577(2)

3356(2)

2953(l)

3542(l)

4031(l)

3959(l)

440(l)

427(l)

301(2)

373(2)

328(3)

326(2)

392(2)

4631(l)

467(l)

0 -3687(2)

0 -3679(3)

0 -1478(3)

0 721(3)

0 -5282(4)

0 -2967(4)

0 -732(4)

0 -1464(4)

0 -2915(4)

0 -88(3)

0 -357(3)

93(g) -568(S)

59171 -567(4)

136(8) -559(6)

0 112(3)

0 124(3)

0

92(5)

2639(4)

324(3)

aNonhydrogen atoms x 104; hydrogen atoms x 103.

b Parameters devoid of esd's are fixed by symmetry.

120

[(Zw(iFo]-IFcl)'/(NO - NV)fl'2, where NO = 1487 observations and NV = 189 variables]

were 0.045 and 1.5, respectively. A final difference-Fourier synthesis was

featureless, except for peaks at + 0.3 e/A3 near the region of the SiF6'- anion.

Neutral scattering factor curves for the nonhydrogen[7] and hydrogen[8] atoms

were taken from standard sources. Anomalous dispersion corrections were applied

to the scattering curves for all the nonhydrogen atoms[9]. Final positional

parameters are collected in Table 2. Tables of thermal parameters and observed and

calculated structure-factor amplitudes have been deposited[lO]. The crystallo-

graphic computations were performed with a standard set of computer programs[ll].

RESULTS AND DISCUSSION

The [1-Methylcytosine, 1-Methylcytosinium] Cation. A projection view of the

[l-MeC, l-MeCH]+ cation is presented in Figure 1, with nonhydrogen-atom distances

Figure 1. An illustration of the asymmetric interbase hydrogen bonding in the

[l-MeC, I-MeCH]+ complex cation in the hexafluorosilicate salt.

121

and angles for both the formally neutral and protonated residues given in Table 3.

The complex cation exhibits a triply hydrogen-bonded base pairing scheme similar

to the Watson-Crick base pairing[l] between guanosine and cytidine in duplex

regions of DNA and RNA. The interbase hydrogen bonding is asymmetric, with one of

the l-MeC bases protonated at N(3), labeled A, and the other in its neutral form,

labeled B. In addition to the unambiguous location and refinement of the hydrogen

atom H(N3)A in the N(3)-H...N(3) hydrogen bond, the differences in the nonhydrogen-

atom geometry in the two l-MeC rings, Table 3, are in complete accord with the

presence of a protonated and a neutral ring system. We note in particular that the

highly sensitive endocyclic bond angle at N(3) of the pyrimidine ring is -4"

larger for the protonated base [C(Z)A-N(3)A-C(4)A = 124.5(2)'] than for the neutral

base [C(2)B-N(3)B-C(4)B = 120.1(2)"]. In addition, the observed differences,

Table 3, in several other molecular dimensions which are also sensitive to

protonation at N(3) [the bond lengths N(l)-C(2), N(l)-C(6), N(3)-C(2), N(3)-C(4),

and N(4)-C(4) and the bond angles 0(2)X(2)-N(l), N(l)-C(2)-N(3), N(3)-C(4)-C(S),

and N(4)-C(4)-C(S)] show exactly the same trend as noted on comparing the parameters

in neutral l-MeC[12] itself with several protonated salts of N(l)-substituted

cytosine derivatives [for example, protonated l-MeC chloride[l3], perchlorate[l4]

and protonated l-benzylcytosine nitrate[l4] and protonated cytidine nitrate[lS]].

Thus, it is well established in the solid that the present complex cation formally

contains a protonated and a neutral base and concomitantly an asymmetric interbase

hydrogen bond.

In the asymmetrically hydrogen-bonded base pair, the central N(3)A_H(N3)A...N(3)B

hydrogen bond is strong [N(3)A. ..N(3)B = 2.833(4)A] and nearly linear [the bond angle

at H(N3)A is 174(2)"]. The other two hydrogen bonds in the complex involve the

exocyclic amino group of the protonated base acting as a donor and the carbonyl

group of the neutral base acting as the acceptor and vice versa. These two hydrogen --

bonds, Figure 1 and Table 4, are of different strength. The hydrogen bond between

the amino group of the protonated base and the carbonyl group of the neutral base

122

Table 3. Nonhydrogen atom bond lengths(A) and bond angles(deg)

(a) The [l-MeC, l-MeCH]+ complex

Bond

0(2)-C(2)

N(l)-C(1)

N(l)-C(2)

N(l)-C(6)

N(3)-C(2)

N(3)-C(4)

N(4)-C(4)

C(4)-C(5)

C(5)-C(6)

Bond angle

C(l)-N(l)-C(2)

C(l)-N(l)-C(6)

C(2)-N(l)-C(6)

C(2)-N(3)-C(4)

O(2)-C(2)-N(1)

O(2)-C(2)-N(3)

N(l)-C(2)-N(3)

N(3)-C(4)-N(4)

N(3)-C(4)-C(5)

N(4)-C(4)-C(5)

C(4)-C(5)-C(6)

N(l)-C(6)-C(5)

[l-MeCH];

1.212(3)

1.473(4)

1.365(3)

1.376(3)

1.383(3)

1.363(3)

1.301(3)

1.416(3)

1.333(3)

117.7(2)

121.0(Z)

121.3(2)

124.5(Z)

123.0(2)

121.2(2)

115.8(2)

117.7(2)

117.3(2)

124.9(Z)

118.7(2)

122.4(2)

[l-MeCIB

1.228(3)

1.473(4)

1.383(3)

1.360(3)

1.367(3)

1.343(3)

1.337(3)

1.416(3)

1.333(3)

118.5(2)

120.8(2)

120.8(2)

120.1(2)

119.2(2)

122.0(2)

118.8(2)

117.2(2)

121.0(2)

121.9(2)

118.2(2)

121.1(2)

123

Table 3, concluded.

(b) The SiF6'- dianion

Si-F(1) 1.641(Z) Si-F(Z) 1.66812)

F(l)-Si-F(2) 90.5(l) F(l)-Si-F(2)a 89.5(l)

F(2)-Si-F(2)b 88.4(l) F(Z)-Si-F(2)' 91.6(l)

Symmetry transforms: (a) -x, -y, 1 - 2

cc3 -x, Y, 1 - 7.

(b) x, -y, 2

Table 4. Comparison of hydrogen bonding parameters within a variety of

triply-hydrogen bonded base pairs containing cytosine derivatives a

D A D-H D-.*A H...A ID-H***A

(a) [I-MeC, 1-MeC11]2(SiF6).2H20

N(3A N(3)B l.O4(2)A 2.833(4)A l.SO(4)A 174(2)'

N(4)A 0(2)B 1.10(2) 2.736(4) 1.6414) 178(2)

N(4)B 0(2)A 0.89(2) 2.899(4) 2.01(4) 176(2)

(b) [l-MeC, l-MeCH]I*H20

N(3lA N(3)B 0.95(6) 2.836(7) 1.88(7) 177(4)

N(4)A O(Z)B 0.92(6) 2.755(7) 1.84(7) 180(4)

N(41B 0(23A O-91(6) 2.91717) 2.02(7) 174(4)

124

Table 4, concluded.

(cf [C, CH12(ZnC14f

N(3)A Nf3)B 0*94(S)

N(4)A 0(2)B 0.92(S)

N(4jB O(2)A 1.07(S)

(d) [C, CH] resorcylate*H20

N(3)A N(3)B 1.0

Nf4)A Of2)B 0.9

N(4)B 0(2V 0.9

(e) Cytosine-S-acetic acid

N(3)A,B N(3)B,A l-06(6)

N(4)kB O(2)BJ 0.92(6)

(f) Isocytosine

N(3)A N(3)B 0.95(5)

N(4)A OC2)B 0.89(S)

N(4)B 0(2P 0.95(5)

2.861(7)

2.784(7)

2.954(7)

1.9117) 176(4)

1.87(7) 178(4)

1.91(7) 167(4)

2.82(l) 1.9 157

2.78(l) 1.8 180

2*89(l) 2.0 170

2.823(8) 1.77(8) 174(3)

2.790(8) 1.88(8) 167(3)

2.908(9) 1.96(9) 178(3)

2.861(8) 1.97(8) 177(3)

2.904(8) 1.95(8) 175(3)

'The atomic numbering schemes other than for (a) and (b) have been altered from

the original papers to conform to that we have employed in (a) and (b).

125

is notably stronger [N(4)A*..O(Z)B = 2.736(4)A] than the hydrogen bond between the

amino group of the neutral base and the carbonyl group of the protonated base

tN(4)B *a-O(2)A = 2.899(4)A]. Moreover, we note that the parameters found here are

virtually identical with those we recently determined in the asymmetrically hydrogen-

bonded base pair in the compound [l-MeC, l-MeCH]I*H20[3]. The only noticeable

difference is that the two hydrogen-bonded rings are not coplanar in the iodide

complex (with a dihedral angle of 4,9(4)O)[3], whereas the two rings are required

to be coplanar here in the hexafluorosilicate salt as a consequence of positioning

the two bases on a crystallographic mirror plane.

The Hexafluorosilicate Dianion. A perspective view of the hexafluorosilicate

dianion is presented in Figure 2; the SiF6 2- anion is crystallographically required

to possess at least Z/m(C,,) molecular symmetry and, in fact, the observed molecular

symmetry, as commonly found, is nearly m3m(Oh), Table 3. The two independent Si-F

bond lengths are Si-F(lf = 1.641(2)A and Si-F(2) = 1.668(2)A. For comparison, me

take the expected mean Si-F distance in an idealized SiF6 2- dianion to be 1.695(8)A,

Figure 2. A perspective view of the hexafluorosilicate dianion. The site

symmetry is Z/m(C,h).

126

a value we deduce from the average of ten Si-F distances reported earlier and

corrected for the effects of thermal motion[l6]. The two Si-F bond lengths reported

here are 0.04-0.05A shorter than this deduced value. It is clearly seen in Figure

2, however, that the SiF ‘- 6 group in the present case is undergoing large ‘apparent’

thermal motion. In particular, the root-mean-square displacements parallel to the

b axis are 0.433A and 0.345A, respectively, for F(1) and F(2). The employment of a

very simple, riding-model-like correction[l7] for the thermal motion suggests an

extension of the Si-F lengths in each bond to about 1.70A, in good qualitative

agreement with the expected value given above. Whether such a correction is com-

pletely justified,given the possible rotational disorder [either static or dynamic]

of the SiF62- group, is questionable. In fact, we attempted a disorder model in

space group C2/m, with each of the anisotropic F atoms replaced by two isotropic

atoms displaced along the large axis of each thermal ellipsoid. This model was

slightly less successful than the fully anisotropic model and we did not pursue it

further nor have we attempted a more sophisticated description of the SiF6 2- group

motion.

Crystal Packing. A projection view of the crystal packing for

[l-MeC, I-MeCH] (SiF6)*2H20 is given in Figure 3. The view direction is approxi-

mately normal to the (010) plane, with a rotation of 10’ about the a axis to reveal -

more clearly the integrity of the components of the unit cell. The

[l-MeC, l-MeCH]+ cations and the Si and F(1) atoms of the SiF62- octahedron lie in

the crystallographic mirror planes.

We will first examine several hydrogen-bonding interactions in the structure.

In addition to the hydrogen bonding within the [l-MeC, l-MeCH]+ cation, there are

several interactions involving the cationic complexes, the SiF6 2- dianions, and the

water molecule of crystallization. The amino group of the protonated l-MeC base

acts as a donor in a bifurcated hydrogen-bond system with the acceptors being two

symmetry-related F(2) atoms of the SiF6 2- anion. The parameters in this hydrogen-

bond system are as follows: N(4)A.. .F(2) [x, y, -1 + z and x, -y, -1 + z] = 3.019(4)A,

127

/ Z

F)

X

A

Figure 3. The crystal packing in [I-MeC, I-MeCH]2(SiF6)-2H20. The view direction

is approximately along b. Shaded components lie at y = 1/2; unshaded

components lie at y = 0. Thin lines denote hydrogen bonding interac-

tions, see the text.

H(41)A...F(2) = 2.29(4)A, angle N(4)A-H(41)A-.-F(2) = 149(2) °. The water molecule

of crystallization plays the role of an intermediary in a hydrogen-bond network

the neutral I-MeC molecule and the SiF62- anion. The oxygen atom of the coupling

water molecule accepts a hydrogen bond from the exocyclic amino group of the neutral

I-MeC molecule [N(4)B...W(1) = 2.883(4)A, H(42)B.-.W(1) = 1.99(4)A,

angle N(4)B-H(42)B..~W(~) = 172(2)°], while donating hydrogen bonds to two symmetry-

related F(2) atoms [W(1)-.-F(2)(I/2 - x, 1/2 - y, 1 - z and 1/2 - x, -1/2 + y,

1 - z) = 2.932(3)A, H(WI).--F(2) = 2.13(5)A, angle W(1)-H(WI)..-F(2) = 162(3)°].

128

In contrast to the extensive hydrogen-bonding interactions observed for F(2), the

F(1) atom of the SiF6 2- group has its principal interactions with the hydrogen atoms

off the carbon atoms C(5)A and C(6)A of the protonated l-MeC base [H(5)A.. *F(l) =

2.37(4)A, H(6)A...F(l) = 2.28(4)A]. These interactions are expected to be favorable,

but weak.

The water-bridged hydrogen bond system between the neutral l-MeC molecule and

the SiF62- dlanion provides a reasonable degree of coupling between the atomic

planes at y = 0 and y = l/2, but the major interaction between these layers may well

be produced by the substantial base-base stacking of the complex cationsi Figure 3).

This base stacking arises from applying the two-fold screw axis operation parallel

along the b axis to the complexes; this screw axis intersects the (010) plane near -

to the center of mass of the complex cations. There are three significant conse-

quences of applying the two-fold screw axis to produce the molecular overlap between

base pairs: (1) there is a set of strongly overlapping cations in which the

protonated base of one hydrogen-bonded pair overlaps with the neutral molecule of

the symmetry-related base pair and vice versa; -- (2) the mean separation between

these overlapping base pairs is small (3.36A = b/2) and suggestive of a strong

interaction; (3) finally, the set of overlapping base pairs is not restricted to a

dimeric interaction, as in [l-MeC, l-MeCH]I.H20[3], but is an infinite helical array

of uniformly spaced base pairs with their molecular planes exactly normal to the

helix axis.

Moreover, we believe here, as in other such systems, that the asymmetry in the

base stacking (neutral/protonated and protonated/neutral) may well be the predominant

force responsible for the localization of the proton in the interbase N(3)-H**.N(3)

hydrogen bond. In particular, we note that the asymmetry observed in the base

pairing and in the neutral/protonated stacking arrangement is a synergistic inter-

relationship which allows for maximum molecular and crystal stability[l8].

Furthermore, we have previously argued that the highly favorable, asymmetrically

hydrogen-bonded and stacked sequence observed in the dimeric structure of the iodide

129

A 0

C D

Figure 4. Interbase hydrogen bonding and base-base stacking in (A) cytosine-5-

acetic acid; (8) the [I-MeC, I-MeCH] + cation in [I-MeC, I-MeCH]I.H20,

D = 3.22A; (C) the [C, CH] + cation in [C, CH]2(ZnCI4), D = 3.26A;

(D) the [I-MeC, I-MeCH] + cation in [I-MeC, I-MeCH]2(SiF6)'2H20,

D = 3.36A.

130

salt and repeated here in polymeric form in the hexafluorosilicate salt may well

provide a model for elements of the hemiprotonated form of polycytidylic acid[l9].

The hemiprotonated form of polycytidylic acid is known to exist in solution as a

stacked, hydrogen bonded structure[l9]. We anticipate that this hemiprotonated

polymer will avail itself in some way of the same highly-favorable asymmetric base

pairing and base stacking we have observed in the iodide and now the hexafluoro-

silicate salt of [l-MeC, l-MeCH]+.

Comparison with Other Triply Hydrogen-Bonded Cytosine Systems. Five other struc-

turally characterized systems in addition to the SiF6 2- salt reported here contain

a set of triply hydrogen-bonded base pairs of cytosine derivatives. The first such

species, cytosine-5-acetic acid[20], Figure 4A, partially exists (50% of the time)

in an inner salt or zwitterionic form, such that one half of the time the carboxylate

proton is bound to N(3) of one of the pyrimidine bases. Inversion-related

zwitterions pair into ‘symmetrically’ hydrogen bonded complexes, Figure 4A. In this

complex, the proton in the N(3)-H. **N(3) hydrogen bond is asymmetrically located and

equally distributed over the two sites near each of the symmetry-related N(3) atoms,

Figure 4A and Table 4. No base-base stacking is observed in this zwitterionic

species, as charge repulsion effects mitigate against a base stacking interaction.

Marsh and coworkers[21] subsequently studied the structure of the pyrimidine

base isocytosine. Isocytosine can exist in either of two tautomers depending on

which of the ring nitrogen atoms bears a hydrogen substituent. In the solid, these

tautomers (1,4-dihydro-2-amino-4-oxo-pyrimidine and 3,4-dihydro-2-amino-4-oxo-

pyrimidine) are present in equal amounts and hydrogen bond together in exactly the

fashion depicted in Figure 1. Details of the interbase hydrogen bonding in this

asymmetric complex are given in Table 4. Germane to our discussion here is the

existence in the crystal of dimeric packs of roughly parallel base pairs, Table 5,

with a mean separation of 3.36A. In this dimeric arrangement, the base-base overlap

is substantial and such that one tautomer overlaps the second tautomer and

vice versa[21]. --

131

More recently, Tamura, Sato and Hata[22], while studying drug-related molecular

complexes containing aromatic organic acids, determined the structure of a 2:l

complex of cytosine and resorcylic acid. In the solid, a proton from the planar

organic acid is transferred to one of the cytosine residues and employed in producing

a monomeric pair of triply hydrogen-bonded cytosine bases; here again the hydrogen

bonding is asymmetric, Table 4, and stabilized, we believe, by base stacking inter-

actions involving the hydrogen-bonded cytosine base pair and the resorcylate anion.

A similar example of cytosine, cytosinium base pairing is available in the

compound bis(C, CH)ZnC14[23]. The interbase hydrogen bonding is asymmetric, Table

4, and dimeric base pairs with C/CH+ molecular overlap are observed, Figure 4C.

The mean separation between the essentially coplanar base pairs is 3.26A.

Lastly, we present in Tables 4 and 5 and Figures 4B and 4D, the hydrogen-bond

parameters and base-base overlap diagrams we have found in the iodide and here in

hexafluorosilicate salt of [l-MeC, l-MeCH]+. We note in particular that the degree

of base-base overlap in the dimeric units found in bis(C, CH)ZnC14 and

Table 5. Comparison of base pair-base pair stacking distances within a variety

stacked base pairs containing cytosine derivatives

Compound Dihedral angle between Mean stacking distance

bases in the base pair between base pairs

(a) [l-MeC, l-MeCH]2(SiF6).2H20 o.oO 3.36A

(b) [l-MeC, l-MeCH]I.ZH20 4.9(4) 3.22

(c) [C, W2(ZnC14) 2.4(6) 3.26

(d) Isocytosine 8.9(S) 3.36

132

[l-MeC, l-MeCH]I*2H20 are virtually identical (Figures 4B and 4C), but different

from that found in the polymeric, columnar arrangement in the structure of the

hexafluorosilicate salt, Figure 4D. In the dimeric arrays, the molecular overlap

largely involves the placement of the carbonyl group of the protonated base over

the heterocyclic ring of the neutral molecule. In the polymeric array, the degree

of overlap is noticeably larger and involves a mode in which the N-CH3 bonds of both

neutral and protonated bases lie over heterocyclic ring centers. It is expected

that both observed overlap modes are highly favorable[24], with perhaps that dis-

played in the polymeric arrangement of the hexafluorosilicate salt being slightly

more so.

Finally, the observation of triply hydrogen-bonded base pairs in a number of

protonated cytosine salts suggests that such an entity is competitive with the

formation of fully protonated bases for which the hydrogen bonding and stacking

interactions are solely with the anionic counter ion. Initially, for anions such

as Cl-, Br-, C104- and NO ^ 3 , the preference (for crystallizable materials) seemed to

lie with full protonation and strong association of the cations and anions through

extensive hydrogen bond networks. Counter examples (with the anions resorcylate,

I-, ZnC14'- and SiF o*-) now exist, however, in which hemiprotonated, interbase

hydrogen-bonded complexes are present. One suspects that the hydrogen bonding

affinity of the counter ion plays some role in determining the degree of protonation

of the cytosine residues, since the known examples of hemiprotonated cytosine base

pairs are all found with weak-to-poor hydrogen-bond acceptor anions. Beyond that,

it would seem to be a delicate balance amongst a variety of 'crystal packing' forces

as to whether cytosine derivatives crystallize from acidic solutions as fully

protonated or hemiprotonated salts.

ACKNOWLEDGIMEN'IS

Support of this research was provided by the National Institutes of Health

through Public Health Service Grant No. GM 20544.

133

REFERENCES

1. J. D. Watson, and F. H. C. Crick, Nature 171, (1953), 737. -9

2. J. L. Wood, J. Mol. Struct., -- 17, (1973), 307.

3. Part III, T. J. Kistenmacher, M. Rossi, J. P. Adv. Mol. Relaxation Interact. Processes, l& --

4. W. R. Busing, and H. A. Levy, 2. Chem. w.,

5. A. J. C. Wilson, Nature(London), 150, (1942),

6. R. E. Howells, D. C. Phillips,

7. H. P. Hanson, H. Herman, J. D. (1964), 1040.

8. R. F. Stewart, E. R. Davidson, 3175.

9. D. T. Cromer, and D. Liberman,

10.

11.

12.

13.

14.

15

16

17.

18.

19.

20.

21.

and D. Rogers, Acta Crystallogr., 3, (1950), 210.

Caradonna, and L. G. Marzilli, (1979), 119.

26, (1957), 563.

152.

Lea, and S. Skillman, Acta Crystallogr., 17,

and W. T. Simpson, 2. Chem. Phys., 42, (1965),

J. Chem. Phys., -- 53, (1970), 1891.

Data deposited with B.L.L.D. as supplementary publication number SUP 26162 ( 14 pages).

Busing, Martin and Levy's ORFLS for structure factor and least squares calculations, Busing, Wehe and Levy's ORABS for absorption correction, Johnson's ORTEP for the illustrations. All other calculations were performed with locally written programs.

M. Rossi, and T. J. Kistenmacher, Acta Crystallogr., &, (1977), 3962.

B. L. Trus, and R. E. Marsh, Acta Crystallogr., E, (1972), 1834.

M. Rossi, J. P. Caradonna, L. G. Marzilli, and T. J. Kistenmacher, Adv. Mol. -- Relaxation Interact. Processes, Is, (1979), 103.

J. J. Guy, L. R. Nassimbeni, G. M. Sheldrick, and R. Taylor, Acta Crystallogr., E, (1976), 2909.

S. Ray, A. Zalkin, and D. H. Templeton, Acta Crystallogr., x, (1973), 2741 and 2748.

W. R. Busing, and H. A. Levy, Acta Crystallogr., 5, (1964), 142.

T. J. Kistenmacher, M. Rossi, and L. G. Marzilli, Biopolymers, 17, (1978), 2581.

C. H. Chou, and G. J. Thomas, Biopolymers, 16, (1977), 765.

R. E. Marsh, R. Bierstedt, and E. L. Eichhorn. Acta Crystallogr., 15, (1962), 310.

J. F. McConnell, B. D. Sharma, and R. E. Marsh, Nature(London), 203, (1964), 399 and B. D. Sharma, and J. F. McConnell, Acta Crystallogr., 19, (1965), 797.

134

22. C. Tamura, S. Sato, and T. Hata, Bull. Chem. Sot. Japan, 46, (1973), 2388. -~-

23. F. Fujinami, K. Ogawa, Y. Arakawa, S. Shirotake, S. Fujii, and K. Tomita, Acta Crystallogr., F, (1979), 968.

24. C. E. Bugg, J. M. Thomas, M. Sundaralingam, and S. T. Rao, Biopolymers, lo, (1971), 175.

solution and solid state studies on the interactions of protonated cytosine salts. iv. asymmetric...

Documents