nucleotide sequence of an insertion element, isi · proc. natl. acad.sci. usa75(1978)-40 -20 1 20...

5
Proc. Natl. Acad. Sci. USA Vol. 75, No. 2, pp. 615-619, February 1978 Biochemistry Nucleotide sequence of an insertion element, ISI (Escherichia coli plasmids/deletion events/illegitimate recombination/inverted repeat sequences) HISAKO OHTSUBO AND EIICHI OHTSUBO Department of Microbiology, Health Sciences Center, State University of New York at Stony Brook, Stony Brook, New York 11794 Communicated by Norman Davidson, October 19,1977 ABSTRACT PSM2, PSMl, and PSM15 are small plasmids derived from R100 by spontaneous deletions at either end of the insertion sequence IS1. These plasmids were used to identify regions neighboring ISI as well as the ISI DNA itself, by cleavage with EcoRl, HindIII, Hae II, Hae III, Hpa II, Hha I, Hinf, and Alu I. The nucleotide sequencing results demon- strate that ISI contains 768 bases. About 30 bases at the ends of ISI were found to be repeated in an inverted order. The dele- tions occurring at the ends of ISI were found to be due to ille- gitimate recombination. The hypothesis that RNA polymerase could play an important role in such recombination phenomena is discussed based on the nucleotide sequences surrounding the recombinational hot spots. Insertion sequences (IS) are segments of DNA that occur in several size classes (700-1400 bases) and appear as insertion mutations in various operons (1, 2). These elements are present as repeated sequences in the Escherichia coli chromosome and bacterial plasmids including the fertility factor F and the re- sistance factor R (3-6). Insertion of an IS element into an operon causes strong polarity due to interference at the level of tran- scription (7, 8). Excision of this element restores normal function of the operon (9). In this way, IS elements can control gene expression. IS elements have the ability to move into genes or DNA se- quences as a single unit, indicating that the ends of the'IS ele- ments act as hot spots for recombination. Recent electron mi- croscope heteroduplex studies on R plasmid DNA molecules showed that the ends of IS elements also act as hot spots in the formation of deletions and translocations of large DNA seg- ments adjacent to the IS elements (10-15). These recombination events are illegitimate in nature but are site-specific in terms of involvement of the ends of IS elements. The frequent IS- mediated rearrangements of DNA segments strongly imply that these elements must have played an important role in the ev- olution of the organisms carrying them. In this paper, we report on the restriction endonuclease cleavage map and nucleotide sequencing analysis of an IS ele- ment, IS1, about 700-800 bases long. We describe the entire nucleotide sequence of S1, which greatly extends our previous partial sequencing (16). Some of the structural features of in- terest in the nucleotide sequences of IS1 and its neighboring regions are described and discussed as they relate to the control of gene expression and to site-specific recombination mediated by IS1 DNA. MATERIALS AND METHODS Plasmid Strains. The plasmid strains used were pSMl, pSM2, and pSM15, each of which carries one copy of the IS1 sequence (13). Isolation of closed circular duplex DNA of these plasmids has been described (17). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U. S. C. §1734 solely to indicate this fact. 615 R12 (RIOO) 8 2.7 83.0 L . deletI deletion EcoPRRI 87.2 EcoR site 1 85.5 or/ 87.6 Hind N r-de [site, \Sy IS 89.3/0 me I mer l~- etermi nont 20.0 S// str t ch/ deletion 85.5 -89.3/0/'82.7 5.5 -89.3/0/82J7 EcoR I t IS1 87.4/88.6 FIG. 1. Pedigree of the small circular plasmids (pSM1, pSM2, and pSM15), showing sequence relationships with their parent R factor, R12 (or R100). A critical part of R12 of circular DNA is shown. All numbers are distances in kilobases from an end point of an IS1 sequence on R12. The positions of the replication origin (ori), the EcoRl and HindIII cutting sites, the sites for deletions to generate pSM plasmids, and r-determinant region responsible for resistance to mercuric ion (mer), sulfonamide (sul), streptomycin (str), and chloramphenicol (chl) have been mapped (13, 4, 19). Enzymes. EcoRl and HindIll were purchased from Miles Laboratory and were assayed as recommended by Miles Lab- oratory. Other restriction endonucleases (Hae II, Hae III, Hha I, Hinfl, Hpa II, and Alu I) were kindly supplied by H. Ohmori. The reaction mixture for these endonucleases contained 6 mM Tris-HCI (pH 7.9), 6 mM MgCl2, 6 mM 2-mercaptoethanol, and bovine serum albumin (500 ,ug/ml). Bacterial alkaline phos- phatase (36 units/ml) from Worthington Biochemical, in 0.1 M Tris-HCI, pH 8.0/10 mM MgCl2, was used. Polynucleotide kinase (P-L Biochemicals, Inc.) was used for phosphorylation of the 5' end of the DNA with [y-32P]ATP (specific activity, -2000 Ci/mmol, ICN 35001-HH). The reaction mixture for this enzyme contained 50 mM Tris-HCl (pH 7.5), 10 mM MgCl2, and 5 mM dithiothreitol. Gel Electrophoresis. To separate native DNA fragments, electrophoresis was carried out on a 13 X 15 X 0.2 cm 4% or 10% Abbreviation: IS, insertion sequence. Downloaded by guest on March 3, 2021

Upload: others

Post on 10-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Nucleotide sequence of an insertion element, ISI · proc. natl. acad.sci. usa75(1978)-40 -20 1 20 *-40 0hinf 3,ctgaggttctgtcgtcttcttaatatgctcaccatagacctactamaattmtamagcttctatggm

Proc. Natl. Acad. Sci. USAVol. 75, No. 2, pp. 615-619, February 1978Biochemistry

Nucleotide sequence of an insertion element, ISI(Escherichia coli plasmids/deletion events/illegitimate recombination/inverted repeat sequences)

HISAKO OHTSUBO AND EIICHI OHTSUBODepartment of Microbiology, Health Sciences Center, State University of New York at Stony Brook, Stony Brook, New York 11794

Communicated by Norman Davidson, October 19,1977

ABSTRACT PSM2, PSMl, and PSM15 are small plasmidsderived from R100 by spontaneous deletions at either end of theinsertion sequence IS1. These plasmids were used to identifyregions neighboring ISI as well as the ISI DNA itself, bycleavage with EcoRl, HindIII, Hae II, Hae III, Hpa II, HhaI, Hinf, and Alu I. The nucleotide sequencing results demon-strate that ISI contains 768 bases. About 30 bases at the ends ofISI were found to be repeated in an inverted order. The dele-tions occurring at the ends of ISI were found to be due to ille-gitimate recombination. The hypothesis that RNA polymerasecould play an important role in such recombination phenomenais discussed based on the nucleotide sequences surrounding therecombinational hot spots.

Insertion sequences (IS) are segments of DNA that occur inseveral size classes (700-1400 bases) and appear as insertionmutations in various operons (1, 2). These elements are presentas repeated sequences in the Escherichia coli chromosome andbacterial plasmids including the fertility factor F and the re-sistance factor R (3-6). Insertion of an IS element into an operoncauses strong polarity due to interference at the level of tran-scription (7, 8). Excision of this element restores normal functionof the operon (9). In this way, IS elements can control geneexpression.

IS elements have the ability to move into genes or DNA se-quences as a single unit, indicating that the ends of the'IS ele-ments act as hot spots for recombination. Recent electron mi-croscope heteroduplex studies on R plasmid DNA moleculesshowed that the ends of IS elements also act as hot spots in theformation of deletions and translocations of large DNA seg-ments adjacent to the IS elements (10-15). These recombinationevents are illegitimate in nature but are site-specific in termsof involvement of the ends of IS elements. The frequent IS-mediated rearrangements of DNA segments strongly imply thatthese elements must have played an important role in the ev-olution of the organisms carrying them.

In this paper, we report on the restriction endonucleasecleavage map and nucleotide sequencing analysis of an IS ele-ment, IS1, about 700-800 bases long. We describe the entirenucleotide sequence of S1, which greatly extends our previouspartial sequencing (16). Some of the structural features of in-terest in the nucleotide sequences of IS1 and its neighboringregions are described and discussed as they relate to the controlof gene expression and to site-specific recombination mediatedby IS1 DNA.

MATERIALS AND METHODSPlasmid Strains. The plasmid strains used were pSMl, pSM2,

and pSM15, each of which carries one copy of the IS1 sequence(13). Isolation of closed circular duplex DNA of these plasmidshas been described (17).

The costs of publication of this article were defrayed in part by thepayment of page charges. This article must therefore be hereby marked"advertisement" in accordance with 18 U. S. C. §1734 solely to indicatethis fact.

615

R12(RIOO)

8 2.7 83.0L.deletI

deletion

EcoPRRI

87.2EcoRsite 1

85.5or/

87.6Hind N r-de[site,

\SyIS 89.3/0

meI mer

l~-

etermi nont

20.0

S// str tch/

deletion

85.5

-89.3/0/'82.7

5.5

-89.3/0/82J7

EcoR I

t IS187.4/88.6

FIG. 1. Pedigree of the small circular plasmids (pSM1, pSM2,and pSM15), showing sequence relationships with their parent Rfactor, R12 (or R100). A critical part of R12 of circular DNA is shown.All numbers are distances in kilobases from an end point of an IS1sequence on R12. The positions of the replication origin (ori), theEcoRl and HindIII cutting sites, the sites for deletions to generatepSM plasmids, and r-determinant region responsible for resistanceto mercuric ion (mer), sulfonamide (sul), streptomycin (str), andchloramphenicol (chl) have been mapped (13, 4, 19).

Enzymes. EcoRl and HindIll were purchased from MilesLaboratory and were assayed as recommended by Miles Lab-oratory. Other restriction endonucleases (Hae II, Hae III, HhaI, Hinfl, Hpa II, and Alu I) were kindly supplied by H. Ohmori.The reaction mixture for these endonucleases contained 6 mMTris-HCI (pH 7.9), 6 mM MgCl2, 6 mM 2-mercaptoethanol, andbovine serum albumin (500 ,ug/ml). Bacterial alkaline phos-phatase (36 units/ml) from Worthington Biochemical, in 0.1M Tris-HCI, pH 8.0/10 mM MgCl2, was used. Polynucleotidekinase (P-L Biochemicals, Inc.) was used for phosphorylationof the 5' end of the DNA with [y-32P]ATP (specific activity,-2000 Ci/mmol, ICN 35001-HH). The reaction mixture for

this enzyme contained 50 mM Tris-HCl (pH 7.5), 10 mMMgCl2, and 5 mM dithiothreitol.Gel Electrophoresis. To separate native DNA fragments,

electrophoresis was carried out on a 13 X 15 X 0.2 cm 4% or 10%

Abbreviation: IS, insertion sequence.

Dow

nloa

ded

by g

uest

on

Mar

ch 3

, 202

1

Page 2: Nucleotide sequence of an insertion element, ISI · proc. natl. acad.sci. usa75(1978)-40 -20 1 20 *-40 0hinf 3,ctgaggttctgtcgtcttcttaatatgctcaccatagacctactamaattmtamagcttctatggm

616 Biochemistry: Ohtsubo and Ohtsubo

EcoRI site HindM siteII

87.4kb 87.6kb87.2 kb

_ Gu'ms_ r.aAce -- I I

88.6 kb 89.3/0/82.7kb

| I S1 | 83.0kb85.5kb

or/ 87.2 kbpSM2 UNA

deleted in pSMI " | ,deeein pSMl5 I

I,,, , ,.§!wII I f I t-Jt II'n+~~~~*I I II*Ij--1

t * | o+ I~~~~~~~ .- ,- _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

a I !i n+- - -- -4"ii 414

MboD -Je Mlboll | | \ MboJIroq I |\ ToqI ph1p>l 0o=\EcoRI

Mbo I PstI Bo/I HphIFIG. 2. Cleavage maps of pSM2 DNA with various restriction enzymes. pSM2 is displayed in a linear representation by cutting the circle at

the EcoRl cleavage site. Thick lines represent the regions analyzed by endonuclease cleavage. Arrows indicate cleavage sites with various restrictionenzymes. Arrows with open heads are the cleavage sites that could only be fixed after nucleotide sequence analysis (see Fig. 4). The bottom linesummarizes cleavage sites of restriction endonucleases not used in the experiment; these sites were found after nucleotide sequencing analysisof the thick lined region.

polyacrylamide slab gel (acrylamide/bisacrylamide, 20:1) withE buffer (50 mM Tris-borate, pH 8.3/1 mM EDTA). DNAbands were visualized with ethidium bromide (4 ,ug/ml in Ebuffer) under a short-wavelength UV light.

Determination of Nucleotide Sequences. 5'-32P-Labeledfragments were obtained by strand separation (18) or bycleavage of the 5'-32P-labeled double strand with another re-

striction enzyme. The nucleotide sequence of these fragmentswas determined as described by Maxam and Gilbert (18).

RESULTSCleavage Maps of the Region Containing ISI. The physical

structures of the three small plasmids used for the mappingexperiments are shown in Fig. 1. The sequence relationshipsamong these plasmids and their parental R factor, R12 (sameas R100), have been determined with a coordinate system inkilobases by the electron microscope heteroduplex method (13).pSM2 and pSM15 were independently derived from the parentR12 by deletion. pSMI was derived from pSM2 spontaneously

EcoRl87.2kb

pSM2DNA fr

B{

deleted

87.4kb in pSMI 88.6kbI. '1

, . I;....4,,I i A> i 4

a l.

C,~~~~O-

DDl _

I e4

I] | -I I | 1IL ll00bp

_ _

by deletion. It should be noted that all of the deletion eventsoccurred at the ends of the IS1 sequence (13). An alternativemap showing sequence relationships among these plasmids isshown in a linear representation at the top of Fig. 2. pSML andpSM15 are missing different portions of pSM2, located at eitherend of the IS1 sequence of pSM2.

These three plasmids were cleaved by Hae II, Hae III, Hinf,Hha I, and Alu I and were subjected to gel electrophoresis. Thedeletion in pSMl or pSM15 resulted in a gel band pattern dif-ferent from that of pSM2. Comparison of gel band patterns ofpSMl or pSM15 with pSM2 enabled us to identify the fragmentscontaining the neighboring regions of IS1 as well as those con-taining the ends of IS1 DNA. Elution of the DNA fragmentsgenerated by a restriction endonuclease and further digestionof these DNA fragments with another restriction endonucleaseallowed us to order the fragments containing the IS1 sequenceand its neighboring regions. Fig. 3 shows the cleavage maps ofthese three small plasmids. All of the fragments were orientedfrom the EcoRl or HindIII cutting site, the location of which

ISi

deleted

89.3/0/82.7kb in pSMl5 83.0kbIf I

a. ,' ~200bp . * -

-_769 ;1"x i ----

FIG. 3. Labeled strands used for determination of the nucleotide sequence of the regions (I, II, and III) are shown at the bottom of the figure.Arrows indicate 5'-32P-labeled strand aligned in the 5' 3' direction. Most of the strands were obtained by the strand separation technique,

except for strands indicated by a and which were obtained by secondary cleavage of 5'-32P-labeled double-stranded DNA. The numbers inthe region I, II, and III indicate the positions of nucleotides from selected origins that were based on nucleotide sequencing results of those regions(see Fig. 4).

Hoe II

Hin,f I

Hae m

Hpo U

A/8 I

Hh o I

Proc. Natl. Acad. Sci. USA 75 (1978)

Dow

nloa

ded

by g

uest

on

Mar

ch 3

, 202

1

Page 3: Nucleotide sequence of an insertion element, ISI · proc. natl. acad.sci. usa75(1978)-40 -20 1 20 *-40 0hinf 3,ctgaggttctgtcgtcttcttaatatgctcaccatagacctactamaattmtamagcttctatggm

Proc. Natl. Acad. Sci. USA 75 (1978)

-40 -20 1 20*-

0Hinf40

3,CTGAGGTTCTGTCGTCTTCTTAATATGCTCACCATAGACCTACTAMAATTMTAMAGCTTCTATGGM CTTTTAATCATACCAGGGAGCTTCTCTTTTGTCGTGACTAAAGTAGTTACTGIHinf -80 -60 -40 -20 -1

- AATCAGAAATAMTTTATCAATCATGCTTGGCAAACACTGATAAGAACTCTACTTAATAACTCAAGAACAGAACAACCAAATTATGATTTTAAACAGGGA< '1100 b.p.- OTTAGTCTTTTAMAAATAGTTAGTACGAACCGMGTGACTATTCTTGAGATGAATTATTGAGTTCTTGTCTTGTTGGTMATACTAMMGTCCCT

1__*__^-^-By-it;TT.1GGTGATGCTGCCAACTTACTGAMTATGTATGATGGTG1TI1TTGAGGTGC GTU.AX1TiC1 1 !IIil L IA LAUL lbIG L ILL lb 11GAbL HiA I U WMoUUbIIUU 11IebutUCCACTACGACGGTTGAATGACTAAATCACATACTACCACAAAMACTCCACGAGGTCACCGAAGACAMAGATAGTCGACAGGGAGGACAAGTC~GATGACTGCCCCACCACGCATTGCCGTTTTCGTGGCi ~~~~~~~~Ball

HpaII HhaI,HaeII 160 PotI 200 EpaIl 220 HaeIfl 240 HpalI

CCGGACATCAG6G0CeTATCTCTGCTCTCACTGCCGTAAeCAGGCAACTGCAGTTCACTTACACCGCTTCTCAACCCGGTACGCACC ATCATTGAT3TGGCCATGMTGGCGTTGGATGCCGGGGCCTGTAGTCGCGATAGAGACGAGAGTGACGGCATMTGTACCGTTGACGTCMAGTGAATGTGGCGAAGAGTTGGGCCATGCGTGGTCTMTAGTMACTATACCGGTACTTACCGCAACCTACGGCC

260 HaII300 laI 330 HhI HaT 360 . 380

GCAACCGCCCGCAiTATGGGCGTTGGCCTCAACACGATTTMCCGCCAMAIMAACTCAGGCCGCAGTCGGTAACCTCGCGCATACAGCCGGGCAGTGACGTCATCGTCTGCGCGGAMATGG'ACGAACGTTGGCGGGCGTAATACCCGCAACCGGAGTTGTGCTAAMGGCGGTAAATTTMTGAGTCCGGCGTCAGCCATTGGAGCGCGTATGTCGGCCCGTCACTGCAGTAGCAGACGCGCCMACCTGCTT

400 Ehal, WhaIHaeUlzU

CAGTGGGGATACGTCaGGGCTMATCGCGCCAGCG'CTGGCTGTTTiACGCGTATGACAGGCTC'CGGAGACGGTT'GTTGCGCACGTATTCGGTGAACGCACTATGGCGACGCTGGGGCGTCTTATGAGGTCACCCCTATGCAGCC'CCGAMAGCGCGGTCGCGACCGACAAAATGCGCATACTGTCCGAGGCCTTCTGCCCAACGCCGTGCATAAGCCACTTGCGTGATACCGCTGCGACCCCGCAGAATACTC

520 540 HaeIIl linf 580 Alul 600 620 HinfCCTGCTG;CACCCMGACGTGGTGATATGGATGACGIATGGCTGGCCGCTGTATGAATCCCGCCTGAAGGGAAAG.CiGCACGTMT.AGCAAGCGAATACGCAGC..TTGAGCGGCATACCTGAGGACGACAGTGGGAAACTGCACCACTATACCTACTGCCTACCGACCGGCGACATACTTAGGGCGGACTTCCCMCGACGTGCATTAGTCGTTCGCTATATGCGTCGCTTMCTCG>CCGTATTGGACT

660 680 700AluI 720 740 linf 768

ATCTGAGGCAGCACCTGGCACGGCTGGGACGGAAGTCGCTGTCGTTCTCAAATCGGTGGAGCTGCATGACAAAGTCATCGGGCATTATCTGAACATAAAACACTATCAATAAGTTGGAGTCATTACCTAGACTCCGTCGTGGACCGTGCCGAcCCTGCCTTCAGCGACAGCAAGAGTM AGCCACCTCGACGTACTGMCAGTAGCCCGTAATAGACTTGTATTTTGTGATAGTTATTCAACCTCAGTAATGG

right end of ISi J

i780 _paI-80 -60 -40 -20 1

GTGAACGTGAATC 200 b CCGGCGGTGAATACTGGCAACGTCAGAAGACGCTGCTGACAGAAGGGAAGTCAGnTTTATGAAAGGACTGTTCAGAATTGTGGATATGACACTTGCACTTAG < GGCCGCCACTTATGACCGTTGCAGTCTTCTGCGACGACTGTCM CCTTCAGTCAAATACMCCTGACAGTCTTcACCTATACT

20#1 40of Alul 60'HpaII Haelli

*. I I - 31AGCGGTGGTATCTGTGTCCGCAGGTACGGGTCGCGGATATCGTCCAGCTGAACGGGAATAGTCCGGCCTCGCCACCATAGACACAGGCGTCCATGCCCAGCGCCTATAGCAGGTCGACTTGCCCTTATCAGGCCGG5,

480 500

20 40 60Alul 80

-

440 Spal* 460 HhalI I

HinfI-40 -20 1 20 40 60

'GACTCCAAGACAGCAGAAGAATTATACGAGTGGTATCTGGATGATATTAAATTATMATCGGG6TGATGCTGCCAACTTACTGAMAGTGTATGATGGTGTTITTITGAGGTGCTC-CAGTGGCTTCTGT3,CTGAGGTTCTGTCGTCTTCTTAATATGCTCACCATAGACCTACTAMMTTAATAMAGCCCACTACGACGGTTGAATGACTAMTCACATACTACCACAAMACTCCACGAGGTCACCGMAGACA

Lleft end of IS170 Alu3. v a

TTCTATCAGCT3AArG4TAGTCGA5,Alul Ri~~ ~ ~~ ~ ~~ ~~~~~~~~~~~~~~n~fof of to

Alul

5P 1 . 720 740 ,760 768 1 20 40 a. 3,1§lAGCTGCATGACAAAGTCATCGGGCATTATCTGAACATMMACACTATCMiTMGTTGGA6TCATTACCGAAGCGGTGGTATCTGTGTCCGCAGGTACGGGTCGCGGATATCGTCCAGCT

3, TCGACGTACTGTTTCAGTAGCCCGTAATAGACTTGTATMGTGATAGTTATTCAACCTCAGTMTGGCTTCGCl;ACCATA6ACACAGGCGTCCATGCCCAGCGCCTATAGCAGGTCGA5,right end of IS1 --'

FIG. 4. Nucleotide sequence in the region I, II, and III shown in Fig. 3. Determined nucleotide sequences of IS1 are numbered from 1 to

768. Other numbers outside of IS1 are named as indicated in this figure to facilitate the sequence relationships among regions I, II, and III (seeFigs. 5 and 6). Wedges indicate the cleavage sites of each enzyme on one strand which is aligned with 5' - 3' direction in this sequence. C at 655

and 657 shows modified dCMP, probably 5-methyl dCMP. The nucleotide sequence for approximately 100 residues from the 5'end of a strandcould be determined with reasonable accuracy. Most of the DNA sequences of IS1 and its neighboring regions were determined by sequencingboth strands, except that the regions 67-92, 282-318, 527-558, and 629-639 were determined by sequencing of only one strand as indicated by

underlines in the figure.

had been determined (13, 19). As will be discussed below, nu-

cleotide sequencing analysis of these fragments confirmed thecleavage maps that we constructed.

Nucleotide Sequence Analysis in the Regions ContainingISI. Part A of Fig. 3 summarizes the DNA fragments, se-

quenced by the Maxam and Gilbert method (18), for deter-mination of nucleotide sequences of the regions in pSM1, pSM2,and pSM15 that are indicated in B of Fig. 3. The final nucleo-tide sequences determined for these regions are shown in Fig.4. Fig. 5 shows the radioautographs used in the determinationof the sequences of two large DNA fragments, b and c, shownin Fig. 3.

Fig. 6 presents the sequences relevant to the identificationof the left and right end points of IS1. As shown in B of Fig. 6,sequence c of pSM1 is derived by illegitimate recombination

between a and b of pSM2. We define the left end of IS1 as thesequence present in both b of pSM2 and c of pSM1 or as thesequence in c of pSM1 but absent in a of pSM2. This enabledus to determine the left end of IS1 to be the G residue labeled1. pSM15 and pSM2 were independently derived from R12 byexcision events. We define the right end of IS1 as the sequencecommon to f of pSM3 and e of pSM15 or as the sequence in a

of pSM15 but not common to d of pSM2. This enabled us toassign the right end of IS1 to be the G residue labeled 769 or theC residue labeled 768.

DISCUSSION

We have described the nucleotide sequence of IS1 and itsneighboring regions that were present in small plasmids derived

Hinf.. I .5. I .GACTCCAAGACAGCAGAAGAATTATACGAGTGGTATCTGGATGATMATTAATTATMATCGAAGATACCAMGAMATTAGTATGGTCCCTCGAAGAGAAMCAGCACTGATTTCATCAAT(;ACTCI

Biochemistry: Ohtsubo and Ohtsubo 617

I. . .. - -.-. - - - -, -_-.-

r-left end of IS11

Alul 100* I

120

I

Dow

nloa

ded

by g

uest

on

Mar

ch 3

, 202

1

Page 4: Nucleotide sequence of an insertion element, ISI · proc. natl. acad.sci. usa75(1978)-40 -20 1 20 *-40 0hinf 3,ctgaggttctgtcgtcttcttaatatgctcaccatagacctactamaattmtamagcttctatggm

618 Biochemistry: Ohtsubo and Ohtsubo

C

A>GG>A C C+T

:_0

wifs ..

T

TA

A

TT

A

A

T

C.

CCT

A

. A

-A _C

TA

- '-, TT

AAA

_W X~~~~A

._~

..-.

.v~

_me_

__. Ww

A"

Mow

A

FIG. 5. Radioautograph used for determination of the sequences

of strands b and c shown in Fig. 3. These sands contained sequencesb and c which are shown in B of Fig. 6 for the determination of the leftend of IS1. Each strand, labeled at the 5' end with 32p, was subjectedto the four sets of chemical reactions. Aliquots of the four samples ofbase-specific cleavage products of each strand were analyzed byelectrophoresis for 36 hr in a 30 X 40 X 0.15 cm slab gel of 20% poly-acrylamide/7 M urea, according to the method developed by Maxamand Gilbert (18). Critical parts of the sequence read from gel patternsare indicated with numbers.

from an R factor, R12. IS1 DNA was determined to contain 768or 769 bases. As explained above, the end points of the IS1 se-

quence were defined as the points where deletions appear tohave occurred specifically. This raises the possibility that hotspots for deletion are not exactly the ends of IS1. At present, thenucleotide sequence results that are available for determinationof the ends of IS1 are the preliminary ones obtained by Grindley(20) on the end region of an ISI that occurred in the gal operon.Our results agree with his when we take the G residue labeled1 as the left end of IS1 and the C residue at 768 as the right endof ISI as shown in Fig. 6. IS1 DNA was therefore determinedto contain 768 bases. This may also indicate that both deletionsand insertions are site-specific events.

It is interesting to note that we have found inverted repeatsequences at the very ends of IS1, although there are some

mismatched regions in those repeats. Fig. 7 shows this invertedrepetition as an inversion loop structure. These repeated se-

quences at the ends of IS1 are reminiscent of the fact that mosttransposable DNA segments carrying antibiotic resistance genesare flanked by inverted repeated sequences (10-12, 21-23).Tn9, a transposable element for the chloramphenicol resistancegene, however, is flanked by direct repeats of IS1 (14, 15). Inthis case, however, Tn9 also has inverted repeats, because IS1

a b

A F pSM2 DNA

pSMI DNA

e

pSMI5 DNA

f d

-20' -10' -I' I' 10' 20'a 5-TGATAAATTAAfTATAAATCrI4AGATACCAAAGAAAATTAGT-3

-20 -10 -I 10 20B b 5-AATTATGATTTTAAACAGGGAGGTGATGCTGCCAACTTACTG-3'

-20' -10' -I' 10 20C 5'-TGATAAATTAAfTATAAATCGGGTGATGCTGCCAACTTACTG-3'

Left end of ISf-20 -id -I 10" 20'

d 5'-ACTGTTCAGAAfTGTGGATAf GAAGCGGTGGTATCTGTGTCC-3'750 760 768 I" loll 20"

c e 5'-CAATAAGTTGGAGTCATTACqGAAGCGGTGGTATCTGTGTCC-3'720 7§0 76§ 769 7§0

f 5'-CAATAAGTTGGAGTCATTACCGTGAACGTGAATC-3'Right end of 1Sf

FIG. 6. (A) Critical regions (a-f) of pSM2, pSM1, and pSM15 forthe determination of nucleotide sequence at the ends of IS1. (B)Comparison of the sequences a, b, and c fixes the left end of IS1. (C)Comparison of the sequences d, e, and f fixes the right end of IS1.

itself was found to have inverted sequences at its ends. Thesefacts suggest that the presence of inverted repeats is vital forinsertion of IS1 and transpositions.

As shown in B of Fig. 6, two different sequences, a and b,recombine to give sequence c. This recombination takes placeat a nonhomologous region, although these regions contain(AT)-rich clusters that always occur at almost the same positionsrelative to the site of recombination (see fragments a and b inB of Fig. 6). This illegitimate but site-specific recombinationmay thus be due to the presence of some specific sequencesaround the recombination sites. We notice that these regionscontain nucleotide sequences that are similar to or the same as

those seen in promoter and operator regions of various operonsthat are recognized and bound by DNA-dependent RNApolymerase. The sequences

30 36 730 724T-A-T-G-A-T-G and G-A-T-A-A-T-G

which are found near the left and right ends of ISi, are seen inthe t-RNATYr promoter region (24) and in the rightward earlypromoter region (pr) of phage X (25, 26). These sequences are

similar to the sequence, T-A-T-A-A-T-R, which was proposedby Pribnow (27) to be an important sequence involved in RNApolymerase binding. It should be noted that many other se-

quences similar to Pribnow's are also found in the region within50 bases from the recombination junction sites (see Fig. 6).The sequence

1 10 15G-G-T-G-A-T-G-C-T-G-C-C-A-A-C-T-T,

seen at the left end of IS1, is quite similar to the promoter se-

quence of pl-14 -10 0G-G-T-G-A-T-A-C-T-G-A-G-C-A-C-A-T,

30

5@ ,G-T-A,G ,T T

G left end of S1 G

~~~10 204% T

-IA,lA .G% G C20 ,T 40 A',/G-G-T A-T-G-C-T C-C-A-A-C-T-T-A T-G-A-T d-T-G-T-T-T-T-T-G

'C-C-A T-A-CG-A--G-T-T-GA-A-T A-C-T-A C-A-C-A-A-A-AA-C769G 768 'T' 'A' 'T' %'T' 'A

+ 761 750 740

31G right end of IS1

FIG. 7. Inverted repeat sequences at the ends of IS1 shown as an

inversion loop structure.

b A>GG>A C C+T

w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

usc -AT

ITA G

GGCA

-2 AT!

AT

A

TAAAT

-20 --A

C

AAA,cA'A

A

--40 C

A

Proc. Nati. Acad. Sci. USA 75 (1978)D

ownl

oade

d by

gue

st o

n M

arch

3, 2

021

Page 5: Nucleotide sequence of an insertion element, ISI · proc. natl. acad.sci. usa75(1978)-40 -20 1 20 *-40 0hinf 3,ctgaggttctgtcgtcttcttaatatgctcaccatagacctactamaattmtamagcttctatggm

Proc. Natl. Acad. Sci. USA 75 (1978) 619

which is protected by RNA polymerase from the attack ofDNase in vitro (28, 29). The sequence

4" 10" 20"G-C-G-G-T-G-G-T-A-T-C-T-G-T-G-T-C-C-G-C-A

is similar to the sequence

-16 -20 -30C--C-A-G-T-G-G-T-A-T-T-T-A-T-G-T-C-A-A-C-A

which is found in the promoter region (pl) of phage X. The se-quence

-21' -12'T-G-A-T-A-A-A-T-T-A

is exactly the same as the sequence seen in pl promoter region-56 to about -47, which is thought to be recognized by RNApolymerase.

Although experimental evidence is not yet available, thesimilarities between the nucleotide sequences in the IS1 endregions and in known promoter regions of various genes suggestthat RNA polymerase might play an important role in IS1-mediated recombination. We propose that the RNA polymerasecan recognize the DNA sequences at the ends of IS1 and thenopens up (or unwinds) these DNA strands. The recent reportby Ikeda and Kobayashi (30) on the results of their geneticanalysis of RNA polymerase involvement in a recA-indepen-dent (or illegitimate) pathway of genetic recombination in E.coli could be related to our suggestion that RNA polymeraseactivity may be required for illegitimate recombination.The polarity effect caused by IS1 could also be explained by

involvement of RNA polymerase binding regions that aregenerated by the insertion of IS1 and block the normal initiationof transcription of structural genes. However, several nonsensecodons occur in all three possible reading frames in the IS1 se-quences; this fact may be sufficient to explain polarity ef-fects.

Finally, it is interesting to note that a part of our sequenceat the right end of IS1,

1 10 20G-G-T-G-A-T-G-C-T-G-C-C-A-A-C-T-T-A-C-T-G-A-T

(which is inverted at the left end of IS1 as described above) isthe same as the nucleotide sequence seen in the attachment siteof phage X,

20 15 10 5T-T-T-T-A-T-A-A-T-G-C-C-A-A-C-T-T-A-G-T-A-T

which was determined by Landy and Ross (31). Although thismay be a coincidence, it is possible that these site-specific re-combination systems follow similar or identical pathways.We are grateful to A. Maxam for providing the details of the pro-

cedure for determining DNA sequences prior to publication. We alsothank H. Ohmori and J. I. Tomizawa for invaluable guidance with thesequencing techniques, H. Ohmori for providing various restrictionenzymes, J. Feingold for large-scale bacterial cultures, J. Rosen forcorrection of errors in the original manuscript, and J. Kates for hisconstant encouragement and his generous criticism during preparationof the manuscript. H.O. received partial support from U.S. PublicHealth Service Training Grant CA09176. This work was supported byU.S. Public Health Service Grant GM22007 to E.O.

1. Hirsch, H. J., Starlinger, P. & Brachet, P. (1972) Mol. Gen. Genet.119, 191-206.

2. Fiandt, M., Szybalski, W. & Malamy, M. H. (1972) Mol. Gen.Genet. 119,223-231.

3. Saedler, H. & Heiss, B. (1973) Mol. Gen. Genet. 122, 267-277.

4. Hu, S., Ohtsubo, E., Davidson, H. & Saedler, H. (1975) J. Bac-teriol. 122, 764-775.

5. Hu, S., Ptashne, K., Cohen, S. N. & Davidson, N. (1975) J. Bac-teriol. 123, 687-692.

6. Ohtsubo, H. & Ohtsubo, E. (1977) in DNA Insertion Elements,Plasmids and Episomes, eds. Bukhari, A. I., Shapiro, J. & Adhya,S. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY),pp. 591-594.

7. Starlinger, P., Saedler, H., Rak, B., Tillmann, E., Venkow, P. &Waltschewa, L. (1973) Mol. Gen. Genet. 122, 279-286.

8. Adhya, S., Gottesman, M., de Crombrugghe, B. & Court, D.(1976) in RNA Polymerase, eds. Losick, R. & Chamberlin, M.(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY), pp.719-730.

9. Saedler, H. & Starlinger, P. (1967) Mol. Gen. Genet. 100,178-189.

10. Tye, B. K., Chan, R. K. & Botstein, D. (1974) J. Mol. Biol. 85,485-500.

11. Kleckner, N., Chan, R. K., Tye, B. K. & Botstein, D. (1975) J. Mol.Biol. 97, 561-575.

12. Berg, D. E., Davis, J., Allet, B. & Rochaix, J. D. (1975) Proc. Natl.Acad. Sci. USA 72,3628-3632.

13. Mickel, S., Ohtsubo, E. & Bauer, W. (1977) Gene, in press.14. Chow, L. T. & Bukhari, A. I. (1977) in DNA Insertion Elements,

Plasmids and Episomes, eds. Bukhari, A. I., Shapiro, J. & Adhya,S. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY),pp. 295-306.

15. MacHattie, L. A. & Jackowski, J. B. (1977) in DNA InsertionElements, Plasmids and Episomes, eds. Bukhari, A. I., Shapiro,J. & Adhya, S. (Cold Spring Harbor Laboratory, Cold SpringHarbor, NY), pp. 219-228.

16. Ohtsubo, H. & Ohtsubo, E. (1977) in DNA Insertion Elements,Plasmids and Episomes, eds. Bukhari, A. I., Shapiro, J. & Adhya,A. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY),pp. 46-63.

17. Clewell, D. B. & Helinski, D. R. (1970) Biochemistry 9, 4428-4440.

18. Maxam, A. & Gilbert, W. (1977) Proc. Natl. Acad. Sci. USA 74,560-564.

19. Ohtsubo, E., Feingold, J., Ohtsubo, H., Mickel, S. & Bauer, W.(1977) Plasmid, 1, 8-18.

20. Grindley, N. G. F. (1977) in DNA Insertion Elements, Plasmidsand Episomes, eds. Bukhari, A. I., Shapiro, J. & Adhya, S. (ColdSpring Harbor Laboratory, Cold Spring Harbor, NY), pp.591-596.

21. Heffron, F., Rubens, C. & Falkow, S. (1975) Proc. Natl. Acad.Sci. USA 72,3623-3627.

22. Kopecko, D. J. & Cohen, S. N. (1975) Proc. Natl. Acad. Sci. USA72, 1373-1377.

23. Cohen, S. N. (1976) Nature 263, 731-738.24. Sekiya, T., Takeya, T., Coutreras, R., Kupper, H., Khorana, H.

G., & Landy, A. (1976) in RNA Polymerase, eds. Losick, R., &Chamberlin, M. (Cold Spring Harbor Laboratory, Cold SpringHarbor, NY), pp. 455-472.

25. Pirrotta, V. (1975) Nature 254, 114-117.26. Walz, A. & Pirrotta, V. (1975) Nature 254, 118-121.27. Pribnow, D. (1975) Proc. Natl. Acad. Sci. USA 72, 784-788.28. Maniatis, T., Ptashne, M., Barrell, B. G. & Donelson, J. (1974)

Nature 250, 394-397.29. Maniatis, T., Jeffrey, A. & Kleid, D. (1975) Proc. Natl. Acad. Sci.

USA 72, 1184-1188.30. Ikeda, H. & Kobayashi, I. (1977) Proc. Natl. Acad. Sci. USA 74,

3932-3936.31. Landy, A. & Ross, W. (1977) Science 197, 1147-1160.

Biochemistry: Ohtsubo and Ohtsubo

Dow

nloa

ded

by g

uest

on

Mar

ch 3

, 202

1