A Common Motif Organizes the Structure of Multi

Article No. mb982106
J. Mol. Biol. (1998) 283, 571±583
A Common Motif Organizes the Structure of
Multi-helix Loops in 16 S and 23 S Ribosomal RNAs
Neocles B. Leontis1 and Eric Westhof2*
1
Chemistry Department
Bowling Green State
University, Bowling Green
OH 43403, USA
2
Institut de Biologie
MoleÂculaire et Cellulaire du
CNRS, ModeÂlisations
et Simulations des Acides
NucleÂiques, UPR 9002
15 rue Rene Descartes
F-67084 Strasbourg Cedex
France
Phylogenetic and chemical probing data indicate that a modular RNA
motif, common to loop E of eucaryotic 5 S ribosomal RNA (rRNA) and
the a-sarcin/ricin loop of 23 S rRNA, organizes the structure of multihelix loops in 16 S and 23 S ribosomal RNAs. The motif occurs in the 30
domain of 16 S rRNA at positions 1345-1350/1372-1376 (Escherichia coli
numbering), within the three-way junction loop, which binds ribosomal
protein S7, and which contains nucleotides that help to form the binding
site for P-site tRNA in the ribosome. The motif also helps to structure a
three-way junction within domain I of 23 S, which contains many universally conserved bases and which lies close in the primary and secondary
structure to the binding site of r-protein L24. Several other highly conserved hairpin, internal, and multi-helix loops in 16 S and 23 S rRNA
contain the motif, including the core junction loop of 23 S and helix 27 in
the core of 16 S rRNA. Sequence conservation and range of variation in
bacteria, archaea, and eucaryotes as well as chemical probing and crosslinking data, provide support for the recurrent and autonomous existence
of the motif in ribosomal RNAs. Besides its presence in the hairpin ribozyme, the loop E motif is also apparent in helix P10 of bacterial RNase P,
in domain P7 of one sub-group of group I introns, and in domain 3 of
one subgroup of group II introns.
# 1998 Academic Press
*Corresponding author
Keywords: RNA; multi-helix loops; recurrent motif; phylogenetic analysis
Introduction
Single-stranded RNA molecules fold on themselves to form short double helices consisting of
classical Watson ±Crick base-pairs frequently interrupted by stretches of ``mismatched'' bases, comprising so-called internal loops, hairpin loops, and
multi-helix junction loops (Fresco et al., 1960). Universally conserved, nominally unpaired nucleotides often occur within these loops. That many of
these regions are in fact highly structured was ®rst
clearly indicated by chemical probing experiments
of 16 S rRNA (Moazed et al., 1986).
Phylogenetically based covariation analysis is
the most reliable means for determining secondary
structure in homologous RNA molecules (Michel
Abbreviations used: rRNA, ribosomal RNA; r-protein,
ribosomal protein; bp, base-pair; PSTV, potato spindle
tuber viroid; R, purine; Y, pyrimidine; CMCT,
1-cyclohexyl-3-(2N-morpholinethyl)-carbodiimide-methop-toluene sulfonate.
E-mail addresses of the corresponding authors:
[email protected] and
[email protected]
0022±2836/98/430571±13 $30.00/0
et al., 1982; Woese & Pace, 1993). It can also be
used to identify nucleotides involved in tertiary
interactions, when these co-vary in a Watson±
Crick pairing fashion, but not when these involve
universally conserved, single-stranded regions of
the already deduced secondary structure (Woese &
Gutell, 1989; Larsen, 1992). The modular approach
to sequence analysis of RNA is based on the
assumption that recurrent, conserved elements of
primary and secondary structure share common
three-dimensional (3D) structures and therefore
constitute building blocks for organizing the tertiary structure of RNA (Michel & Westhof,
1990). Such elements must be viewed as structural
units or motifs, since they comprise highly conserved nucleotides, which are paired non-canonically and which therefore do not co-vary in a
Watson± Crick fashion. Identi®cation of such
motifs within internal loops, followed by structural
characterisation using NMR and X-ray crystallography, advances our ability to model the secondary and tertiary architecture of native RNA
molecules, by elaborating a ``structural vocabulary'' applicable in evaluating possible interactions
in any structured RNA.
# 1998 Academic Press
572
Potential structural motifs are easiest to identify
when the conserved sets of nucleotides occur as
internal loops, bracketed on either side by
Watson ±Crick helices or hairpin loops. Such is the
case of the loop E motif ®rst identi®ed in 1985 as a
distinctive RNA structural motif when it was
found that very similar, conserved, asymmetric
internal loops in eucaryotic 5 S rRNA and potato
spindle tuber viroid (PSTV) shared an exceptional
propensity for UV crosslinking between speci®c
guanosine and uridine residues, as shown in
Figure 1A (Branch et al., 1985). The motif was also
Recurrent Motif in Multi-helix Loops in RNA
found in a conserved hairpin loop in domain VI of
23 S-like rRNA molecules (residues 2654± 2666 of
Escherichia coli 23 S and 4218 ± 4330 of rat 28 S
rRNA), which is the site of speci®c RNA modi®cation by the cytotoxic proteins a-sarcin and ricin
(Endo et al., 1993) and is directly involved in binding of elongation factors by the ribosome (Moazed
et al., 1988). It has been pointed out that the
element is likely to be widespread (Shen et al.,
1995) and that it may exist in other hairpin/stem
loops in 23 S rRNA (Wimberly et al., 1993;
Wimberly, 1994). UV crosslinking showed that the
Figure 1. A, Molecules in which the ``loop E motif'' has been established by NMR or UV cross-linking. The conserved nucleotides of the motif are boxed. Asterisks designate the G and U residues cross-linked upon irradiation
with UV light. From left to right: loop E from eucaryotic 5 S rRNA (305 sequences superimposed on that of X. laevis).
Loop E from Euryarchaeal 5 S rRNA (36 sequences superimposed on that of Methanococcus vanniellii). The 5 S
sequences are numbered according to the 5 S of X. laevis. sarcin/ricin stem from 23 S rRNA (sequence variations
from all three phylogenetic groups superimposed on E. coli sequence and numbering). Conserved core from PSTV
(Branch et al., 1985). Loop B from the hairpin ribozyme (Butcher & Burke, 1994a). Red letters: conserved nucleotides.
Green: base-pairs showing Watson ± Crick variation. Blue: residues within motif showing variation (for sarcin/ricin
stem see Table 1; for euryarchaeal 5 S rRNA: 13 A A, 4 G A, 5A G, 15 U A). Black: Other variable positions. B, Loop
E motifs in internal or hairpin loops of 23 S rRNA and 16 S rRNA. From left to right: 23 S rRNA helix 11; 23 S rRNA
helix 13; 23 S rRNA internal loop between helices 31a and 32. 16 S rRNA helix 27 in core region joining the 50 and
the central domains of the molecule. Residues are colour-coded as in A. Variations observed in three main phylogenetic groups superimposed on E. coli sequence and numbering system.
Recurrent Motif in Multi-helix Loops in RNA
motif exists in loop B of the hairpin ribozyme from
tobacco ringspot virus (Butcher & Burke, 1994a).
Crosslinked molecules retain enzymatic activity
(Butcher & Burke, 1994a; Burke et al., 1996). Here
we present evidence indicating that the motif is
not limited to hairpin stems or internal loops, but
also occurs in multi-helix junctions at critical sites
in the large ribosomal RNAs.
Results
Structural description of the motif
Both the sarcin/ricin loop from 23 S and the Eloop from 5 S rRNA have been studied as oligonucleotide fragments by high-resolution NMR, from
which very similar three-dimensional structures
were derived for the conserved nucleotides of the
motif (Wimberly et al., 1993; Szewczak & Moore,
1995). The structures also agree with the extensive
chemical probing data available for X. laevis 5 S
rRNA (Romaniuk et al., 1988; Westhof et al., 1989;
Romby et al., 1990). The salient structural features
identi®ed by NMR are shown in Figure 2. They
573
are: (1) a sheared A G pair; (2) a trans-Hoogsteen
U A pair; (3) a bulged G positioned in the deep
groove; and (4) a trans (locally parallel)-HoogsteenHoogsteen A A pair (boxed residues in Figure 1).
The sheared pairing of the conserved A G pair
(A77 G99 in the Xenopus sequence, A106 G130 in
the universal alignment) involves hydrogen bonding between AN6 and GN3 and between AN7 and
GN2. This leaves AN1 very exposed and reactive
towards DMS, whereas GN1 is partly exposed and
only reactive under semi-denaturing conditions in
X. laevis 5 S rRNA. Neither AN7 nor GN7 is reactive under native conditions. In the conserved,
trans-Hoogsteen U76 A100 (Xenopus numbering,
U104 A132 universal alignment), UN3 is Hbonded to AN7 leaving the Watson± Crick site
(N1) of A100 exposed and reactive while UN3 and
AN7 are protected in this pair. The N1 of the
bulged G75 (G103) is likely to be H-bonded to the
phosphate oxygen of A101 (A132), consistent with
observed lack of reactivity toward CMCT. Its N7,
however is completely exposed, as shown by the
fact that it is the most reactive base in 5 S rRNA
towards a sterically bulky nickel-organic reagent
Figure 2. Stereo view of the 3D structure of the sheared A G, trans-Hoogsteen U A, bulged G, and parallel A A
base-pairs of the loop E motif as determined by high-resolution NMR (Szewczak & Moore, 1995). The hydrogenbonding in these base-pairs is depicted to the right.
574
(Zheng et al., 1998). The direction of the ribose
phosphate chain is locally reversed at A74 (A102)
resulting in a symmetrical pairing between A74
and A101 (A133), which involves H-bonding of N7
and N6 of one adenine with N6 and N7 of the
other. Both bases are found to be reactive at N1
and unreactive at N7 in Xenopus 5 S rRNA.
On the basis of these structural features, one
expects to observe the following sequence variation
where the motif is evolutionarily conserved
(Leontis & Westhof, 1998): the sheared A G is
expected to covary with a sheared A A (Cate et al.,
1996a) but neither with the non-isosteric opposite
sheared G A pair nor with the G G opposition,
which would require another pairing geometry.
The U A pairing should be strictly conserved.
On the other hand, signi®cant variation is
expected at the position of the locally parallel
A A pair. One expects to see A G as well as
G A Hoogsteen-Hoogsteen (but no G G oppositions again). All R Y, Y R, and Y Y combinations can be accommodated by adopting a
trans-Watson± Crick geometry, while maintaining
the locally parallel strand orientation.
Recurrent Motif in Multi-helix Loops in RNA
Table 1. Sequence variation at the parallel base-pairs
(positions 2654 and 2666 in the E. coli sequence) in the
loop E motif of sarcin/ricin loop of 23 S rRNA in three
phylogenetic groups and mitochondria
Note the predominant pairing is AC in bacteria and mitochondria but AA in eucaryotes and Archaea and that most
variants are related to the dominant pair within a given phylogenetic group by a single mutation. (Number of sequences: 180,
bacterial; 64, eucaryotes; 24, archaeal; and 144, mitochondrial.)
Sequence variation in established
loop E motifs
To establish criteria for evaluating candidate
loop E motifs more precisely than by consensus
sequence comparisons, we examined the sequence
variations in the best-characterized molecules
known to harbour the motif, the sarcin/ricin stem
of 23 S rRNA and loop E of 5 S rRNA (Figure 1A).
We found that the motif exists not only in eucaryotic 5 S rRNA but also in the 5 S of the Euryarchaeota subgroup of Archaea. The sarcin/ricin stem
exists in the 23 S-like RNA of all three phylogenetic groups and of mitochondria. The bulged G,
trans-Hoogsteen U A and sheared A G pairs are
100% conserved in both 5 S groups. The only
exceptions in the sarcin/ricin stem are two mitochondial sequences, which have A A in place of
A G. As discussed above, A A can also form a
sheared base-pair isosteric to a sheared A G bp.
The A G and U A pairings are also conserved in
the viroid and hairpin-ribozyme versions of the
motif. The bulged base is a C in the viroids (Keese
& Symons, 1985) and a U in the hairpin ribozyme.
The symmetrical A A pairing is conserved in all
eucaryotic 5 S rRNAs. In 5 S of Euryarchaeota,
however, A A, A G, GA and U A are also
observed for this pairing. A A is the most common
opposition at the equivalent positions of eucaryotic
and archaeal sarcin/ricin stems, while A C is the
most common in bacterial and mitochondrial sarcin/ricin stems. The variations at this position in
sarcin/ricin loops are summarized in Table 1. The
variations observed in 5 S loop E and sarcin/ricin
loops fall into three groups: (1) R R (A A, A G,
G A, but never G G), (2) R Y and Y R (A Y and
Y A, but rarely GY or Y G), and (3) Y Y (U U,
U C). All observed R R pairings are compatible
with trans-Hoogsteen-Hoogsteen, as seen in the
NMR structure for A A. The other two groups are
both compatible with trans Watson± Crick/
Watson± Crick pairing. Next to the symmetrical
pair, another non-canonical pairing is found in
both 5 S loop E and the sarcin/ricin stem. In some
phylogenetic groups only Y Y pairings are found
here, while in others, wobble pairs (including A C
and C A) and Watson ± Crick pairs also occur. The
next base-pair is usually Watson± Crick, but in
some groups Y Y and wobble pairs are among the
variations. A Watson± Crick base-pair, which in
the sarcin/ricin site is a conserved C-G in all
phylogenetic groups, always follows the sheared
A G pair. Thus, Watson± Crick pairs ¯ank both
sides of the motif.
To summarize, the observed sequence variation
agrees with stereochemical expectations based on
the 3D structures. The U A is universally conserved, owing to the unique trans-Hoogsteen
pairing. The bulged base is not restricted to G.
Any pairing compatible with a trans-parallel
orientation of the bases is allowed immediately 50
to the bulged base. No insertions are observed
within the region bracketed by the parallel pair
and the sheared pair. In addition, it is observed
that the ®rst Watson± Crick pairing occurs at a
variable distance from the parallel base-pair; base
insertions or deletions may also occur here. The
data are insuf®cient yet for understanding the
possible structural constraints underlying these
sequence variations, although they do point to a
role akin to a ¯exible joint (see below). In the
575
Recurrent Motif in Multi-helix Loops in RNA
hairpin ribozyme, the ®rst Watson± Crick pairing
is several non-canonical pairings removed from
the loop E motif.
Loop E motifs in internal and hairpin loops of
23 S rRNA
Three hairpin or internal loops in addition to the
sarcin/ricin hairpin stem were identi®ed in 23 S
rRNA as loop E sites which meet these criteria.
These occur in the highly conserved helices 11 and
13 in domain I and in the internal loop between
helices 31a and 32 in domain II, in the 50 -most part
of the 23 S molecule. (The positions of the proposed loop E sites discussed in this paper within
the conventionally drawn secondary structures of
16 S and 23 S rRNA are shown in Figure 3). The
possible existence of loop E motifs in helices 11, 13,
and in the core of 23 S (see below) was previously
suggested, based on consensus sequence comparisons (Wimberly et al., 1993; Wimberly, 1994). The
E. coli versions of these are shown in Figure 1B, on
which sequence conservation and variations
observed in all three phylgenetic groups are superimposed. Very few deviations from the criteria
de®ned above are observed and these are restricted
to speci®c phylogenetic groups. These examples
show that the motif may exist in either orientation
closing a hairpin loop, which gives additional support for its autonomous existence. Footprinting
data are also consistent with existence of the motif
at these positions, as summarized in Table 2
(Egebjerg et al., 1987, 1990).
A loop-E motif in the core of 16 S rRNA
Universally conserved nucleotides 889-892/907909 comprise a loop E motif in a hairpin stem
(helix 27) in the core of 16 S rRNA (rightmost
panel, Figure 1B). The sheared pair is A A, with a
small number of A G in eucaryotes. The U A is
invariant in all groups. The bulged G shows some
variation (A, C, or U) and the symmetrical A A, a
very small number of AG, GA and AU variants.
Watson ±Crick or wobble pairs separate the motif
from a conserved GCAA hairpin loop; the pair
adjacent to the sheared A R pair is a conserved
Y-R pairing (C-G in all Archaea, U G in eucaryotes, C-G, U G or C A, in bacteria). Pairing of
positions 912± 910 with 885 ± 887 was recently proposed on the basis of site-directed mutagenesis,
completing helix 27 (Lodmell et al., 1995). This
leaves base 888 unpaired, but does not hinder formation of the motif. Helix 27 is part of a complex
junction, which includes two more helical elements
and a pseudo-knot to organize the 50 and central
domains of 16 S rRNA (Masquida et al., 1997). The
nucleotides of the loop E motif in helix 27 are protected from chemical probes both in the naked 16 S
rRNA and in the 30 S subunits (Moazed et al.,
1986). During the ®rst stages of in vitro assembly of
the RNA with r-proteins, residues G886 G890, on
the 50 side of the Loop E motif become more reac-
tive to chemical probes, indicating that a proteinmediated conformational change probably occurs
(Stern et al., 1988b). It was recently proposed that a
conformational switch occurs during translation in
which nucleotides 912 ±910 alternately pair with
885-887, as in Figure 1B, or with 888-890 (Lodmell
& Dahlberg, 1997). In the latter conformation the
loop E motif would be disrupted by pairing of the
bulged G890 with C910 and of the reversed A889
with U911. However, in isolated wild-type ribosomes, the predominant conformation is that
allowing formation of the loop E motif (Lodmell &
Dahlberg, 1997). Further, mutations favouring that
conformation are always less severe than those stabilizing the alternative conformation. It can be
added that helix H27 can be drawn with a loop E
motif in 16 S rRNA from mitochondria as well.
Interestingly, a conformational switch between a
structure involving a GNRA tetraloop and one
with a loop E motif occurs during the pathway of
the PSTV viroid processing (Baumstark et al.,
1997).
Three-way junctions organized by loop
E motifs
Three-way junctions occur widely in stable RNA
including ribosomal RNAs (5 S, 16 S and 23 S),
ribozymes, and viral RNAs. RNA three-3-way
junctions typically have a number of unpaired or
non-canonically paired bases in one or more
strands linking and organizing the helices in space,
as exempli®ed in the structure of the hammerhead
ribozyme (Pley et al., 1994). One of several threeway junctions which are found in the 30 major
domain of 16 S RNA binds S7 (Chiaruttini et al.,
1989; Dragon & Brakier Gingras, 1993) and forms
part of the P-site for tRNA binding (Moazed &
Noller, 1986). This junction is usually drawn in
secondary structure maps with 19 unpaired
nucleotides, many of which are universally conserved or show very restricted variation (Figure 3).
In fact, nucleotides 1345 ±1349 and 1373± 1376 form
a loop E motif bridging helices 29 and 43, which
together with helix 28 comprise the junction
(Figure 4A). The only variable position within the
motif is the parallel pairing between positions 1346
and 1375. This is a highly conserved A A in bacteria and in Archae (no exceptions), while mainly
A U in eucaryotes with variants A C, A A, C U,
G U, U A, C A, and U C, consistent with maintenance of the trans-trans pairing at this position.
A conserved U U (Y U in eucaryotes) can form
between positions 1345 and 1376. Usually U1345 is
shown Watson± Crick paired with conserved A938
in secondary structures of 16 S. Watson± Crick bp
1350 ± 1372 and 1344 ±939 ¯ank the motif. The exact
position of the three-way junction branch point
depends on how U1345 is paired, but in either case
the branching does not disrupt the motif. Chemical
probing of the Watson± Crick sites of nucleotides
in naked 16S rRNA showed that A1346, A1349,
A1374, A1375 within the motif are reactive,
Figure 3. Conventional secondary structures of 16 S rRNA and domains I and II of 23 S rRNA showing locations of proposed loop E sites. 16 S rRNA sites: helix 27 in
the core region (see also Figure 1B) and the three-way junction comprising helices 28, 29, and 43 in the 30 -major domain (see also Figure 4A). 23 S rRNA sites: internal loops
at helices 11, 13, 31a and 32 (see also Figure 1B); three-way junction formed by helices 3, 4, and 23 (see also Figure 4B); multi-helix loops adjacent to helices 21 and 36 (see
also Figure 5B). Not shown in this ®gure: the loop E motifs located in the sarcin/ricin stem in domain VI (see Figure 1A) and in the core region of 23 S rRNA (see
Figure 5A).
577
Recurrent Motif in Multi-helix Loops in RNA
Table 2. Chemical probing data of proven and proposed loop E motifs
Base-pair
Base-atom
5 S rRNA
23 S S/R helix
23 S helix 11
23 S helix 13
23 S loop 31a/32
16 S 3WJ 28/29/43
23 S 3WJ 3/4/23
23 S core
23 S MWJ 21
23 S MWJ 36
A-N1
R
r
R
r
r
R
r
r
R
r
Sheared
A-N7 G-N1
U
U
U
U
r
U
U
U
U
r
U
r
G-N7
U
R
UA trans-Hoogsteen
U-N3 A-N1 A-N7
U
U
U
U
U
r
r
r
U
r
R
r
r
r
r
R
r
r
U
r
U
U
Bulged G
G-N1 G-N7
U
U
U
r
ra
U
r
R
r
U
R
r
Parallel
A1-N1 A1-N7 A2-N1 A2-N7
R
r
r
r
Ub
R
r
U
U
Ub
U
U
R
r
r
r
r
R
R
r
U
r
U
U
R, very reactive; r, weakly reactive; U, unreactive. Blanks indicate undetermined. 3WJ stands for three-way junction and MWJ for
multi-way junction. The numbers indicate helices adjacent to the motifs identi®ed (see also Figures 3 to 5).
a
The bulged base is A in the probed sequence.
b
The pair is G A in the probed sequence. Probing data for 5 S rRNA are from Westhof et al. (1989), for 16 S from Moazed et al.
(1986) and Baudin et al. (1987), and for 23 S from Egebjerg et al. (1990).
whereas bulged G1347, U1348 and G1373 are not
(Moazed et al., 1986), in complete agreement with
the probing data of the corresponding positions of
5 S loop E discussed above (see Table 2). In the
30 S particle, these bases become protected from
reactive probes, due to the binding of protein S7
(see below).
A three-way junction in domain I of 23 S RNA
The three-way junction formed by helices 3, 4,
and 23 in domain I of 23 S rRNA also contains a
loop E motif. As in the 16 S junction, the motif
bridges two of the helical arms (helices 3 and 23)
creating potential continuous stacking between
them (Figure 4B). The branch point of the junction
occurs at the equivalent position: two pairings
from the bulged base or one after the A A pair.
However, the two junctions differ in the direction
of branching. In the 23 S three-way junction,
branching occurs in the strand containing the
unpaired G, while it was the opposite in the 16 S
junction, stressing again the autonomous existence
of the motif. Very little sequence variation is
observed across three phylogenetic groups.
Chemical probing of naked 23 S rRNA and 50 S
subunits supports the existence of the junction
(Table 2) (Egebjerg et al., 1990). A stretch of ®ve
other nominally unpaired bases within this
junction loop is also universally conserved
(G446 G450). The loop E motif in this junction
may, as in the 16 S three-way junction, organize
highly conserved single-stranded residues for
speci®c interactions with other RNA sequences.
The primary binding protein L24 binds to 23 S
rRNA at two sites within domain I, one of
which is adjacent in the primary and secondary
structure to this three-way junction (Egebjerg
et al., 1987). Thus the Loop E motif may also
help to structure the neighbouring region for
RNA ± protein interactions.
A loop E motif in the core of 23 S rRNA
The core of 23 S rRNA is closed by a helix
formed by the 50 and 30 termini of the molecule; it
is usually drawn in a circular format with the six
domains radiating outward (Airport Terminal
model, see Figure 5A). Covariation analysis
detected Watson± Crick interactions between positions 1262± 2017, 1269 ± 2011, and 1270 ±2010 in
23 S rRNA (Gutell & Woese, 1990), suggesting that
an irregular helix forms between nucleotides 1262
and 1270, which link domains II and III, and residues 2010 and 2017, which link domains IV and V
(Haselman et al., 1989). In fact these nucleotides
comprise a loop E motif, highly conserved across
all three phylogenetic groups and also found in
the highly reduced mitochondrial sequences
(Figure 5A). The only sequence variations occur at
the symmetrical A A parallel pairing which is conserved in bacterial and archaeal sequences but displays three C A and one A G variants in
eucaryotes. The folding of the motif creates a
stretch of continuous base-pairing between
domains II and IV. Disruption of either Watson±
Crick basepair 1262± 2017 or 1269± 2011, ¯anking
the motif was shown by site-directed mutagenesis
to impair the ability of the 23 S to engage in protein synthesis (Aagaard & Douthwaite, 1994). All
adenosine residues in the motif were found to be
slightly reactive with DMS in the naked 23 S
rRNA. Double-helix speci®c CV nuclease cut
strongly at the Watson± Crick sites ¯anking the
motif and weakly within it, giving further proof
for its existence. The motif is completely protected
from chemical and enzymatic probes in the 50 S
ribosome, consistent with multiple RNA-RNA or
RNA-protein interactions in the folded RNA.
Loop E motifs in multi-helix loops in domain I
and II of 23 S
A loop E motif conserved in all three phylogenetic groups is found at positions 1186± 1190/817±
578
Recurrent Motif in Multi-helix Loops in RNA
Figure 4. A, Three-way in 16 S rRNA comprising helices 28, 29, and 43, which contains a loop E motif in all phylogenetic groups. Left: as usually drawn. Right: redrawn to highlight loop E motif bridging helices 29 and 43 (note
rotation of junction relative to conventional view). Residues coloured as in Figure 1A. Variations observed in all three
phylogenetic groups superimposed on E. coli sequence and numbering system. A A pair in motif is conserved in bacteria and Archaea, but is predominantly A U in eucaryotes with variants A C, A A, C U, GU, U A, C A, and U C.
Box indicates inserted C present in eucaryotic and archaeal sequences. B, Three-way junction in domain I of 23 S
rRNA comprising helices 3, 4, and 23 containing Loop E motif. Left: as usually drawn. Right: redrawn to highlight
loop E motif bridging helices 3 and 23 (note rotation of junction). Residues colored as in Figure 1A. Variations
observed in all three phylogenetic groups superimposed on E. coli sequence and numbering system.
821 in E. coli 23 S at the base of helix 36, which
connects the seven-way junction loop of domain II
to the rest of the molecule. The motif creates potential stacking between helix 36 and either helix 37 or
helix 45. The branch point occurs next to the parallel base-pairs and does not interrupt the motif. The
parallel pair is predominantly A A in eucaryotic
23 S, G A in bacteria, and mixed in Archaea.
Another loop E motif is found in the variable junction in domain I adjacent to helix 21. Sequence
variations for both motifs are superimposed on the
E. coli sequence (Figure 5B) and chemical probing
data are summarized in Table 2 (Egebjerg et al.,
1990).
Loop E motifs in catalytic RNAs
Besides ribosomal RNAs and the other known
examples, PSTV and the hairpin ribozyme, the
loop E motif probably exists in other structured or
catalytic RNAs. The loop E motif has thus been
identi®ed in a subfamily of type B RNase P
sequences, in helix P10.1, where it replaces a regular Watson± Crick helix (Massire et al., 1998). The
loop E motif is also apparent in the P7.1 ± P7.2 subdomain of subgroup IA2 of group I introns (Michel
& Westhof, 1990) and in domain 3 of one class of
group II introns (Michel et al., 1989). These motifs
are illustrated in Figure 6.
579
Recurrent Motif in Multi-helix Loops in RNA
Figure 5. A, Core region of 23 S rRNA containing loop E motif. Left: schematic view as conventionally drawn.
Right: redrawn highlighting the loop E motif, which potentially bridges helix 26 of domain II and helix 61 of domain
IV. Residues colored as in Figure 1A. Variations observed in all three phylogenetic groups superimposed on E. coli
sequence and numbering system. B, Multi-helix junctions from 23 S rRNA containing loop E motifs. Left: loop E
motif at base of helix 21 in multi-helix loop in domain I. Bacterial and archaeal sequences only (eucaryotic sequences
do not retain the AG sheared pairing). Right: loop E motif in seven-way junction in domain II of 23 S rRNA, conserved in three phylogenetic groups. The motif potentially bridges helices 36 and 37 (as shown) or helices 36 and 45.
Unpaired G is conserved in eucaryotes and Archaea, but may be G, A or U in bacteria. The parallel pair is predominantly G A in Bacteria and Archaea and A A in eucaryotes. Other variants: bacteria: A A and G C; eucaryotes: G A
and U A; Archaea A A, C A, G U, U A and one G G. Residues colored as in Figure 1A.
Discussion
Evolutionary implications
Why are there so many loop E sites in the large
RNAs, and why are these so highly conserved?
The fact that they are conserved in all phylogenetic
groups indicates that loop E motifs were present in
the Ur-ribosome. Their occurrence at sites involved
in binding primary proteins and RNA sequences
which de®ne the domain structure of 16 S and 23 S
rRNA suggests that they evolved within internal
and junction loops due to their special ability to
participate in the organization and structuring of
multiple junctions as well as in RNA-RNA and
RNA-protein interactions. The fact that the Watson ±Crick sites of all the adenosine residues in the
motif are available for interaction is noteworthy
and parallels the situation found in the GNRA
loop motifs and their receptors (Jaeger et al., 1994;
Cate et al., 1996b).
Protein binding by loop E motifs
It is well known that loop E of eucaryal 5 S
rRNA forms part of the binding site for ribosomal
protein L5 (Allison et al., 1991) and transcription
factor TFIIIA (Romaniuk, 1989), while elongation
factors and cytotoxins interact with the sarcin/ricin
loop of 23 S rRNA. The bulged G must be present
580
Recurrent Motif in Multi-helix Loops in RNA
Figure 6. Loop E motifs identi®ed in ribozymes other than the
hairpin ribozyme (see Figure 1A).
From left to right: group I ribozyme, subgroup A2, P7.1 and 7.2
extension (Michel & Westhof,
1990); group II, subgroup a,
domain III (Michel et al., 1989); and
RNase P, type B, loop L10.1
(Massire et al., 1998).
for speci®c recognition and modi®cation of the
RNA motif by a-sarcin (Wool et al., 1992). The
interaction of elongation factors EF-G and EF-Tu
with the sarcin/ricin loop of 23 S rRNA was
demonstrated using chemical footprinting (Moazed
et al., 1988). Protection was observed upon binding
of EF-Tu or EF-G within the motif at the bulged
G2655 and at A2665, which is trans-Hoogsteen
paired to U2656. In the absence of factors, the
Watson ±Crick faces of A2665 and G2655, but not
A2654 (which probably interacts with C2666 in the
E. coli sequence) are reactive.
S7 is the sole primary binding protein in the 30
domain of 16 S. Foot-printing and cross-linking
experiments show that nucleotides comprising the
loop E motif of the 16 S three-way junction in
Figure 4A form part of the binding site of this protein (Wower & Brimacombe, 1983; Powers et al.,
1988). Concise footprints obtained using solventgenerated hydroxyl radicals show that S7 protects
nucleotides on the 30 side of the loop E motif of the
three-way junction, and in helices 29 and 43, which
are bridged by the motif (Powers & Noller, 1995).
The 50 -stagger of six to seven nucleotides observed
in the footprint is consistent with axial binding to
the deep groove of the RNA. The remaining
unpaired bases of the three-way junction are
lightly protected from hydroxyl radicals by S7.
However, some nucleotides become more accessible upon protein binding. Most notably, U1381,
adjacent to the loop E motif becomes reactive to
kethoxal upon binding by S7 and is strongly reactive in 30 S subunits. It is protected by tRNA binding to ribosomes (Moazed & Noller, 1986; Moazed
et al., 1986; Powers et al., 1988). These data suggest
that the loop E motif of this three-way junction
may play two roles: to provide a binding site
for S7, which helps to ®ne-tune the local RNA
structure, and to interact with other nucleotides in
the junction or in the nearby four-way junction to
help form the binding site for the tRNA anti-codon
loop. The structure of S7, recently solved using
X-ray crystallography (Hosaka et al., 1997;
Wimberly et al., 1997), indicates extensive protein
surface for interaction with double-helical RNA.
The majority of loop E motifs found in 23 S
rRNA occur in the 50 half of the molecule. In fact,
four occur in domain I, the ®rst to be transcribed
and the ®rst to fold (Tumminia et al., 1994). Chemical and enzymatic probing of 23 S rRNA, isolated
as well bound to r-proteins in the ribosome, indicates that domain I is more highly structured and
compact than the other domains of the molecule
(Egebjerg et al., 1987, 1990; DoÈring et al., 1991).
Addition of a single, relatively small ribosomal
protein, L24, the only primary binding protein
which binds to domain I, completes the folding of
the domain and initiates assembly of the 50 S subunit (CabezoÂn et al., 1977; Spillman & Nierhaus,
1978; Tumminia et al., 1994). L24 has been footprinted to two sites (Egebjerg et al., 1987), one of
which is adjacent in the primary and secondary
structure to the three-way junction which contains
the loop E motif, and which may therefore play a
role in helping to structure the neighbouring RNA
for interaction with L24.
Loop E and RNA-RNA interactions
In the hairpin ribozyme, it has been shown that
the internal loop containing the Loop E motif must
interact with the loop containing the cleavable
phosphate for catalysis to occur (Burke, 1996;
Burke et al., 1996). A recent model with side-byside interaction of the two internal loops has been
proposed (Earnshaw et al., 1997).
Protection by ribosomal proteins S5 and S12 of
bases in the multi-branch loop at the junction of
the three major domains of 16 S rRNA, including
those belonging to the loop E motif that we have
identi®ed, was observed using base-speci®c probes
(Stern et al., 1988a), but not using hydroxyl radicals
generated with the Fe2‡/EDTA reagent (Powers &
Noller, 1995). Hydroxyl radicals react primarily
with the ribose moieties of the backbone,
suggesting that the base modi®cations were due to
conformational changes induced by binding of
these proteins to other regions of the RNA (notably
the nearby 530 stem-loop region). Thus, it may be
that the loop E motif in the core of the 16 S rRNA
581
Recurrent Motif in Multi-helix Loops in RNA
is involved in RNA-RNA rather than RNA-protein
interactions. In support of this, two distinct RNARNA crosslinks connecting G894 (in the duplex
just above the sheared A A bp of the motif) with
bases U244 in the early-folding 50 domain and
A1468 in the 30 -terminal domain indicate the central role of the core in organizing the domain structure of 16 S rRNA (Wilms et al., 1997).
Crosslinking is also observed between U1345, at
the base of the loop E motif in the 16 S three-way
junction, and unpaired C934, in the same junction
(Wilms et al., 1997). Unpaired bases in the junction,
U1376 and C1378 have been crosslinked directly to
ribosome-bound tRNA (DoÈring et al., 1994). Taken
together with foot-printing of U1381 by bound
tRNA, discussed above, the evidence is strong for
a role for the loop E motif in structuring this threeway junction for RNA-RNA interaction with
tRNA.
Conclusions
Comparative sequence analysis and chemical
probing data show that the loop E motif is a recurrent, autonomous structural element that occurs
not only within RNA ``internal'' but also around
``multi-helix'' loops, which points to crucial and
multiple roles in structuring and organizing the
junctions. Although loop E motifs are often footprinted by proteins, their involvement as RNARNA anchors cannot be excluded, as shown by the
example of the hairpin ribozyme (Butcher & Burke,
1994b; Earnshaw et al., 1997). The speci®city and
the structure of the binding motif, of either protein
or RNA are still unknown.
Methods
Sequence analysis was carried using the COSEQ
program (Massire & Westhof, unpublished). We made
extensive use of the secondary structures of the 16 S,
23 S and group I introns posted on the Web (http://
pundit.colorado.edu:8080/RNA) (Gutell et al., 1993;
Damberger & Gutell, 1994; Gutell, 1994). 5 S rRNA
sequences were obtained from the Berlin database.
Nucleotides are numbered consecutively according to
the Xenopus laevis sequence (Specht et al., 1997). Large
(23 S-like) and small (16 S-like) ribosomal subunit
RNA sequences were obtained from the Antwerp
database and the Ribosomal Database Project (Maidak
et al., 1997; van de Peer et al., 1997a,b). Sequences
belonging to the three main phylogenetic groups
(Woese et al., 1990), eucaryotes (1581 16 S and 64 23 S
rRNA sequences), Archaea (379 16 S and 24 23 S
rRNA sequences), and bacteria (4188 16 S and 180
23 S rRNA sequences) were downloaded and analysed
separately. At positions harbouring likely loop E
motifs, the alignments were checked for insertions or
deletions as an initial screen for the existence of the
motif at a given site. When the absence of insertions
(or deletions) was con®rmed, covariations between the
pairings were analysed using COSEQ. Numbering of
nucleotides and helices for 16 S and 23 S rRNAs are
given according to the Escherichia coli sequences
(Brimacombe, 1991).
Acknowledgements
We are grateful to Christian Massire for help with the
use of the COSEQ program program. N.L. was a Fogerty Senior International Fellow, supported by NIH grants
(1-F06-TW02251-01) and 1R15-GM/OD55898-01 (NBL),
ACS-PRF grant no. 31427-B4 and by a ``Poste Rouge''
from the CNRS. E.W. acknowledges support from the
Institut Universitaire de France.
References
Aagaard, C. & Douthwaite, S. (1994). Requirement for a
conserved, tertiary interaction in the core of 23 S
ribosomal RNA. Proc. Natl Acad. Sci. USA, 91,
2989± 2993.
Allison, L. A., Romaniuk, P. J. & Bakken, A. H. (1991).
RNA-Protein interactions of stored 5 S RNA with
TFIIIA and ribosomal protein L5 during Xenopus
oogenesis. Develop. Biol. 144, 129± 144.
Baudin, F., Ehresmann, C., Romby, P., Mougel, M.,
Colin, J., Lempeureur, L., Bachellerie, J. P., Ebel, J. P.
& Ehresmann, B. (1987). Higher-order structure of
domain III in E. coli 16 S ribosomal RNA, 30 S subunit and 70 S ribosome. Biochimie, 69, 1081± 96.
Baumstark, T., Schroder, A. R. W. & Riesner, D. (1997).
Viroid processing: switch from cleavage to ligation
is driven by a change from a tetraloop to a Loop E
conformation. EMBO J. 16, 599± 610.
Branch, A. D., Benenfeld, B. J. & Robertson, H. D.
(1985). Ultraviolet light-induced crosslinking reveals
a unique region of local tertiary structure in potato
spindle tuber viroid andHeLa 5 S RNA. Proc. Natl
Acad. Sci. USA, 82, 6590± 6594.
Brimacombe, R. (1991). RNA-protein interactions in the
Escherichia coli ribosome. Biochimie, 73, 927± 936.
Burke, J. M. (1996). Hairpin ribozyme: current status
and future prospects. Biochem. Soc. Trans. 24, 608±
615.
Burke, J. M., Butcher, S. E. & Sargueil, B. (1996). Structural analysis and modi®cations of the hairpin ribozyme. Nucl. Acids Mol. Biol. 10, 129± 143.
Butcher, S. E. & Burke, J. M. (1994a). A photo-cross-linkable tertiary structure motif found in functionally
distinct RNA molecules is essential for catalytic
function of the hairpin ribozyme. Biochemistry, 33,
992± 999.
Butcher, S. E. & Burke, J. M. (1994b). Structure-mapping
of the hairpin ribozyme: magnesium-dependent
folding and evidence for tertiary interactions within
the ribozyme-substrate complex. J. Mol. Biol. 244,
52 ±63.
CabezoÂn, T., Herzog, A., Petre, J., Yaguchi, M. & Bollen,
A. (1977). Ribosomal assembly de®ciency in an
E. coli thermosensitive mutant having an altered
L24 ribosomal protein. J. Mol. Biol. 116, 361± 374.
Cate, J. H., Gooding, A. R., Podell, E., Zhou, K., Golden,
B. L., Kundrot, C. E., Cech, T. R. & Doudna, J. A.
(1996a). Crystal structure of a Group I ribozyme
domain: principles of RNA packing. Science, 273,
1678± 1685.
Cate, J. H., Gooding, A. R., Podell, E., Zhou, K., Golden,
B. L., Szewczak, A. A., Kundrot, C. E., Cech, T. R.
& Doudna, J. A. (1996b). RNA tertiary structure
mediation by adenosine platforms. Science, 273,
1696± 1699.
Chiaruttini, C., Milet, M., Hayes, D. H. & Expert,
B. A. (1989). Crosslinking of ribosomal pro-
582
teins S4, S5, S7, S8, S11, S12 and S18 to domains 1
and 2 of 16 S rRNA in the Escherichia coli 30 S particle. Biochimie, 71, 839± 852.
Damberger, S. H. & Gutell, R. R. (1994). A comparative
database of group I intron structures. Nucl. Acids
Res. 22, 3508± 3510.
DoÈring, T., Greuer, B. & Brimacombe, R. (1991). The
three-dimensional folding of ribosomal RNA: Localization of a series of intra-RNA cross-links in 23 S
RNA induced by treatment of E. coli 50 S ribosomal
subunits
with
bis-(2-chloroethyl)-methylamine.
Nucl. Acids Res. 19, 3517±3524.
DoÈring, T., Mitchell, P., Osswald, M., Bochkariov, D. &
Brimacombe, R. (1994). The decoding region of 16 S
RNA: a cross-linking study of the ribosomal A, P
and E sites using tRNA derivatized at position 32
in the anticodon loop. EMBO J. 13, 2677± 2685.
Dragon, F. & Brakier, Gingras L. (1993). Interaction of
Escherichia coli ribosomal protein S7 with 16 S
rRNA. Nucl. Acids Res. 21, 1199± 1203.
Earnshaw, D. J., Masquida, B., MuÈller, S., Sigurdsson,
S. T., Eckstein, F., Westhof, E. & Gait, M. J. (1997).
Inter-domain cross-linking and molecular modeling
of the hairpin ribozyme. J. Mol. Biol. 274, 197±212.
Egebjerg, J., Leffers, H., Christensen, A., Andersen, H. &
Garrett, R. A. (1987). Structure and accessibility of
domain I of Escherichia coli 23 S RNA in free RNA,
in the L24 ±RNA complex and in 50 S subunits.
J. Mol. Biol. 196, 125±136.
Egebjerg, J., Larsen, N. & Garrett, R. A. (1990). Structural map of 23 S rRNA. In The Ribosome: Structure,
Function and Evolution (Hill, W. E., Dahlberg, A.,
Garrett, R. A., Moore, P. B., Schlessinger, D. &
Warner, J. R., eds), pp. 168± 179, American Society
for Microbiology, Washington, DC.
Endo, Y., Gluck, A. & Wool, I. G. (1993). Ribosomal
RNA identity elements for recognition by ricin and
by alpha-sarcin: mutation in the putative CG pair
that closes a GAGA tetraloop. Nucl. Acids Symp Ser.
29, 165± 166.
Fresco, J. R., Alberts, B. M. & Doty, P. (1960). Some molecular details of secondary structure of ribonucleic
acid. Nature, 188, 98 ± 101.
Gutell, R. R. (1994). Collection of small subunit (16 S
and 16 S-like) ribosomal RNA structures: 1994.
Nucl. Acids Res. 22, 3502±3507.
Gutell, R. R. & Woese, C. (1990). Higher order structural
elements in ribosomal RNAs: pseudoknots and the
use of noncanonical pairs. Proc. Natl Ac. Sci. USA,
87, 663± 667.
Gutell, R. R., Gray, M. W. & Schnare, M. N. (1993).
A compilation of large subunit (23 S and 23 S-like)
ribosomal RNA structures: 1993. Nucl. Acids Res. 21,
3055± 3074.
Haselman, T., Gutell, R. R., Jurka, J. & Fox, G. E. (1989).
Additional Watson ± Crick interactions suggest a
structural core in large subunit ribosomal RNA.
J. Biomol. Struct. Dynam. 7, 181± 186.
Hosaka, H., Nakagawa, A., Tanaka, I., Harada, N.,
Sano, K., Kimura, M., Yao, M. & Wakatsuki, S.
(1997). Ribosomal protein S7: a new RNA-binding
motif with structural similarities to a DNA architectural factor. Structure, 5, 1199± 1208.
Jaeger, L., Michel, F. & Westhof, E. (1994). Involvement
of a GNRA retraloop in long-range RNA tertiary
interactions. J. Mol. Biol. 236, 1271± 1276.
Keese, P. & Symons, R. H. (1985). Domains in Viroids:
Evidence of intermolecular RNA rRearrangements
Recurrent Motif in Multi-helix Loops in RNA
and their contribution to viroid evolution. Proc. Natl
Acad. Sci. USA, 82, 4582± 4586.
Larsen, N. (1992). Higher order structure in 23 S rRNA.
Proc. Natl Acad. Sci. USA, 89, 5044± 5048.
Leontis, N. B. & Westhof, E. (1998). The 5 S rRNA Loop
E: chemical probing and phylogenetic data vs. crystal structure. RNA, in the press.
Lodmell, J. S. & Dahlberg, A. E. (1997). A conformational switch in Escherichia coli 16 S ribosomal RNA
during decoding of messenger RNA. Science, 277,
1262± 1267.
Lodmell, J. S., Gutell, R. R. & Dahlberg, A. E. (1995).
Genetic and comparative analyses reveal an alternative secondary structure in the region of nt 912 of
Escherichia coli 16 S rRNA. Proc. Natl Acad. Sci. USA,
92, 10555± 10559.
Maidak, B. L., Olsen, G. J., Larsen, L., Overbeek, R.,
McCaughey, M. & Woese, C. R. (1997). The Ribosomal Database Project. Nucl. Acids Res. 25, 109±111.
Masquida, B., Felden, B. & Westhof, E. (1997). Context
dependent RNA-RNA recognition in a three-dimensional model of the 16 S rRNA core. Bio. Med.
Chem. 5, 1021±1035.
Massire, C., Jaeger, L. & Westhof, E. (1998). Derivation
of the three-dimensional architecture of bacterial
ribonuclease P RNAs from comparative sequence
analysis. J. Mol. Biol. 279, 773± 793.
Michel, F. & Westhof, E. (1990). Modelling of the threedimensional architecture of group-I catalytic introns
based on comparative sequence analysis. J. Mol.
Biol. 216, 585± 610.
Michel, F., Jaquier, A. & Dujon, B. (1982). Comparison
of fungal mitochondrial introns reveals extensive
homologies in RNA secondary structure. Biochimie,
64, 867± 881.
Michel, F., Umesono, K. & Ozeki, H. (1989). Comparative and functional anatomy of Group II catalytic
introns ± a review. Gene, 82, 5 ± 30.
Moazed, D. & Noller, H. F. (1986). Transfer RNA shields
speci®c nucleotides in 16S ribosomal RNA from
attack by chemical probes. Cell, 47, 985± 994.
Moazed, D., Stern, S. & Noller, H. F. (1986). Rapid
chemical probing of conformation in 16 S ribosomal
RNA and 30 S ribosomal subunits using primer
extension. J. Mol. Biol. 187, 399± 416.
Moazed, D., Robertson, J. M. & Noller, H. F. (1988).
Interaction of elongation factors EF-G and EF-Tu
with a conserved loop in 23 S RNA. Nature, 334,
362± 364.
Pley, H. W., Flaherty, K. M. & McKay, D. B. (1994).
Three-dimensional structure of a hammerhead ribozyme. Nature, 372, 68 ± 74.
Powers, T. & Noller, H. F. (1995). Hydroxyl radical footprinting of ribosomal proteins on 16 S rRNA. RNA,
1, 194± 201.
Powers, T., Craven, G. R., Changchien, L. M. & Noller,
H. F. (1988). Probing the assembly of the 30 major
domain of 16 S ribosomal RNA. Quaternary interactions involving ribosomal proteins S7, S9 and S19.
J. Mol. Biol. 200, 309± 319.
Romaniuk, P. J. (1989). The role of highly conserved
single-stranded nucleotides of Xenopus 5 S RNA in
the binding of transcription factor IIIA. Biochemistry,
28, 1388± 1395.
Romaniuk, P. J., Stevenson, I. L., Ehresmann, C.,
Romby, P. & Ehresmann, B. (1988). A comparison
of the solution structures and conformational properties of the somatic and oocyte 5 S rRNA of Xenopus laevis. Nucl. Acids Res. 16, 2295± 2312.
Recurrent Motif in Multi-helix Loops in RNA
Romby, P., Baudin, F., Brunel, C., Leal de Stevenson, I.,
Westhof, E., Romaniuk, P. J., Ehresmann, C. &
Ehresmann, B. (1990). Ribosomal 5 S RNA from
Xenopus laevis oocytes: conformation and interaction with transcription factor IIIA. Biochimie, 72,
437± 452.
Shen, L. X., Cai, Z. & Tinoco, I. (1995). RNA structure at
high resolution. FASEB J. 9, 1023± 1033.
Specht, T., Szymanski, M., Barciszerwska, M. Z.,
Barciszewski, J. & Erdmann, V. A. (1997). Compilation of 5 S rRNA and 5 S rRNA gene sequences.
Nucl. Acids Res. 25, 96± 97.
Spillman, S. & Nierhaus, K. H. (1978). The ribosomal
protein L24 of E. coli is an assembly protein. J. Biol.
Chem. 253, 7047± 7050.
Stern, S., Powers, T., Changchien, L. M. & Noller, H. F.
(1988a). Interaction of ribosomal proteins S5, S6,
S11, S12, S18 and S21 with 16 S rRNA. J. Mol. Biol.
201, 683± 695.
Stern, S., Weiser, B. & Noller, H. F. (1988b). Model for
the three-dimensional folding of 16 S ribosomal
RNA. J. Mol. Biol. 204, 447± 481.
Szewczak, A. A. & Moore, P. B. (1995). The sarcin/ricin
loop, a modular RNA. J. Mol. Biol. 247, 81 ± 98.
Tumminia, S. J., Hellmann, W., Wall, J. S. & Boublik, M.
(1994). Visualization of protein-nucleic acid interactions involved in the in vitro assembly of the
Escherichia coli 50 S ribosomal subunit. J. Mol. Biol.
235, 1239± 1250.
van de Peer, Y., Jansen, J., de Rijk, P. & de Wachter, R.
(1997a). Database on the structure of large ribosomal subunit RNA. Nucl. Acids Res. 25, 117±123.
van de Peer, Y., Jansen, J., de Rijk, P. & de Wachter, R.
(1997b). Database on the structure of small ribosomal subunit RNA. Nucl. Acids Res. 25, 111±116.
Westhof, E., Romby, P., Romaniuk, P. J., Ebel, J. P.,
Ehresmann, C. & Ehresmann, B. (1989). Computer
modelling from solution data of spinach chloroplast
and of Xenopus laevis somatic and oocyte 5 S
rRNAs. J. Mol. Biol. 207, 417± 431.
583
Wilms, C., Noah, J. W., Zhong, D. & Wollenzien, P.
(1997). Exact determination of UV-induced crosslinks in 16 S ribosomal RNA in 30 S ribosomal subunits. RNA, 3, 602± 612.
Wimberly, B. (1994). A common RNA loop motif as a
docking module and its function in the hammerhead ribozyme. Nature Struct. Biol. 1, 820± 827.
Wimberly, B., Varani, G. & Tinoco, I., Jr (1993). The conformation of loop E of eukaryotic 5S ribosomal
RNA. Biochemistry, 32, 1078 ±1087.
Wimberly, B., White, S. W. & Ramakrishnan, V. (1997).
Ê resThe structure of ribosomal protein S 7 at 1.9 A
olution reveals a beta-motif that binds doublestranded nucleic acids. Structure, 5, 1187± 1198.
Woese, C. R. & Gutell, R. R. (1989). Evidence for several
higher order structural elements in ribosomal RNA.
Proc. Natl Acad. Sci. USA, 86, 3119± 3122.
Woese, C. R. & Pace, N. R. (1993). Probing RNA structure, function and history by comparative analysis.
In The RNA World (Gesteland, R. F. & Atkins, J. F.,
eds), Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY.
Woese, C. R., Kandler, O. & Wheelis, N. L. (1990).
Towards a natural system of Oorganisms: a proposal for the domains of Archaea, Bacteria, and
Eucarya. Proc. Natl Acad. Sci. USA, 87, 4576± 4579.
Wool, I. G., GluÈck, A. & Endo, Y. (1992). Ribotoxin recognition of ribosomal RNA and a proposal for the
mechanism of translocation. Trends Biochem. Sci. 17,
266± 269.
Wower, I. & Brimacombe, R. (1983). The localization of
multiple sites on 16 S RNA which are cross-linked
to proteins S7 and S8 in Escherichia coli 30 S ribosomal subunits by treatment with 2-iminothiolane.
Nucl. Acids Res. 11, 1419 ±1437.
Zheng, P., Burrows, C. J. & Rokita, S. E. (1998). Nickeland cobalt-dependent reagents identify structural
features of RNA that are not detected by dimethyl
sulfate or RNase T1. Biochemistry, 37, 2207±2214.
Edited by K. Nagai
(Received 17 March 1998; received in revised form 13 July 1998; accepted 29 July 1998)