Bacterial group II introns: not just splicing Nicolás Toro, José Ignacio Jiménez-Zurdo & Fernando Manuel Garcı́a-Rodrı́guez Grupo de Ecologı́a Genética de la Rizosfera, Estación Experimental del Zaidı́n, Consejo Superior de Investigaciones Cientı́ficas, Granada, Spain Correspondence: Nicolás Toro, Grupo de Ecologı́a Genética de la Rizosfera, Estación Experimental del Zaidı́n, Consejo Superior de Investigaciones Cientı́ficas, Profesor Albareda 1, 18008 Granada, Spain. Tel.: 134 958181600; fax: 134 958129600; e-mail: [email protected] Received 30 November 2006; revised 12 January 2007; accepted 12 January 2007. First published online March 2007. DOI:10.1111/j.1574-6976.2007.00068.x Abstract Group II introns are both catalytic RNAs (ribozymes) and mobile retroelements that were discovered almost 14 years ago. It has been suggested that eukaryotic mRNA introns might have originated from the group II introns present in the alphaproteobacterial progenitor of the mitochondria. Bacterial group II introns are of considerable interest not only because of their evolutionary significance, but also because they could potentially be used as tools for genetic manipulation in biotechnology and for gene therapy. This review summarizes what is known about the splicing mechanisms and mobility of bacterial group II introns, and describes the recent development of group II intron-based gene-targetting methods. Bacterial group II intron diversity, evolutionary relationships, and behaviour in bacteria are also discussed. Editor: Rafael Giraldo Keywords catalytic RNA; homing; splicing; retroelements; Lactococcus lactis ; Sinorhizobium meliloti . Introduction: group II introns Group II introns are large catalytic RNAs (ribozymes) and mobile retroelements [reviewed by Pyle & Lambowitz (2006)] that splice by means of a lariat intermediate, in a mechanism similar to that of spliceosomal introns. The transfer of these introns either to intronless alleles (retrohoming) or to ectopic sites (retrotransposition) is mediated by ribonucleoprotein (RNP) complexes consisting of the intron-encoded protein (IEP) and the excised intron RNA lariat. Group II introns were initially identified in the mitochondrial and chloroplast genomes of lower eukaryotes and plants (Michel et al., 1989). They were later identified in bacteria (Ferat & Michel, 1993), and have recently been found in the archaeal genus Methanosarcinales (Toro, 2003; Dai & Zimmerly, 2003; Rest & Mindell, 2003). It is thought that both nuclear spliceosomal introns and non-Long Terminal Repeat (LTR) retrotransposons evolved from mobile group II introns [Sharp, 1985; Cech, 1986; Cavalier-Smith, 1991; Eickbush, 1994; Zimmerly et al., 1995; recently reviewed by Koonin (2006)]. According to this hypothesis, group II introns originated in bacteria and invaded the nucleus of a primitive eukaryote, possibly from the alphaproteobacterial progenitor of the mitochondria; they were then fragmented to form the spliceosome (Cavalier-Smith, 1991; Stoltzfus, 1999). It has recently been suggested that the 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c spread of group II introns and their decay into spliceosomal introns created a strong selective pressure, necessitating compartmentalization of the nucleus and cytoplasm (Martin & Koonin, 2006). In addition to their possible evolutionary significance in eukaryogenesis, mobile group II introns are of interest because they have been used to develop a new type of gene-targetting vector (Lambowitz & Zimmerly, 2004; Lambowitz et al., 2005), with potential applications in biotechnology and medicine. A typical group II intron (Fig. 1) consists of a highly structured RNA with six distinct double-helical domains (DI to DVI) and an internally encoded (ORF within DIV) reverse transcriptase (RT) maturase. The IEP is required for in vivo folding of the intron RNA into a catalytically active structure (Michel & Ferat, 1995). Unlike organellar introns, most bacterial group II introns have an IEP (Lambowtiz & Zimmerly, 2004). The IEPs of these introns (Fig. 1) have four conserved domains: an N-terminal RT domain; domain X – a putative RNA-binding domain associated with RNA splicing or maturase activity-, a C-terminal DNAbinding domain (D); and a DNA-endonuclease domain (En). More than half of the bacterial group II IEPs annotated to date lack the En domain, and many also lack the D domain (Martı́nez-Abarca & Toro, 2000b; Zimmerly et al., 2001; Belfort et al., 2002; Toro, 2003; Lambowitz & Zimmerly, 2004). It has been suggested that all current group II introns FEMS Microbiol Rev 31 (2007) 342–358 343 Bacterial group II introns Fig. 1. Generic secondary structure of group II intron RNA and domain structure of the intron-encoded proteins (IEP). The model shows the ribozyme domains (DI–DVI), the EBS/IBS elements, the bulged adenosine within DVI, and the sequences involved in tertiary interactions (Greek letters) operating in splicing as described in the text. Major differences between IIA and IIB introns are indicated in brackets. Note that IIC introns lack the EBS2 element. Boxed are the schematics (drawn to scale) of the Ll.LtrB (IIA), RmInt1 (IIB) and GBSi1 (IIC) IEPs encoded within intron DIV, with indication of the predicted reverse transcriptase (RT), maturase (X), variable DNA-binding region (D) and conserved DNA endonuclease (En) domains. Numbers above the RT domain identify conserved amino acid sequence blocks characteristic of RTs. The question mark denotes a functionally uncharacterized C-terminal extension of 20 amino acid residues in the RmInt1 IEP. are descended from an RT-encoding group II intron in bacteria (Toor et al., 2001). Three main phylogenetic subclasses (IIA, IIB and IIC) of group II introns have been described on the basis of IEP and conserved intron RNA structures (Michel et al., 1989; Toor et al., 2001; Zimmerly et al., 2001; Ferat et al., 2003; Toro, 2003), with the IIC subclass unique to bacteria. Bacterial group II introns differ markedly from organellar introns in not generally being located in conserved genes and in being mostly associated with mobile DNAs or located within intergenic regions (Dai & Zimmerly, 2002; Toro, 2003). Group II introns, which may predate prokaryotes (Koonin, 2006), are still present in many bacteria. So, what role do group II introns play in the bacterial genome? Although they are generally seen as ‘selfish’ elements, studies of group II introns are continually discovering FEMS Microbiol Rev 31 (2007) 342–358 surprising features such as the association of some lineages with specific genetic signals or with the host replication machinery for mobility. Bacterial group II intron splicing Group II introns splice by means of a lariat intermediate, in a mechanism resembling that of spliceosomal introns. Intron excision from eukaryotic mRNA requires a set of host-cell snRNA molecules and proteins, whereas the catalytic functions involved in group II intron splicing reside within the RNA molecule itself. Nevertheless, the folding of the intron RNA into the catalytically active structure in vivo depends on the maturase activity of the IEP encoded within intron DIV, outside the catalytic core of the ribozyme. 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 344 Self-splicing mechanisms and products Several bacterial group II introns have been found to selfsplice in the absence of the IEP when incubated under nonphysiological high-salt conditions in vitro: Cal.x1 from Calothrix (Ferat & Michel, 1993); IntB from Escherichia coli (Ferat et al., 1994); Ll.LtrB from Lactococcus (Matsuura et al., 1997); AV1 (Avi.groEL) and AV2 from Azotobacter (Adamidi et al., 2003; Ferat et al., 2003; Kosaraju et al., 2005); GBSi1 from Streptococcus (Granlund et al., 2001); B.a.I1, B.a.I2 and B.hI1 from Bacillus (Robart et al., 2004; Tourasse et al., 2005; Toor et al., 2006); Gk.Int1 from Geobacillus (Chee & Takami, 2005); and RmInt1 from Sinorhizobium (Costa et al., 2006a). However, bacterial group II ribozymes are generally only moderately reactive and display slow kinetics in vitro. The models chosen for investigations of group II intron catalysis and splicing mechanisms have therefore tended to be the more efficient large mitochondrial group II ribozymes of the two major group II intron subclasses, IIA and IIB, from eukaryotic organisms (Costa et al., 1997, 2000; Su et al., 2001). The results of these studies suggest that the pathway generally used for group II intron splicing involves two sequential transesterification reactions (Fig. 2a). The first involves a nucleophilic attack on the 5 0 splice junction by the 2 0 -OH of a bulged nucleotide, typically an adenosine residue (bulging A) located in DVI near the 3 0 end of the intron, releasing the 5 0 exon and generating an intron-3 0 exon branched intermediate. The second reaction involves a nucleophilic attack on the 3 0 splice site by the free 3 0 -OH of the last nucleotide of the 5 0 -exon, yielding the ligated exons and the intron RNA lariat with a 2 0 –5 0 phosphodiester bond as the major splicing products. Alternatively, the splicing reaction may be initiated by hydrolysis (Fig. 2b), followed by the second step, resulting in the excision of the intron as a linear molecule rather than a lariat. Most of the self-splicing Nicolás Toro et al. bacterial group II introns identified to date display both excision pathways in vitro. The exceptions are some bacterial class C introns such as GBSi1, identified in some Streptococcus B isolates, which splice solely via a linear intermediate molecule (Granlund et al., 2001). In addition, it has been reported that yeast intron aI5g can be also excised as a true circular form in vitro (Fig. 2c). The circle seems to result from the formation of a 2 0 –5 0 phosphodiester bond between the last and first residues of the intron, and it has been proposed that this reaction requires the 3 0 exon to have been released first from precursor molecules by a trans-splicing reaction triggered by 5 0 exon molecules previously generated by the so-called spliced exon reopening (SER) reaction. Subsequent to trans splicing, the 2 0 -OH of the terminal intron residue will attack the 5 0 splice site, releasing the 5 0 exon and an intron circle (Murray et al., 2001). Group II ribozyme positioning on the precursor mRNA substrate and cleavage site fidelity are determined principally by base-pairing interactions between short-sequence elements located in loop structures of intron DI (exonbinding sites, EBS) and sequence stretches flanking the splice site (intron-binding sites, IBS). EBS1 and EBS2, each typically five to seven nucleotides long, recognize their partner sequences – IBS1 and IBS2 – at the 3 0 end of the 5 0 exon. Interestingly, IIC introns are self-splicing-competent, despite lacking the EBS2–IBS2 pairing (Granlund et al., 2001; Toor et al., 2006). The recognition of the 3 0 intron– exon junction involves two additional base-pair interactions. The first is a single tertiary base-pair interaction between the last position of the intron (g 0 ) and another intron nucleotide (g) between DII and DIII. The second involves the first one or several specific exon and intron nucleotides. The identity of this IBS for the 3 0 exon can be used to differentiate between the two major subclasses of group II introns. In IIA introns, the sequence immediately upstream from EBS1 (d sequence) base-pairs with one to Fig. 2. Mechanisms of group II intron splicing through (a) branching, (b) hydrolysis and (c) circles formation. 5 0 and 3 0 exons are indicated with black and grey boxes, respectively, whereas the line represents the intron RNA. Double arrows indicate the reversibility of some reactions. 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c FEMS Microbiol Rev 31 (2007) 342–358 345 Bacterial group II introns three nucleotide residues (d 0 position) of the 3 0 exon for intron splicing (3 nt in the Lactococcus lactis Ll.LtrB intron). In contrast, most IIB introns display no significant signs of d–d 0 complementarity. These group II ribozymes recognize the first nucleotide in the 3 0 exon, referred to as IBS3 rather than d 0 , by canonical base-pairing with an intron nucleotide, referred to as EBS3 rather than d, located in the so-called ‘coordination loop’ of DI (Costa et al., 2000; De Lencastre et al., 2005). The d position is involved in a new tertiary interaction with another residue (denoted d 0 ), also located in the coordination loop (Costa et al. 2000). These nucleotides in the loop are involved in aligning the two exons for the second step of splicing (d–d 0 and IBS3–EBS3 interactions). The three-dimensional arrangement of the active site of group II ribozymes can be further modelled by a complex network of tertiary interactions involving the various RNA domains (Michel & Ferat, 1995; Costa et al., 2000; Swisher et al., 2001 and references therein). Thus, 5 0 exon binding through DI interactions and the conserved DV are involved in the formation of the minimal catalytic core of the ribozyme, with DII and DIII also contributing to RNA folding and catalytic efficiency (Fedorova et al., 2003; for a review see Pyle & Lambowitz, 2006). The branch site of DVI, both exons, the essential catalytic regions of DV (bulge and AGC triad regions), conserved nucleotides in the joining region (J) between DII and DIII (J2/3), and the e–e 0 substructure are all in close proximity before the first splicing step (De Lencastre et al., 2005). It has recently been shown that the branch site binds, with specific polarity, to the coordination loop substructure, thereby placing the nucleophilic adenosine in a position in which it can react with the 5 0 splice site (Hamill & Pyle, 2006). The docking site for DV, a tetraloop receptor motif composed of the z–z and k–k 0 interactions, is separated from the coordination loop by a short helical stem (reviewed by Lehmann & Schmidt, 2003). The choice of branch site is strictly controlled by a molecular caliper, which measures the distance between the base of DV and the nucleophilic 2 0 -hydroxyl in DVI (Chu et al., 2001). Thus, DV and the branch-site receptors function together to position the nucleophile correctly, so as to promote catalysis (Hamill & Pyle, 2006). Alternative EBS-IBS interactions, shifted from the expected EBS-IBS pairings, have also been described in the bacterial group IIB introns B.a.I2 and RmInt1 (Robart et al., 2004; Costa et al., 2006a, b). As shown in vitro, the 3 0 splice site of B.a.I2 is flexible but depends on potential g–g 0 and IBS3–EBS3 pairings downstream from the wild-type site. Alternative splice-site usage either creates frame shifts within the exon-encoded ORF or joins the two exonic ORFs in frame, ultimately affecting the output of host mRNA translation in vivo (Robart et al., 2004). On the other hand, it has been suggested that an RmInt1 ribozyme reaction at a surrogate IBS sequence located 3 0 to the intron might be FEMS Microbiol Rev 31 (2007) 342–358 responsible for the generation of unconventional self-splicing products along with the ligated exons and the excised intron. These unconventional self-splicing products include a 3 0 exon truncated at its 5 0 end, truncated variants of the linear and lariat forms of the intron –3 0 exon intermediate, and putative circular molecules derived from this intermediate (Costa et al., 2006a). The self-splicing profiles of B.a.I2 and RmInt1 may therefore highlight mechanisms underlying the control of host-gene expression by bacterial group II introns. Protein-assisted splicing in vivo Only a minority of bacterial group II introns have been unequivocally shown to be functional or to undergo splicing in vivo: the L. lactis Ll.LtrB (Mills et al., 1996; Shearman et al., 1996); the Sinorhizobium meliloti RmInt1 (Martı́nezAbarca et al., 1998); a Clostridium difficile intron (Roberts et al., 2001); the Bacillus anthracis B.a.I2; and the Trichodesmium erythraeum RIR-3 (Robart et al., 2004; Meng et al., 2005) introns. The best characterized of these introns to date are Ll.LtrB and RmInt1, as representatives of subclasses IIA and IIB, respectively. Initial sequencing and genetic analysis have shown that the splicing of these group II introns in vivo requires the IEP encoded within intron DIV (Mills et al., 1996; Shearman et al., 1996; Martı́nez-Abarca et al., 1998; Muñoz-Adelantado et al., 2003). Bacterial group II intronencoded maturases function more efficiently when expressed in cis, within the same precursor transcript encoding the ribozyme core. However, trans-encoded IEPs can promote the splicing of genomic copies of nonfunctional ORF-less introns, a property that has evolutionary and biotechnological implications (Cui et al., 2004; Meng et al., 2005; Nisa-Martı́nez et al., 2007). Conserved RT and X domains are common to both Ll.LtrB and RmInt1 IEPs, being the X domains associated with the splicing-promoting (maturase) activity of the proteins. However, most of the available information on the mechanisms involved in IEP-assisted bacterial group II intron splicing is based on the lactoccocal intron (Matsuura et al., 1997, 2001; Saldanha et al., 1999; Wank et al., 1999; Singh et al., 2002; Cui et al., 2004). Ll.LtrB and its encoded protein, LtrA, are efficiently expressed and functional in E. coli, facilitating the detailed biochemical characterization of both the intron RNA and the IEP (Matsuura et al., 1997). The purified LtrA protein binds to the unspliced precursor RNA via a rapid bimolecular reaction, followed by a slower unimolecular step involving a change in RNA conformation that ultimately promotes Ll.LtrB splicing when assayed in vitro under low salt concentrations shown to inhibit the self-splicing reaction (Saldanha et al., 1999). In certain specific conditions, LtrA also binds nonspecifically to other group II introns and RNA molecules. However, this 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 346 nonspecific binding is inefficient for RNA splicing, and LtrA is therefore considered to be an intron-specific splicing factor (Saldanha et al., 1999). This specificity is based on protein recognition of a unique structural feature in the intron subdomain DIVa as a primary high-affinity binding site; this site contains the Shine–Dalgarno sequence and the initiation codon of the LtrA ORF (Wank et al., 1999; Singh et al., 2002). This binding, which also autoregulates LtrA translation, requires domain X and parts of the RT domain of the IEP, and extends further into the 5 0 and 3 0 conserved regions of the catalytic core and DIV through weaker secondary contacts that are sufficient to support residual, but inefficient, splicing in vivo when the high-affinity binding site in DIVa is deleted (Wank et al., 1999; Matsuura et al., 2001; Singh et al., 2002; Cui et al., 2004). Further chemical probing and RNA footprinting experiments have demonstrated that this array of protein–RNA contacts promotes the generation of a novel set of conserved longrange tertiary interactions in the Ll.LtrB RNA, which are not stably formed in the absence of the maturase under low-salt conditions, but which are required to model the active conformation of the ribozyme core in vivo (Matsuura et al., 2001). The major protein-assisted splicing products of Ll.LtrB are the conventional linear and lariat forms of the intron, along with exons ligated in frame (Saldanha et al., 1999). The maturase activity of the RmInt1 IEP has not been biochemically characterized. However, recent studies have demonstrated that RmInt1 splices in vivo leading to the formation of the intron lariat form, along with circular molecules in which the first and last intron residues are ligated (Molina-Sánchez et al., 2006). Genetic analysis of the conserved X domain of the RmInt1 IEP not only confirmed the requirement of the maturase for intron splicing in vivo but also supported an additional role for the IEP in controlling the balance between the two intron excision pathways (Molina-Sánchez et al., 2006). Thus, in vivo the presence of the IEP seems to ensure cleavage at the correct 3 0 splice site or at position 11 in the 3 0 exon before intron RNA circle formation. Circular RNAs had previously only been described as splicing products of group II introns in eukaryotic cells, in which they were thought to be linked to cellular senescence (Murray et al., 2001 and references therein). These novel findings therefore open up new perspectives for investigation of the functional diversity of bacterial group II ribozymes and their biological significance. As described above, analyses of the splicing reaction, using the purified Ll.LtrB-encoded maturase and RNA, predict a remarkable self-sufficiency of bacterial group II introns in the promotion of their own splicing. However, it has also been shown that the efficiency of the in vivo splicing reaction of the group II intron RmInt1 decreases when the intron is expressed in a genetic background other than that of its natural bacterial host, S. meliloti (Martı́nez-Abarca 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c Nicolás Toro et al. et al., 1998, 1999). Several nuclear gene products have been identified as RNA chaperones involved in the splicing of group II introns from yeast, algae or higher plants (Lambowitz & Perlman, 1990; Lambowitz et al., 1999; Lehmann & Schmidt, 2003; Ostheimer et al., 2003). It remains unclear whether this is also the case for bacterial group II introns, and to what extent splicing efficiency depends on host functions. Bacterial group II intron mobility mechanisms Group II introns act both as large catalytic RNAs and as sitespecific retroelements (more recently reviewed by Lambowitz & Zimmerly, 2004; Pyle & Lambowitz, 2006). After splicing, the intron RNA lariat and the IEP remain associated, forming a ribonucleoprotein particle (RNP) promoting intron insertion into DNA target sites identical to the splice site through retrohoming, which is the principal and most efficient group II intron RNA-based mobility pathway (Fig. 3). Retrohoming has been unequivocally demonstrated for the lactococcal Ll.LtrB and rhizobial RmInt1 introns, using engineered ‘twintron’ shuttle constructs in which the bacterial group II intron was interrupted by a DNA sequence encoding a splicing-competent group I intron (tdI). Consistent with a mobility pathway involving an RNA rather than a DNA intermediate, the group I intron was found to be absent from the homing products generated in plasmidborne target sites as a result of tdI splicing in the precursor twintron RNA (Cousineau et al., 1998; Martı́nez-Abarca et al., 2004). Group II introns can also insert, at a lower frequency, into noncognate sequences resembling the homing site – a process known as ectopic transposition (retrotransposition) – which contributes to group II intron dispersal in nature (Cousineau et al., 2000; Martı́nez-Abarca & Toro, 2000a; Muñoz et al., 2001; Dai & Zimmerly, 2002; Ichiyanagi et al., 2002). Of all the bacterial group II introns characterized as splicing-competent in vivo, only the L. lactis Ll.LtrB and S. meliloti RmInt1 introns have been shown to be efficient mobile genetic elements, leading to the use of these elements as model experimental systems for deciphering the bacterial group II intron mobility pathways and mechanisms (Lambowitz & Zimmerly, 2004 and references therein). Retrohoming through target DNA-primed reverse transcription (TPRT) The retrohoming pathway of bacterial group II introns was first investigated for Ll.LtrB, by plasmid-based genetic assays in vivo, in E. coli or L. lactis. It was then further dissected by characterization of the biochemical activities of the intronencoded RNPs reconstituted with a purified LtrA expressed in E. coli and an Ll.LtrB RNA lariat excised from a precursor FEMS Microbiol Rev 31 (2007) 342–358 347 Bacterial group II introns Fig. 3. Retrohoming pathways of bacterial group II introns. RNP complexes recognize the target site on double- or single-stranded DNA primarily by the EBS-IBS (d–d 0 in subgroup IIA introns) pairings, whereas the IEP also binds specifically to key nucleotide residues at distal 5 0 - and 3 0 -exon regions indicated by dotted lines in the diagram (a). The En-dependent retrohoming mechanism (TPRT) used by the L. lactis Ll.LtrB intron involves DNA unwinding assisted by the domain D of the IEP, reverse splicing of the intron RNA into the top strand of the target DNA, and antisense strand cleavage catalyzed by the En domain. The cleaved antisense strand is then used by the IEP (RT activity) as primer to reverse transcribe the reverse-spliced intron RNA (b). The major pathway used for retrohoming by the Sinorhizobium meliloti RmInt1 intron (c) is linked to DNA replication involving invasion of the DNA strand that serves as a template for lagging strand synthesis. The reverse transcription of the inserted intron RNA could be primed by RNA primers synthesised by the primase or partial polymerization of the Okazaki fragments by DNA polymerase III. A second possible minor retrohoming pathway (c) may occur independently of the target-site strand, and involve alternative priming mechanisms. A cDNA copy of the intron is completely integrated in the new location by the host repair functions. transcript in vitro (Matsuura et al., 1997; Cousineau et al., 1998; Saldanha et al., 1999; Aizawa et al., 2003). Neither the Ll.LtrB RNA itself nor the purified LtrA protein displays reverse splicing or endonuclease activities of the RNP particle (Saldanha et al., 1999). Group II intron RNPs initially bind DNA nonspecifically and then search for DNA target sites (Aizawa et al., 2003). Mutagenesis experiments, DNA footprinting, and modification interference FEMS Microbiol Rev 31 (2007) 342–358 mapping have shown that Ll.LtrB-encoded RNPs recognize a relatively long DNA target site (Fig. 3a) extending from position 25 to 19 relative to the intron insertion site (Guo et al., 2000; Mohr et al., 2000; Karberg et al., 2001; Singh & Lambowitz, 2001). Retrohoming then occurs by a target DNA-primed reverse transcription mechanism (TPRT) involving several sequential reactions (Fig. 3b). The RNA component of the RNP cleaves the sense-strand 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 348 precisely at the exon junction in the double-stranded DNA recipient allele, by a reverse splicing reaction that integrates the intron RNA at the target site, whereas LtrA uses its endonuclease domain (En) to cleave the antisense-strand at position 19 relative to the insertion site. The 3 0 end of the cleaved antisense strand is then targetted to the active site of the reverse transcriptase, where it is used as a primer for reverse transcription of the inserted intron RNA by the RT domain of LtrA, generating a cDNA copy of the intron that is subsequently integrated into its new location by homologous recombination-independent repair mechanisms (Cousineau et al., 1998). The target sequence is recognized principally by the RNA component of the RNP complex, through EBS2-IBS1, EBS1EBS2 and d-d 0 base-pairing interactions, which, for Ll.LtrB, involve positions 12 to 8, 6 to 1 and 11 to 13, respectively. The nucleotide residues within the IBS and d 0 regions contribute to DNA target-site recognition to different extents, but mutations in these nucleotides have an overall inhibitory effect on the reverse splicing activity of the reconstituted RNPs and Ll.LtrB homing frequencies in vivo; nonetheless, both can be restored to wild-type levels by introducing the corresponding complementary mutations into the intron RNA (Guo et al., 2000; Mohr et al., 2000). The variable DNA-binding domain (D) is involved in the recognition of a subset of key nucleotide residues (positions T-23, G-21 and A-20) in the distal 5 0 exon region via major groove interactions bolstered by phosphate-backbone contacts (Singh & Lambowitz, 2001). LtrA may also be involved in the recognition of distal nucleotides of IBS2, probably on the complementary strand of the target site, as the restoration of EBS2-IBS2 base-pairing at these positions does not result in the recovery of wild-type reverse splicing levels or homing frequencies. Mutations at these key positions in the distal 5 0 exon region block reverse splicing of the intron RNA into double-stranded, but not into single-stranded DNA, substrates. This suggests that the interaction of these positions with LtrA triggers local DNA unwinding, thereby placing the intron RNA in a position to base-pair with the IBS and d 0 sequences for reverse splicing into doublestranded DNA target sites (Mohr et al., 2000). Consistent with LtrA interaction and base-pairing of the intron RNA being concerted rather than sequential to promote the efficient unwinding of DNA at cognate sites, KMnO4 modification experiments revealed that RNPs reconstituted with intron RNAs with EBS/d mutations preventing basepairing with the DNA target site do not trigger DNA unwinding. Recent data indicate that before the reverse splicing of intron RNA, RNPs bend the target DNA by maintaining initial contacts with the 5 0 exon while engaging in 3 0 exon interactions, gradually bringing the scissile phosphate into position for bottom-strand cleavage (Noah et al., 2006). Second-strand cleavage, catalyzed by the 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c Nicolás Toro et al. conserved En domain of LtrA, occurs after a time lag, and requires additional interactions of the protein with fixed positions in the 3 0 exon, the most critical of which being position T15, which lies within a single-stranded region after initial DNA unwinding (Mohr et al., 2000; Singh & Lambowitz, 2001). En-independent retrohoming is linked to DNA replication in bacteria TPRT-based retrohoming requires the generation of a primer for reverse transcription of the intron RNA via second-strand cleavage, catalyzed by the conserved En domain of the IEP. Phylogenetic analysis based on the sequence and domain structure of group II intron IEPs, of both bacterial and eukaryotic origin, assigned a putative mitochondrial lineage to the Ll.LtrB intron, with most bacterial introns located elsewhere in the tree. Furthermore, the C-terminal En domain is found in only c. 40% of the bacterial group II IEPs currently annotated in databases (Martı́nez-Abarca & Toro, 2000b; Zimmerly et al., 2001; Dai & Zimmerly, 2002). The S. meliloti RmInt1 intron is the best characterized of the group II introns clustering on the bacterial branch. RmInt1 encodes a protein with conserved RT and maturase domains and a functionally uncharacterized C-terminal extension of 20 amino acid residues that appears to be unrelated to the DNA-binding (D) domain of LtrA (Martı́nez-Abarca et al., 2000; San Filippo & Lambowitz, 2002). Despite the absence of recognizable D and En domains, RmInt1 retrohomes very efficiently, with 100% of the intron-containing strains undergoing insertion events at plasmid-borne target sites (homing frequency), and 20–45% of target copies within a single cell invaded by the intron (homing efficiency) (Martı́nez-Abarca et al., 2000; JiménezZurdo et al., 2003). However, experiments with engineered En mutant Ll.LtrB derivatives and target sites have shown that Ll.LtrB can retrohome, albeit much less efficiently, without second-strand cleavage (D’Souza & Zhong, 2002; Zhong & Lambowitz, 2003). These findings suggest that there are alternative pathways operating in the mobility of En bacterial group II introns linked, as described below, to DNA replication (Muñoz-Adelantado et al., 2003; Zhong & Lambowitz, 2003; Martı́nez-Abarca et al., 2004). RmInt1 mobility (Fig. 3a and c) depends on the intron RNA and the IEP, with the RmInt1-encoded RNP complex recognizing a DNA target site extending 20 nt into the 5 0 exon and 5 nt into the 3 0 exon, as inferred from genetic characterization of the intron and target site coupled to homing assays in vivo (Jiménez-Zurdo et al., 2003; MuñozAdelantado et al., 2003). Subsequent mutational analysis demonstrated that this site is recognized primarily by the intron RNA, through the characteristic pairing pattern of IIB intron splicing: EBS2-IBS2, EBS1-IBS1 and EBS3-IBS3, FEMS Microbiol Rev 31 (2007) 342–358 349 Bacterial group II introns involving positions 13 to 9, 7 to 1 and 11 in the RmInt1 target site, respectively (Jiménez-Zurdo et al., 2003). Unlike Ll.LtrB, RmInt1 has less stringent requirements for recognition of the distal 5 0 and 3 0 exon regions, with only single nucleotide positions (T-15 and G14) required for wild-type retrohoming (Jiménez-Zurdo et al., 2003). These positions were initially thought to be recognized by the RmInt1 IEP, but recent findings suggest that there may be an alternative intron–exon pairing between an intron segment overlapping with the EBS2 exon-binding site and a 5 0 exon site located just distal of IBS2 relative to the splice junction (IBS2; Costa et al., 2006a, b). Thus, pairing of the first A of EBS2 with the last T of IBS2 at position 15 of the intron target site might render the last T of IBS2 critical for homing. The removal of nucleotides 20 to 16 from the RmInt1 target site dramatically decreases homing efficiency (Jiménez-Zurdo et al., 2003). The presumably small number of IEP–target-site interactions raises questions about the identity of the IEP domains involved and mechanistic implications for DNA target-site recognition. RmInt1-derived RNP particles have RT activity and promote the reverse splicing of intron RNA into both single-stranded and double-stranded DNA substrates, through RNA-mediated cleavage at the intron insertion site. However, they cannot carry out the En-dependent secondstrand cleavage in double-stranded DNA substrates (Muñoz-Adelantado et al., 2003). The efficiency of the reverse splicing reaction is markedly lower with long double-stranded DNA substrates. RmInt1-RNPs have higher levels of RT activity on exogenous synthetic substrates than on the endogenous intron RNA template, the reverse transcription of which in vitro has no obvious priming bias, suggesting that positioning of the reverse transcriptase on the precursor RNA facilitates initiation of the RT reaction various distances downstream from the intron (MuñozAdelantado et al., 2003). According to this biochemical analysis, the RmInt1-encoded RNP complex should recognize the key nucleotide residues in both single-stranded and double-stranded DNA target sites. However, the limited contact with the IEP is insufficient for the promotion of local DNA unwinding, suggesting that the preferred retrohoming pathway of RmInt1 involves reverse splicing into a transiently single-stranded DNA target site and that there may be alternative, uncharacterized priming mechanisms for reverse transcription of the intron RNA (MuñozAdelantado et al., 2003). Homing assays in vivo, using recipient plasmids in which the RmInt1 target site was inserted in both orientations relative to the origin of replication of the plasmid, have provided evidence for two distinct retrohoming pathways for RmInt1 (Fig. 3c). The main mechanism of mobility is favoured by cell division and involves the reverse splicing of the intron RNA into single-stranded DNA at DNA replicaFEMS Microbiol Rev 31 (2007) 342–358 tion forks, with a bias towards the DNA strand that serves as a template for lagging strand synthesis (Martı́nez-Abarca et al., 2004). This preference suggests that the intron should be inserted once the replication fork has passed the DNA target site, avoiding the potentially disruptive passage of the DNA polymerase complex through the region occupied by the RNP. Reverse transcription of the intron RNA is then primed by the nascent lagging strand, using either the RNA primer synthesized by the primase or partially polymerized Okazaki fragments. There is a second possible but minor retrohoming pathway (Fig. 3c), independent of DNA replication, that may involve reverse splicing into either doublestranded or transiently single-stranded DNA target sites and alternative priming mechanisms, such as random nonspecific DNA nicks (Schäfer et al., 2003), nascent leading strand (Zhong & Lambowitz, 2003) or de novo initiation priming (Wang & Lambowitz, 1993). The blocking of Ll.LtrBmediated second-strand cleavage by mutations in the En LtrA domain or substitutions of key nucleotide residues in the 3 0 exon revealed a minor retrohoming pathway for the lactococcal intron, dependent on DNA replication, with preferential use of the nascent leading strand to prime reverse transcription – a bias opposite to that for RmInt1 retrohoming (Zhong & Lambowitz, 2003). Other naturally occurring En group II introns include those belonging to the IIC subgroup of bacterial introns. These introns are often found inserted at target sites with recognizable IBS1 but not IBS2 sequences. The IBS2 sequence is replaced by a palindromic Rho-independent transcription terminator motif or similar structures (Granlund et al., 2001; Centron & Roy, 2002; Dai & Zimmerly, 2002). Recent data suggest that exon recognition by IIC introns rely on IBS1 and IBS3 pairings, and putatively on a new nonpairing interaction with a stem-loop (Toor et al., 2006). Dependence of retrohoming on host functions Group II introns are genetic elements with molecular features allowing them to maintain themselves and to spread in a genome. However, their mobility depends, to a variable extent, on host genetic background, suggesting that retrohoming requires the recruitment of host functions for its successful completion (Karberg et al., 2001; Toro et al., 2003). In addition to requiring the replication machinery during key initial stages, the retrohoming pathway of En bacterial group II introns requires, at later stages, repair functions poorly characterized for both endonucleasedependent and -independent retrohoming. A recent study of Ll.LtrB homing efficiency in various E. coli mutants with defective DNA/RNA repair functions led to the construction of a retrohoming completion model including exonucleases (Recj, MutD and PolI) and RNAses (RNase H) for DNA 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 350 resection and removal of the intron RNA template after first-strand cDNA synthesis, together with complexes of DNA polymerases and repair polymerases (PolII, PolIII, PolIV and PolV) for synthesis and proofreading of the second strand of the intron cDNA (Smith et al., 2005). However, further functional characterization is required to determine unambiguously the role of these genes and proteins in the retrohoming pathway. Ectopic transposition Bacterial group II introns can also retrotranspose, at low frequency (typically 104 to 105), to nonallelic sites by means of an RNA intermediate (Cousineau et al., 2000; Martı́nez-Abarca & Toro, 2000a; Ichiyanagi et al., 2002). However, retrotransposition events were first demonstrated for eukaryotic mitochondrial group II introns, the RNPs of which were shown to promote reverse splicing of the intron RNA into noncognate DNA target sites (Mueller et al., 1993; Yang et al., 1998; Dickson et al., 2001). RNA-based retrotransposition was first investigated in bacteria using an Ll.LtrB twintron variant carrying the selfsplicing group I td intron and a kanamycin resistance gene for the selection of mobility events (Cousineau et al., 2000). In contrast to the situation described for mitochondrial group II introns, these experiments suggested that there may be a major En-independent and RecA-dependent Ll.LtrB retrotransposition pathway in L. lactis, probably involving reverse splicing of the intron RNA into cellular RNA rather than into DNA targets (Cousineau et al., 2000). However, the results of other studies are not consistent with this RNAtargetted mechanism as the preferred general pathway for the retrotransposition of bacterial group II introns: (1) the S. meliloti RmInt1 intron displays RecA-independent insertion into an ectopic site within the oxi1 gene (Martı́nezAbarca & Toro, 2000a); (2) many bacterial group II introns are located in nontranscribed intergenic regions (Martı́nezAbarca & Toro, 2000a; Dai & Zimmerly, 2002); (3) for the group II introns analysed to date, the IEP has been shown to have an affinity for DNA, rather than for RNA. Ll.LtrB retrotransposition was analysed further, using improved twintron donor constructs incorporating the kanamycin resistance gene as a retrotransposition indicator gene (RIG) inserted into intron DIV and interrupted by the td intron. Retrotransposition events can thus be reliably detected on the basis of kanamycin resistance once the td intron has been spliced out of the precursor RNA mobility intermediate (Ichiyanagi et al., 2002). The pattern of Ll.LtrB spread within the L. lactis genome, as shown by RIG analysis, is consistent with intron retrotransposition into double-stranded or single-stranded DNA targets through a homologous recombination-independent mechanism, as described for the mitochondrial and bacterial RmInt1 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c Nicolás Toro et al. introns. Insertion events were biased toward the use of the lagging strand as a template. This suggests a possible major En-independent retrotransposition pathway involving reverse splicing of the intron RNA into transiently singlestranded DNA target sites at replication forks, and the use of the nascent lagging strand to prime reverse transcription of the inserted intron RNA, a mechanism similar to the retrohoming mechanism of En bacterial group II introns (Ichiyanagi et al., 2002). An analysis of ectopic insertion sites for bacterial group II introns revealed that bona fide IBS1 sequences were conserved, whereas the IBS2 element and key nucleotide positions in the 5 0 and 3 0 exon sequences recognized by the protein component of the RNP in TPRTbased retrohoming events were not (Cousineau et al., 2000; Martı́nez-Abarca & Toro, 2000a, b; Dai & Zimmerly, 2002; Ichiyanagi et al., 2002). However, these target sites retain essential recognition elements for insertion events independent of second-strand cleavage. Nevertheless, recent studies have reported a preferential En-dependent Ll.LtrB retrotransposition mechanism in E. coli, suggesting that the retrotransposition strategies of bacterial group II introns may be largely influenced by the host (Coros et al., 2005). During the retrotransposition of the lactoccocal group II intron Ll.LtrB in E. coli, insertion occurs preferentially within the Ori and Ter macrodomains of the chromosome. This bipolar distribution of retrotransposition events results from the presence of the IEP (LtrA) at the poles of the cell (Zhao & Lambowitz, 2005; Beauregard et al., 2006). Bacterial group II intron retrotransposition has been shown to occur in natural bacterial populations and is favoured by targets in transmissible genetic elements, such as plasmids, providing further support for this mobility mechanism as the major dissemination strategy of bacterial group II introns in nature (Muñoz et al., 2001; Ichiyanagi et al., 2003). Group II introns as biotechnological tools Derivatives of group II intron Ll.LtrB have been used for gene disruption in various gram-negative (E. coli, Shigella flexneri and Salmonella typhimurium; Karberg et al., 2001) and gram-positive (i.e. Lactococcus lactis, Clostridium perfringens and Staphylococcus aureus; Chen et al., 2005; Yao et al., 2006) bacteria. A number of characteristics render group II introns suitable for use as biotechnological tools: (1) they are mobile elements, integrating with high efficiency into their DNA targets by a homologous recombination-independent process; (2) they recognize the target DNA mainly by base pairing, so target specificity can be changed by modifying the EBS sequences within the intron RNA; (3) they can mobilize foreign genetic information inserted within the intron; and (4) minimal host functions are required to FEMS Microbiol Rev 31 (2007) 342–358 351 Bacterial group II introns support intron mobility and are probably provided by conserved housekeeping proteins. Group II introns were first engineered to increase the frequency and efficiency of mobility (Guo et al., 2000; Nisa-Martı́nez et al., 2007; Plante & Cousineau, 2006). Initial modifications to Ll.LtrB included a deletion in the domain IV loop removing most of the LtrA ORF (DORF intron), with the intact LtrA ORF cloned and expressed downstream from the 3 0 exon. This configuration increased the frequency of mobility to almost 100% (Guo et al., 2000). Smaller DORF Ll.LtrB intron derivatives were recently constructed in which LtrA was inserted upstream from the 5 0 exon; these derivatives also had a higher homing efficiency (Plante & Cousineau, 2006). Similar engineering has been applied to the RmInt1 intron, resulting in a higher retrohoming efficiency than for the wild-type intron, with constructs in which the IEP has been deleted from domain IV and expressed either upstream from the 5 0 exon or downstream from the 3 0 exon. The retrohoming efficiency obtained was also higher (reaching 90%) if the constructs carried short stretches of wild-type 5 0 and 3 0 exons (20/ 15) flanking the DORF intron (Nisa-Martı́nez et al., 2007). The greater mobility of the engineered introns seems to be the result of greater stability of the intron RNA, probably owing to a decrease in susceptibility to degradation by host nucleases (Guo et al., 2000). In these constructs, the IEP promotes splicing of the DORF intron. However, after insertion into a new location, splicing of the DORF intron cannot occur in the absence of the IEP, resulting in disruption of the target gene. Introns targetted to the antisense strand lead to unconditional disruption of the gene. However, if DORF intron derivatives are retargetted to the sense strand, insertion of the intron generates a conditional disruption, as splicing can take place if the IEP is expressed in trans (Frazier et al., 2003; Nisa-Martı́nez et al., 2007). Another type of conditional gene disruption has also been achieved with Ll.LtrB. This conditional mutagenesis is based on the temperature-sensitive splicing displayed by this group II intron. The essential gene hsa of S. aureus was disrupted by an Ll.LtrB-DORF derivative in the sense orientation with respect to gene transcription. It could therefore be removed by RNA splicing, permitting the translation of Hsa. As IEP-assisted splicing is temperaturesensitive, hsa mutants can grow at 32 1C, a temperature at which splicing can take place, but cannot grow at 43 1C, a temperature at which splicing cannot occur (Yao et al., 2006). Thus, it is currently possible to reprogram group II introns to insert into any desired target sequence. Guo et al. (2000) have developed a method based on a random intron library, making it possible to select introns inserting into the selected target DNA. Two plasmids are used in this system. FEMS Microbiol Rev 31 (2007) 342–358 The donor plasmid expresses a set of intron molecules from a DORF-IEP configuration with randomized EBS sequences that also contain a T7 promoter, replacing the ORF within DIV. The second plasmid (the recipient) harbours the selected target sequence upstream from a promoterless tetR gene. A sequence encoding an E. coli rrnB T1 transcription terminator capable of terminating transcription by both the E. coli and T7 RNA polymerases is inserted upstream from the target site, and an rrnB T2 terminator, which terminates transcription by the E. coli RNA polymerase, but not by the T7 RNA polymerase, is inserted between the target site and the tetR gene. A phage T7 TF terminator is inserted downstream from the tetR gene to act as a terminator for the T7 RNA polymerase. Insertion of the intron carrying the T7 promoter into the target site activates expression of the tetR gene. This system can be used to select modified introns from the randomized library that can insert into the chosen gene. These introns can be further modified to improve base-pairing with the target DNA. This system was designed for use in E. coli, but genes from other organisms could be cloned and used in this system to select the intron disrupting the cloned gene most efficiently. This technique has been used to select an intron disrupting the human CCR5 gene (coreceptor for human HIV-1 virus; Guo et al., 2000). The CCR5 gene, carried by a plasmid within embryonic kidney cells, was then disrupted with the selected intron, which was introduced into the human cells as RNPs. The retargetting of bacterial group II introns can be optimized with a computer algorithm, which scans the target sequence, selects the best position for IEP recognition, and then designs primers to modify the intron EBS1, EBS2 and d sequences accordingly (Perutka et al., 2004). The IBS1 and IBS2 sequences in the 5 0 exon of the donor plasmid are also modified to make them complementary to the retargetted EBS sequences for efficient RNA splicing. The IEP recognizes few enough positions with a sufficient flexibility for the identification of several target sites in any gene. Introns integrated into the target DNA can be selected by colony PCR, or using a selectable marker, such as an antibiotic resistance gene. The selectable marker strategy is possible because group II introns can carry foreign DNA sequences within their RNA core. This heterologous DNA is carried in DIV of the group II intron, after removal of the IEP sequence. The presence of this foreign DNA generally has a negative effect on homing, but the modified introns nonetheless display detectable homing (Guo et al., 2000; Ichiyanagi et al., 2002; Frazier et al., 2003; Zhong et al., 2003). The main concern with conventional antibiotic resistance genes is that they are expressed from the donor plasmid independently of homing, which can hinder the selection of true homing events. The development of the retrotransposition-activated selectable marker (RAM) system has overcome this problem. The RAM system is based 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 352 on RIG markers, interrupted by a splicing-competent tdI group I intron (Zhong et al., 2003). The selectable marker is thus expressed only if retrohoming occurs, facilitating the monitoring of group II intron dispersal through retrohoming. Almost all resistant, RAM-containing colonies display disruption of a single target. Subsequent modifications, in which the TpR gene was flanked by Flp Recombinase Target sequence (FRT), have adapted the system for multiple gene disruptions (Broach et al., 1982). The inserted FRT-flanked RAM marker can thus be excised by expressing the Flp recombinase, leading to 100% excision. Once the marker has been excised, a second mutagenesis with the same RAM is possible. Engineered group II introns carrying both the RAM-Tp marker and randomized EBSs have been used to generate a library of E. coli mutants (Zhong et al., 2003). Most of the intron insertions were found to be located close to the origin of replication of the chromosome, indicating a bias favouring this region. However, the library was complex enough for the detection by PCR of insertions in less favoured regions. The integrated introns can be isolated and used to obtain single knockouts in any gene (Yao et al., 2005). Bacterial group II intron diversity and evolution Group II introns have been found in c. 25% of sequenced bacterial genomes (Lambowitz & Zimmerly, 2004) and they are present in the Proteobacteria, Cyanobacteria, Acidobacteria, Actinobacteria, Bacteroides and Firmicutes (gram-positive bacteria) groups (Zimmerly group II-intron database, http://www.fp.ucalgary.ca/group2introns). Group II introns were discovered more recently in the Archaea, in which they Nicolás Toro et al. have been found only in two closely related methanogenic species, Methanosarcina mazei and Methanosarcina acetivorans, probably as a result of horizontal transmission from Eubacteria (Dai & Zimmerly, 2003; Rest & Mindell, 2003; Toro, 2003). Group II introns are currently classified into three main phylogenetic subclasses: IIA, IIB and IIC (Michel et al., 1989; Toor et al., 2001, 2002; Zimmerly et al., 2001; Ferat et al., 2003; Toro, 2003). Organellar introns from mitochondria and chloroplasts are most closely related to bacterial IIA and IIB (IIB1, IIB2) introns, whereas IIC introns are missing from Cyanobacteria and there are no close relatives of these introns among organellar introns (Fig. 4). The RNA structures of the group II intron ribozyme and the IEP seem to have coevolved, leading to the ‘retroelement ancestor hypothesis’, which suggests that all extant group II introns descended from an RT-encoding group II intron in bacteria with RNA structural features not conforming to canonical IIA or IIB structures (Robart & Zimmerly, 2005). Two different lineages, the canonical IIA and IIB, then became associated with organelles and the ORF was repeatedly lost, resulting in the formation of numerous ORF-less organellar introns (Toor et al., 2001; Lambowitz & Zimmerly, 2004; Robart & Zimmerly, 2005). ORF-less introns have also been found in Cyanobacteria (Nakamura et al., 2002), Bacillus (van der Auwera et al., 2005; Tourasse et al., 2006) and Archaea (Dai & Zimmerly, 2003), but they seem to be derived from intron-carrying ORFs. Interestingly, truncated group II introns are also present in bacteria (Dai & Zimmerly, 2002; Fernández-López et al., 2005). Some have a 5 0 truncation that could be accounted for by incomplete reverse transcription during TPRT. However, 5 0 truncations account for only a minority of fragmented introns (Robart & Zimmerly, 2005). Introns Fig. 4. Phylogenetic relationships between group II intron ORFs and correspondence with RNA structural class. The consensus phylogenetic unrooted tree shown was obtained using the maximum-parsimony method, and values are for 1000 bootstrap replicates (shown as percentages). The IEP RTs used for the alignment were the same as previously described (Zimmerly et al., 2001; Toro, 2003). The various mitochondrial group II intron-encoded ORFs and their accession numbers are as follows: M.p. atpA,1, M.p. atpA,2, M.p. cob,3, M.p. coxI,1, M.p. ORF732, M.p. coxI,2 and M.p. 18S rRNA geneM68929 are from the liverwort Marchantia polymorpha; O.b.nad1,4-M63034 is from the green plant Oenothera berteriana; S.p. cob,1-X54421, K.l. coxI,1-X57546, S.c. aI1 and S.c. aI2-AJ011856 are from the yeasts Schizosaccharomyces pombe, Kluyveromyces lactis and Saccharomyces cerevisiae, respectively; N.c. coI,1-S07649 and P.a. coI,1-X55026 are from the fungi Neurospora crassa and Podospora anserina, respectively; P.l.LSU,2.-S58504 and P.l.LSU,1-S58503 are from the brown alga Pylaiella littoralis; P.p. ORF544 and P.p. ORF546-AF114794 are from the red alga Porphyra purpurea. The chloroplast group II intron-encoded ORFs are S.o. petD,1-P19593 from the green alga Scenedesmus obliquus and B.m. rbcl,1-X55877 from Bryopsis maxima. The bacterial IEPs are in bold and underlined and named according to Dai & Zimmerly (2002), except for those from the uncultured marine bacterium (UMB), for which we used its accession number (AAL78688). M.a.I1 (gene name MA4626, AE011185) and M.a.I2 (gene name MA3645, AE011073) are from Methanosarcina acetivorans str.C2A, and M.m.I1 is from Methanosarcina mazei Goe1 (gene name MM2683, AE013515). RmInt1 (S.me.I1) is from Sinorhizobium meliloti (Y11597), as are S.me.I3 (AL0603646) and S.me.I4 (AJ496462); E.c.I2 is from Escherichia coli (X77508), as are E.c.I4 (AB024946) and E.c.I5 (AF074613); B.j.I2 (elsewhere B.j.F1) is from Bradyrhizobium japonicum (AF322013), as are B.j.F4/B.j.F5 (AF322012) and B.j.I1 (AF322013); S.ma.I1 is from Serriata marcescens (AF027768); S.p.I1 is from Streptococcus pneumoniae (AF030367); B.h.I1 is from Bacillus halodurans (AP01507); P.p.I1 and P.p.I2 are from Pseudomonas putida (AF101076 and Y18999, respectively); P.a.I1 is from Pseudomonas alcaligenes (AF323437); R.e.I1 is from Ralstonia eutropha (AF261712); C.d.I1 is from Clostridium difficile (X98606); B.me.I1 is from Bacillus megaterium (AB022308); B.a.I1 and B.a.I2 are from Bacillus anthracis (AF065404) and S.f.I1 is from Shigella flexneri (U97489); C.sp.I1 is from Calothrix sp. (X71404); L.l.LtrB (L.l.I1) is from Lactococcus lactis; N.a.I1 and N.a.I2 are from Novosphingobium aromaticivorans (AF079317). 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c FEMS Microbiol Rev 31 (2007) 342–358 353 Bacterial group II introns like RmInt1 tend to be inactivated by fragmentation, with loss of the 3 0 terminus. In some cases, such inactivation arises from the insertion of other mobile elements and further genetic rearrangements (Dai & Zimmerly, 2002; Fernández-López et al., 2005). The fragmentation of group II introns, together with the accumulation of mutations and natural selection may well be responsible for the current status of self-splicing group II introns in bacteria, with the intrinsically large population size and genome simplification (genomic streamlining) also playing important roles (Lynch, 2006). FEMS Microbiol Rev 31 (2007) 342–358 The IEP of full-length bacterial group II introns generally contains both the RT and maturase domains, but different groups of introns can be distinguished on the basis of the presence or absence of the D and/or En domains. The absence of the D and En domains may result from their specific loss in some introns. However, whole intron groups within the IIB class (Fig. 4), such as the IIB3 (also known as the D class, Zimmerly et al., 2001) and IIB5 lineages (also known as the E class, Toro et al., 2002; Lambowitz & Zimmerly, 2004; Robart & Zimmerly, 2005), seem to lack the D and En domains entirely (Toro, 2003). The IIC class of 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 354 Nicolás Toro et al. require further investigation, it is also plausible that the ancestor of these introns was actually a catalytic RNA intron functioning in the absence of the RT protein (Fig. 5), perhaps in extreme environments capable of supporting such catalytic activity (Lambowitz & Zimmerly, 2004). The acquisition of the RT-maturase protein may have enabled group II introns to extend into bacterial populations, in which they survived and spread by inserting into nonessential genes, such as other mobile elements. Their mobility would have been linked to the replication machinery of the host cell, with insertion into singlestranded DNA target sites at a replication fork, using a nascent lagging DNA strand as the primer for reverse transcription. Subsequent acquisition of the D and En domains would then have enabled group II introns to insert efficiently by reverse splicing into double-stranded DNA target sites (Lambowitz & Zimmerly, 2004). This acquisition may have provided group II introns present in the alphaproteobacterium ancestor of the mitochondria, which invaded an archaeal host during eukaryogenesis (Fig. 5), with the features required for the massive invasion of the host-cell genome, leading to the creation of spliceosomal introns (Koonin, 2006). Conclusions and perspectives Fig. 5. Hypothesis for the evolution of group II introns. The model predicts a catalytic group II intron RNA functioning in the absence of an RT and its evolution in bacteria by acquisition of the RT-maturase IEP leading to the ancestor of the three current intron lineages IIA, IIB and IIC (a). IIA and IIB introns present in the alphaproteobacterium ancestor of mitochondria would lead to massive invasion of the archaeal host genome during eukaryogenesis (Koonin, 2006), thus giving rise to spliceosomal introns (b). introns (Fig. 4), characterized by the absence of an EBS2 sequence and a shorter ribozyme DV (Toor et al., 2001; Toro, 2003), also lacks the En domain, but still has a putative D domain (San Filippo & Lambowitz, 2002). It is widely accepted that the ancestral RT probably lacked the D and En domains (Fig. 5) and that their acquisition and loss in some cases may have contributed to the evolution of the various bacterial group II intron lineages. Phylogenetic analyses of IEP and the intron RNA structures (Martı́nez-Abarca & Toro, 2000b; Toor et al., 2001; Zimmerly et al., 2001; Toro, 2003) have shown the IIB and IIA classes to be sister lineages and the IIC class to be the most divergent lineage (Fig. 4). So which of these lineages of group II introns is the oldest? The data currently available provide no final answer to this question, but, according to the retroelement hypothesis, the earliest branching class of introns is most likely to be class IIC. Regardless of the evolutionary relationships between the current subclasses of group II introns in bacteria, which 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c Since the discovery 14 years ago of group II introns in bacteria, information has accumulated about the architectural and functional organization of group II introns as catalytic RNAs, the mechanism of interaction between the IEP and the intron RNA catalytic core leading to RNP assembly, the mechanisms of intron mobility and DNA target site recognition, and the development of these retroelements as new gene-targetting vectors. These advances have provided new insight into the fundamental nature and applications of these ribozymes. Improvements in our understanding of the course of evolution of group II introns in bacteria should also provide clues to the origin of the spliceosome and the possible role of these introns in eukaryogenesis. What role do group II introns play in bacterial genomes? Although some group II introns are inserted into nonessential regions of the genome, others are found in essential genes. The current situation may result from bacterial genome streamlining and population size thus decreasing intron numbers or leading to such genetic elements being entirely absent in some bacterial species. There may also be unknown mechanisms in bacteria controlling intron expression and mobility. Despite extensive debate concerning the way in which introns have played a critical role in eukaryote evolution by inserting into protein-coding genes, very little is known about group II intron evolution in bacteria. Progress in bacterial genome sequencing programs may shed light on FEMS Microbiol Rev 31 (2007) 342–358 355 Bacterial group II introns the evolution of bacterial group II introns, revealing lineages other than the current known subclasses IIA, IIB and IIC, and possibly on group II intron RNAs functioning in the absence of the RT protein. This work may involve sequencing metagenomes from extreme environments. Studies on natural bacterial populations may also improve our understanding of the current contribution of group II introns to bacterial evolution and the possible functional role of these introns. The current view of group II introns as selfish elements maintained throughout prokaryotic evolution may turn out to be a simplistic assumption based on our current lack of knowledge of bacterial group II intron evolution. Future studies on group II introns will focus not only on fundamental evolutionary aspects, but also on the development of gene-targetting vectors. The development of such tools based on other bacterial introns, with specific mechanistic intron mobility features different from those of Ll.LtrB, should make this technology more broadly applicable. The mobility mechanism of these bacterial introns does not depend on recombination. This feature makes these retroelements potential tools for genetic manipulation in higher eukaryotes with low levels of recombination, such as plants and animals. Like the splicing factors identified for group II introns in organelles, bacterial group II introns probably use certain host factors that might contribute to intron RNA folding and mobility. In the next few years, efforts should be made to identify such factors, which might increase the functionality of these bacterial group II introns in various prokaryotic and eukaryotic hosts. Studies on the architectural and functional organization of group II introns will continue to expand our knowledge about the splicing mechanism of these ribozymes, and will also contribute to parallel investigations on related elements, such as spliceosomes and retrotransposons. The excision of group II introns as circles has been demonstrated to occur in vivo for yeast introns such as aI2 and for some plant mitochondrial introns. The recent detection of circular intron molecules in bacteria indicates that this particular mode of excision may be more widespread in nature than initially thought. The splicing mechanism involved in group II intron circle formation, and the biological role of such circles in intron spread remain key issues that we can expect to see resolved in the next few years. Acknowledgements Work at the authors’ laboratory was supported by the Spanish Ministerio de Educación y Ciencia (BIO200502312) and Junta de Andalucı́a (CVI-01522). F.M.G.-R. holds an I3P postdoctoral fellowship (CSIC), and J.I.J.-Z. is a Ramón & Cajal hired scientist. FEMS Microbiol Rev 31 (2007) 342–358 References Adamidi C, Fedorova O & Pyle AM (2003) A group II intron inserted into a bacterial heat-shock operon shows autocatalytic activity and unusual thermostability. Biochemistry 42: 3409–3418. Aizawa Y, Xiang Q, Lambowitz A & Pyle AM (2003) The pathway for DNA recognition and RNA integration by a group II intron retrotransposon. Mol Cell 11: 795–805. Beauregard A, Chalamcharla VR, Lyn Piazza C, Belfort M & Coros CJ (2006) Bipolar localization of the group II intron Ll.LtrB is maintained in Escherichia coli deficient in nucleoid condensation, chromosome partitioning and DNA replication. Mol Microbiol 62: 709–722. Belfort M, Derbyshire V, Parker MM, Cousineau B & Lambowitz AM (2002) Mobile introns. Mobile DNA. 2nd edn (Craig N, Craigie R, Gellert M & Lambowitz AM, eds), pp. 761–783. ASM Press, Washington, DC. Broach JR, Guarascio VR & Jayaram M (1982) Recombination within the yeast 2 mu circle is site-specific. Cell 29: 227–234. Cavalier-Smith T (1991) Intron phylogeny: a new hypothesis. Trends Genet 7: 145–148. Cech TR (1986) The generality of self-splicing RNA: relationship to nuclear mRNA splicing. Cell 44: 207–210. Centron D & Roy PH (2002) Presence of a group II intron in a multiresistant Serratia marcescens strain that harbors three integrons and a novel gene fusion. Antimicrob Agents Chemother 45: 1402–1409. Chee GJ & Takami H (2005) Housekeeping recA gene interrupted by group II intron in the thermophilic Geobacillus kaustophilus. Gene 363: 211–220. Chen Y, McClane BA, Fisher DJ, Rood JI & Gupta P (2005) Construction of an alpha toxin gene knockout mutant of Clostridium perfringens type A by use of a mobile group II intron. Appl Environ Microbiol 71: 7542–7547. Chu VT, Adamidi C, Liu Q, Perlman PS & Pyle AM (2001) Control of branch-site choice by a group II intron. EMBO J 20: 6866–6876. Coros CJ, Landthaler M, Piazza CL, Beauregard A, Esposito D, Perutka J, Lambowitz AM & Belfort M (2005) Retrotransposition strategies of the Lactococcus lactis Ll.LtrB group II intron are dictated by host identity and cellular environment. Mol Microbiol 56: 509–524. Costa M, Deme E, Jacquier A & Michel F (1997) Multiple tertiary interactions involving domain II of group II self-splicing introns. J Mol Biol 267: 520–536. Costa M, Michel F & Westhof E (2000) A three-dimensional perspective on exon binding by a group II self-splicing intron. EMBO J 19: 5007–5018. Costa M, Michel F, Molina-Sánchez MD, Martı́nez-Abarca F & Toro N (2006a) An alternative intron–exon pairing scheme implied by unexpected in vitro activities of group II intron RmInt1 from Sinorhizobium meliloti. Biochimie 88: 711–717. Costa M, Michel F & Toro N (2006b) Potential for alternative intron–exon pairings in group II intron RmInt1 from Sinorhizobium meliloti and its relatives. ARN 12: 338–341. 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 356 Cousineau B, Smith D, Lawrence-Cavanagh S, Mueller JE, Yang J, Mills D, Manias D, Dunny G, Lambowitz A & Belfort M (1998) Retrohoming of a bacterial group II intron mobility via complete reverse splicing, independent of homologous DNA recombination. Cell 94: 451–462. Cousineau B, Lawrence S, Smith D & Belfort M (2000) Retrotransposition of a bacterial group II intron. Nature 404: 1018–1021. Erratum (2001) Nature 414: 84. Cui X, Matsuura M, Wang Q, Ma H & Lambowitz AM (2004) A group II intron-encoded maturase functions preferentially in cis and requires both the reverse transcriptase and X domains to promote RNA splicing. J Mol Biol 340: 211–231. Dai L & Zimmerly S (2002) Compilation and analysis of group II intron insertions in bacterial genomes: evidence for retroelement behaviour. Nucleic Acids Res 30: 1091–1102. Dai L & Zimmerly S (2003) ORF-less and reverse-transcriptaseencoding group II introns in archeobacteria, with a pattern of homing into related group II introns ORFs. RNA 8: 1294–1307. De Lencastre A, Hamill S & Pyle AM (2005) A single-active site region for a group II intron. Nature Struct Mol Biol 12: 626–627. Dickson L, Huang HR, Liu L, Matsuura M, Lambowitz AM & Perlman PS (2001) Retrotransposition of a yeast group II intron occurs by reverse splicing directly into ectopic DNA sites. Proc Natl Acad Sci USA 98: 13207–13212. D’Souza LM & Zhong J (2002) Mutations in the Lactococcus lactis Ll.LtrB group II intron that retain mobility in vivo. BMC Mol Biol 3: 17. Eickbush TH (1994) Origins and evolutionary relationships of retroelements. The Evolutionary Biology of Viruses (Morse SS, ed), pp. 121–157. Raven Press, New York. Fedorova O, Mitros T & Pyle AM (2003) Domains 2 and 3 interact to form critical elements of the group II intron active site. J Mol Biol 330: 197–209. Ferat JL & Michel F (1993) Group II self-splicing introns in bacteria. Nature 364: 358–361. Ferat JL, Le Gouar M & Michel F (1994) Multiple group II selfsplicing introns in mobile DNA from Escherichia coli. C R Acad Sci III 317: 141–148. Ferat JL, Le Gouar M & Michel F (2003) A group II intron has invaded the genus Azotobacter and inserted within the termination codon of the essential groEL gene. Mol Microbiol 49: 1407–1423. Fernández-López M, Muñoz-Adelantado E, Gillis M, Willems A & Toro N (2005) Dispersal and evolution of the Sinorhizobium meliloti group II RmInt1 intron in bacteria that interact with plants. Mol Biol Evol 22: 1518–1528. Frazier CL, San Filipo J, Lambowitz AM & Mills DA (2003) Genetic manipulation of Lactococcus lactis by using targeted group II introns: generation of stable insertions without selection. Appl Environ Microbiol 69: 1121–1128. Granlund M, Michel F & Norgren M (2001) Mutually exclusive distribution of IS1548 and GBSi1, an active group II intron 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c Nicolás Toro et al. identified in human isolates of group B Streptococci. J Bacteriol 183: 2560–2569. Guo H, Karberg M, Long M, Jones JP III, Sullenger B & Lambowitz AM (2000) Group II introns designed to insert into therapeutically relevant DNA target sites in human cells. Science 289: 452–457. Hamill S & Pyle AM (2006) The receptor for branch-site docking within a group II intron active site. Mol Cell 23: 831–840. Ichiyanagi K, Beauregard A, Lawrence S, Smith D, Cousineau B & Belfort M (2002) Retrotransposition of the Ll.LtrB group II intron proceeds predominantly via reverse splicing into DNA targets. Mol Microbiol 46: 1259–1272. Ichiyanagi K, Beauregard A & Belfort M (2003) A bacterial group II intron favors retrotransposition into plasmid targets. Proc Natl Acad Sci USA 100: 15742–15747. Jiménez-Zurdo JI, Garcı́a-Rodrı́guez FM, Barrientos-Durán A & Toro N (2003) DNA target site requirements for homing in vivo of a bacterial group II intron encoding a protein lacking the DNA endonuclease domain. J Mol Biol 326: 413–423. Karberg M, Guo H, Zhong J, Coon R, Perutka J & Lambowitz AM (2001) Group II introns as controllable gene targeting vectors for genetic manipulation of bacteria. Nat Biotechnol 19: 1162–1167. Koonin EV (2006) The origins of introns and their role in eukaryogenesis: a compromise solution to introns-early vs. introns-late debate? Biol Direct 1: 39. Kosaraju P, Pulakat L & Gavini N (2005) Analysis of the genome of Azotobacter vinelandii revealed the presence of two genetically distinct group II introns on the chromosome. Genetica 124: 107–115. Lambowitz AM & Perlman PS (1990) Involvement of aminoacyltRNA synthetases and other proteins in group I and group II intron splicing. Trends Biochem Sci 15: 440–444. Lambowitz AM & Zimmerly S (2004) Mobile group II introns. Ann Rev Genet 38: 1–35. Lambowitz AM, Caprara MG, Zimmerly S & Perlman PS (1999) Group I and group II ribozymes as RNPs: clues to the past and guides to the future. The RNA World. 2nd edn (Gesteland RF, Cech TR, Atkins JF, eds), pp. 451–485. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Lambowitz AM, Mohr G & Zimmerly S (2005) Group II intron homing endonucelases: ribonuceloprotein complexes with programmable target specificity. Homing Endonucleases and Inteins (Belfort M, ed), pp. 121–145. Springer-Verlag, Heidelberg. Lehmann K & Schmidt U (2003) Group II introns: structural and catalytic versatility of large natural ribozymes. Crit Rev Biochem Mol Biol 38: 249–303. Lynch M (2006) Streamlining and simplification of microbial genome architecture. Annu Rev Microbiol 60: 327–349. Martin W & Koonin EV (2006) Introns and the origin of nucleuscytosol compartmentalization. Nature 440: 41–45. Martı́nez-Abarca F & Toro N (2000a) RecA-independent ectopic transposition in vivo of a bacterial group II intron. Nucleic Acids Res 28: 4397–4402. FEMS Microbiol Rev 31 (2007) 342–358 357 Bacterial group II introns Martı́nez-Abarca F & Toro N (2000b) Group II introns in the bacterial world. Mol Microbiol 38: 917–926. Martı́nez-Abarca F, Zekri S & Toro N (1998) Characterization and splicing in vivo of a Sinorhizobium meliloti group II intron associated with particular insertion sequences of the IS630Tc1/IS3 retroposon superfamily. Mol Microbiol 28: 1295–1306. Martı́nez-Abarca F, Garcı́a-Rodrı́guez F, Muñoz E & Toro N (1999) Biochemical demonstration of reverse transcriptase activity and splicing efficiency of bacterial group II intron from Sinorhizobium meliloti. Nucleic Acids Symp Ser 41: 117–119. Martı́nez-Abarca F, Garcı́a-Rodrı́guez FM & Toro N (2000) Homing of a bacterial group II intron with an intron-encoded protein lacking a recognizable endonuclease domain. Mol Microbiol 35: 1405–1412. Martı́nez-Abarca F, Barrientos-Durán A, Fernández-López M & Toro N (2004) The RmInt1 group II intron has two different retrohoming pathways for mobility using predominantly the nascent lagging strand at DNA replication forks for priming. Nucleic Acids Res 32: 2880–2888. Matsuura M, Saldanha R, Ma H, Wank H, Yang J, Mohr G, Cavanagh S, Dunny GM, Belfort M & Lambowitz AM (1997) A bacterial group II intron encoding reverse transcriptase, maturase and DNA endonuclease activities: biochemical demonstration of maturase activity and insertion of new genetic information within the intron. Genes Dev 11: 2910–2924. Matsuura M, Noah JW & Lambowitz AM (2001) Mechanism of maturase-promoted group II intron splicing. EMBO J 20: 7259–7270. Meng Q, Wang Y & Liu XQ (2005) An intron-encoded protein assists RNA splicing of multiple similar introns of different bacterial genes. J Biol Chem 280: 35085–35088. Michel F & Ferat JL (1995) Structure and activities of group II introns. Annu Rev Biochem 64: 435–461. Michel F, Umesomo K & Ozeki H (1989) Comparative and functional anatomy of group II catalytic introns – a review. Gene 82: 5–30. Mills DA, McKay LL & Dunny GM (1996) Splicing of a group II intron involved in the conjugative transfer of pRS01 in lactococci. J Bacteriol 178: 3531–3538. Mohr G, Smith D, Belfort M & Lambowitz AM (2000) Rules for DNA target-site recognition by a lactococcal group II intron enable retargeting of the intron to specific DNA sequences. Genes Dev 14: 559–573. Molina-Sánchez MD, Martı́nez-Abarca F & Toro N (2006) Excision of the Sinorhizobium meliloti group II intron RmInt1 as circles in vivo. J Biol Chem 281: 28737–28744. Mueller MW, Allmaier M, Eskes R & Schweyen RJ (1993) Transposition of group II intron aI1 in yeast and invasion of mitochondrial genes at new locations. Nature 366: 174–176. Muñoz E, Villadas P & Toro N (2001) Ectopic transposition of a group II intron in natural bacterial populations. Mol Microbiol 41: 645–652. FEMS Microbiol Rev 31 (2007) 342–358 Muñoz-Adelantado E, San Filippo J, Martı́nez-Abarca F, Garcı́aRodrı́guez FM, Lambowitz AM & Toro N (2003) Mobility of the Sinorhizobium meliloti group II intron RmInt1 occurs by reverse splicing into DNA, but requires an unknown reverse transcriptase priming mechanism. J Mol Biol 327: 931–943. Murray HL, Mikheeva S, Coljee VW, Turczyk BM, Donahue WF, Bar-Shalom A & Jarrel KA (2001) Excision of group II introns as circles. Mol Cell 8: 201–211. Nakamura Y, Kaneko T, Sato S, et al. (2002) Complete genome structure of the thermophilic cyanobacterium Thermosynechococcus elongatus BP-1. DNA Res 9: 123–130. Nisa-Martı́nez R, Jiménez-Zurdo JI, Martı́nez-Abarca F, MuñozAdelantado E & Toro N (2007) Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer. Nucleic Acids Res 35: 214–222. Noah JW, Park S, Whitt JT, Perutka J, Frey W & Lambowitz AM (2006) Atomic force microscopy reveals DNA bending during group II intron ribonucleoprotein particle integration into double-stranded DNA. Biochemistry 45: 12424–12435. Ostheimer GJ, Williams-Carrier R, Belcher S, Osborne E, Gierke J & Barkan A (2003) Group II intron splicing factors derived by diversification of an ancient RNA-binding domain. EMBO J 22: 3919–3929. Perutka J, Wang W, Goerlitz D & Lambowitz AM (2004) Use of computer-designed group II introns to disrupt Escherichia coli DexH/D-box protein and DNA helicase genes. J Mol Biol 336: 421–439. Plante I & Cousineau B (2006) Restriction for gene insertion within the Lactococcus lactis Ll.LtrB group II intron. RNA. 12: 1980–1992. Pyle AM & Lambowitz AM (2006) Group II introns: ribozymes that splice RNA and invade DNA. RNA World. 3rd edn (Getsland RF, Ceh TR & Atkins JF, eds), pp. 469–505. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Rest JS & Mindell DP (2003) Retroids in archaea: phylogeny and lateral origins. Mol Biol Evol 20: 1134–1142. Robart AR & Zimmerly S (2005) Group II intron retroelements: function and diversity. Cytogenet Genome Res 110: 589–597. Robart AR, Montgomery NK, Smith KL & Zimmerly S (2004) Principles of 3 0 splice site selection and alternative splicing for an unusual group II intron from Bacillus anthracis. RNA 10: 854–862. Roberts AP, Braun V, von Eichel-Streiber C & Mullany P (2001) Demonstration that the group II intron from the clostridial conjugative transposon Tn5397 undergoes splicing in vivo. J Bacteriol 183: 1296–1299. Saldanha R, Chen B, Wank H, Matsuura M, Edwards J & Lambowitz AM (1999) RNA and protein catalysis in group II intron splicing and mobility reactions using purified components. Biochemistry 38: 9069–9083. San Filippo J & Lambowitz AM (2002) Characterization of the Cterminal DNA-binding/DNA endonuclease region of a group II intron-encoded protein. J Mol Biol 324: 933–951. 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c 358 Schäfer B, Gan L & Perlman PS (2003) Reverse transcriptase and reverse splicing activities encoded by the mobile group II intron cob Il of fission yeast mitochondrial DNA. J Mol Biol 329: 191–206. Sharp PA (1985) On the origin of RNA splicing and introns. Cell 42: 397–400. Shearman C, Godon JJ & Gasson M (1996) Splicing of a group II intron in a functional transfer gene of Lactococcus lactis. Mol Microbiol 21: 45–53. Singh NN & Lambowitz AM (2001) Interaction of a group II intron ribonucleoprotein endonuclease with its DNA target site investigated by DNA footprinting and modification interference. J Mol Biol 309: 361–386. Singh RN, Saldanha RJ, D’Souza LM & Lambowitz AM (2002) Binding of a group II intron-encoded reverse transcriptase/ maturase to its high affinity intron RNA binding site involves sequence-specific recognition and autoregulates translation. J Mol Biol 318: 287–303. Smith D, Zhong J, Matsuura M, Lambowitz AM & Belfort M (2005) Recruitment of host functions suggests a repair pathway for late steps in group II intron retrohoming. Genes Dev 19: 2477–2487. Stoltzfus A (1999) On the possibility of constructive neutral evolution. J Mol Evol 49: 169–181. Su LJ, Qin PZ, Michels WJ & Pyle AM (2001) Guiding ribozyme cleavage through motif recognition: the mechanism of cleavage site selection by a group II intron ribozyme. J Mol Biol 306: 655–668. Swisher J, Duarte CM, Su LJ & Pyle AM (2001) Visualizing the solvent-inaccessible core of a group II intron ribozyme. EMBO J 20: 2051–2061. Toor N, Hausner G & Zimmerly S (2001) Coevolution of group II intron RNA structures with their intron-encoded reverse transcriptases. RNA 7: 1142–1152. Toor N, Robart AR, Christianson J & Zimmerly S (2006) Selfsplicing of a group IIC intron: 5 0 exon recognition and alternative 5 0 splicing events implicate the stem-loop motif of a transcriptional terminator. Nucleic Acids Res 34: 6561–6573. Toro N (2003) Bacteria and archaea group II introns: additional mobile elements in the environment. Environ Microbiol 5: 143–151. Toro N, Molina-Sánchez MD & Fernández-López M (2002) Identification and characterization of bacterial class E group II introns. Gene 299: 245–250. Toro N, Martı́nez-Abarca F, Fernández-López M & MuñozAdelantado E (2003) Diversity of group II introns in the genome of Sinorhizobium meliloti strain 1021: splicing and mobility of RmInt1. Mol Genet Genomics 268: 628–636. 2007 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved c Nicolás Toro et al. Tourasse NJ, Stabell FB, Reiter L & Kolst AB (2005) Unusual group II introns in bacteria of the Bacillus cereus group. J Bacteriol 187: 5437–5451. Tourasse NJ, Helgason E, Økstad OA, Hegna IK & Kolst AB (2006) The Bacillus cereus group: novel aspects of population structure and genome dynamics. J Appl Microbiol 101: 579–593. van der Auwera GA, Andrup L & Mahillon J (2005) Conjugative plasmid pAW63 brings new insights into the genesis of Bacillus anthracis virulence plasmid pX02 and of the Bacillus thuringiensis plasmid pBT9727. BMC Genomics 6: 103. Wang H & Lambowitz AM (1993) The Mauriceville plasmid reverse transcriptase can initiate cDNA synthesis de novo and may be related to reverse transcriptase and DNA polymerase progenitor. Cell 75: 1071–1081. Wank H, San Filippo J, Singh RN, Matsuura M & Lambowitz AM (1999) A reverse transcriptase/maturase promotes splicing by binding at its own coding segment in a group II intron RNA. Mol Cell 4: 239–250. Yang J, Mohr G, Perlman PS & Lambowitz AM (1998) Group II intron mobility in yeast mitochondria: target DNA-primed reverse transcription activity of aI1 and reverse splicing into DNA transposition sites in vitro. J Mol Biol 282: 505–523. Yao J, Zhong J & Lambowitz AM (2005) Gene targeting using randomly inserted group II introns (targetrons) recovered from Escherichia coli gene disruption library. Nucleic Acids Res 33: 3351–3362. Yao J, Zhong J, Fang Y, Geisinger E, Novick RP & Lambowitz AM (2006) Use of targetrons to disrupt essential and nonessential genes in Staphylococcus aureus reveals temperature sensitivity of Ll.LtrB group II intron splicing. RNA 12: 1271–1281. Zhao J & Lambowitz AM (2005) A bacterial group II intronencoded reverse transcriptase localizes to cellular poles. Proc Natl Acad Sci USA 102: 16133–16140. Zhong J & Lambowitz AM (2003) Group II intron mobility using nascent strands at DNA replication forks to prime reverse transcription. EMBO J 22: 4555–4565. Zhong J, Karberg M & Lambowitz AM (2003) Targeted and random bacterial gene disruption using a group II intron (targetron) vector containing a retrotransposition-activated selectable marker. Nucleic Acids Res 31: 1656–1664. Zimmerly S, Guo H, Perlman PS & Lambowitz AM (1995) Group II intron mobility occurs by target DNA-primed reverse transcription. Cell 82: 545–554. Zimmerly S, Hausner G & Wu X (2001) Phylogenetic relationships among group II intron ORFs. Nucleic Acids Res 29: 1238–1250. FEMS Microbiol Rev 31 (2007) 342–358
© Copyright 2026 Paperzz