The Yeast Intein Database
  The Yeast Intein Database is a comprehensive and curated database devoted to yeast inteins.

  Splicing motifs
 

(Perler, F. B. (2002). InBase, the Intein Database. Nucleic Acids Res. 30, 383-384)
Several conserved motifs have been observed by comparing intein amino acid sequences. There are two nomenclatures for these motifs: Blocks A, B, C, D, E, H, F, G or Blocks N1, N3, EN1, EN2, EN3, EN4, C2 and C1, respectively . Blocks N2 and N4 are not as well conserved as the other intein motifs. The intein motifs are more extensively described in the Conserved Intein Features - Do you Have an Intein? section and their sequences are listed in the Splicing Motifs and LAGLIDADG (DOD) Homing Endonuclease Motifs sections

The consensus sequence for each block is indicated below. Although, no single residue is invariant, the Ser and Cys in Block A, the His in Block B, the His, Asn and Ser/Cys/Thr in Block G are the most conserved residues in the splicing motifs. Any member of an amino acid group may be present in the remaining positions, even when a specific predominant residue is indicated
(Perler, F. B. (2002). InBase, the Intein Database. Nucleic Acids Res. 30, 383-384).

Remember, the splicing domain consists of Blocks A, N2, B, N3, F and G, while Blocks C, D, E and H are only in the DOD homing endonuclease domains. Blocks C and E each contains an endonuclease active site Asp (D) or Glu (E). Block D contains an endonuclease active site Lys (K) in the Sce VMA intein (Duan 1997). Several inteins have mutations in these endonuclease active site residues and therefore may not be active endonucleases although the remainder of the motif is present.
Key: upper case letters represent the standard single letter amino acid code for the most common amino acid at this position and lower case letters represent amino acid groups: ., any residue; h, hydrophobic residues: G,A,V,L,I, M; p, polar residues: S,C,T; a, acidic residues: D or E; r, aromatic residues: F,Y,W)

Consensus pattern of intein motifs
Block A: Ch..Dp.hhh..G (first residue = intein N-terminus)
Block B: G..h.hT..H.hhh

(usually 70-105 residues from N-terminus)

Block C: LhG..hhaG (motif of DOD homing endo)
Block D: .K.IP..h (motif of DOD homing endo)
Block E: .L.GhFahDG (motif of DOD homing endo)
Block H : p.S..hh..h..LL..hGI

(motif of DOD homing endo)

Block F: rVYDLpV[1-3 residues]a..[H or E]NFh
Block G: NGhhhHNp

(p = downstream extein N-terminus)

(Francine B. Perler: InBase: the Intein Database: Identifying Inteins by Conserved Intein Features Vol. 30, No. 1 383-384, Research, 2002)

A. Splicing Motifs (Blocks A, B, F, G)

N-terminal (N) domain
<-->
1-33 AA
<-->
16-75 AA

<-->
3-37 AA
N1
N2
N3
N4

The first motif is found at the N-termini of inteins. Note the conserved Ser and Cys at the N' intein splice site. The OH/SH side groups of the aa in this position are necessary for the N-O/S shift of the peptide bond to the N-extein. The His at position 10 of the third motif is the most conserved intein residue. It was first predicted and then shown to be involved in the protein splicing reaction. Intein structures also showed this residue to be positioned at the protein splicing active site.

C-terminal (C) domain
<-->
1-9 AA

C2
C1

The two C domain motifs are involved in the final steps of protein splicing. A Ser, Thr or Cys are found at the N-terminus of the C-extein (last position of second motif). The hydroxyl, or thiol, group of this aa attacks the last aa of the N-extein in a transesterification reaction. The resulting branched intermediate is resolved by the cyclization of the Asn preceding the attacking aa. This Asn, the intein's C-termini, is the second most conserved residue in inteins. Gln residues are now also known to occur in this position. They are found in the CIV RIR1 and Pho polC inteins. These inteins are integrated in conserved protein regions of vital proteins (subunits of ribonucleotide reductase and replicative DNA polymerases). Hence it is likely that these inteins are active and capable of protein splicing. The splicing reaction is suggested to be a variation of the Asn cyclization in which the Gln will undergo cyclization to glutarimide ring.

 

 

 

Please send comments and suggestions to curator at: curator@ibibiosolutions.com.


Developed at the Bioinformatics Research Laboratory
of IBI Biosolutions Pvt. Ltd.

Copyright © IBI Biosolutions Pvt. Ltd.. (2008).
For problems or questions regarding this Web site contact us at: curator@ibibiosolutions.com

Disclaimer:
IBI Databases and associated information are protected by copyright. This server and its associated data and services are for academic, non-commercial use only. The IBI has no liability for the use of results, data or information which have been provided through this server. Neither the use for commercial purposes nor the redistribution of IBI database files to third parties nor the distribution of parts of files or derivative products to any third parties is permitted. Commercial users may contact the IBI BIosolutions Pvt. Ltd.