The Yeast Intein Database
  The Yeast Intein Database is a comprehensive and curated database devoted to yeast inteins.

  Intein Properties
 

Intein Name: Tli pol-1. The intein name should consist of a 3 letter organism specification, where the first letter is the first letter of the Genus (Thermococcus) and the second and third letters are a Species designation (litoralis). The organism abbreviation is then followed by an abbreviation for the extein gene (Pol). If more than 1 intein is present in a gene, they should be numbered in order of appearance from the N- to C-terminus. For organisms with only a genus designation and an isolate code, such as Psp Pol or Tsp Pol inteins, include a strain designation - Example: Psp-KOD Pol.

Intein size (aa): List the number of amino acids in the intein, not including the C-extein S/T/C.

Endo Activity: If endonuclease activity has been demonstrated, list endonuclease name. Convention dictates that the endonuclease name is preceded by the 'PI-' prefix.

Endo Motif: List: 'DOD' if a member of the LAGLIDADG or dodecapeptide motif family; 'HNH' if a member of the HNH family; 'none' if no large insert is present between intein Blocks B and F; and 'unknown' if a large insert (>100 aa) is present that doesn't have any of the known homing endonuclease signature motifs. Leave blank if you are not sure.

Location in extein (aa preceding intein): List the amino acid preceding the intein (using the single letter code) and its position in the extein, with amino acid 1 being the initiating Met of the extein gene.

N-terminal Splice Junction: Last 10 N-extein residues/intein N-terminus (single letter code).

C-terminal Splice Junction: Last 2 amino acids of the intein/first 10 amino acids of the C-extein (single letter code).

Insert Site: Include the insertion site name consisting of the abbreviated extein name and an alphabetic designation for insertion sites in that extein, taking into account the previously identified insertion sites in extein homologs. Also list notable landmarks like motifs or active site residues.

Intein name
Intein Size
Endo activity
Endo motif
Location in extein
N-terminal Splice Junction
C-terminal Splice Junction
Insert Site
Cba-WM02.98 PRP8 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cba-WM728 PRP8 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cla PRP8 522 -- DOD S59? TWEGLFWEKS/C HN/SGFEESMKFK PRP8-a,same as A1578 in Sce PRP8
Cga PRP8 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cgl VMA 415 -- DOD G276 SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Cne-a RP8 (Fne-A PRP8) 171 -- None A? TWEGLFWEKA/C HN/SGFEE----- PRP8-a, same as A1578 in Sce PRP8
Cne-AD PRP8 (Fne-AD PRP8) 172 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cne-JEC21 PRP8 172 -- None A1530 TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cpa ThrRS 182 -- None N399 KETFGLKPMN/C GN/CPGHCILFKS thrRS-a
Ctr ThrRS 345 -- -- N399 KETFGLKPMN/C GN/CPGHAVMFKS thrRS-a
Ctr VMA 471 -- DOD G283 SNSDVIIYVG/C HN/CGERGNEMAE VMA-a
Dhan GLT1 607 -- DOD R1183 IALGCIMMRR/C AN/CHLNTCPVGI GLT1-a
Dhan VMA 394 -- DOD G271 SNSDLMVYIG/C HN/CGERGNEMAE VMA-a
Kla-CBS683 VMA 410 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Kex-CBS379 VMA 502 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Kla-IFO1267 VMA 410 -- DOD G276 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Kla-NRRLY1140 VMA 410 -- DOD G283 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Lel VMA 421 -- DOD G279 SNSDVIIYVG/C HN/CGERGNEMAE VMA-a
Pgu GLT1 553 -- DOD K1186 IAMGCIMMRK/C AN/CHLNTCPVGI GLT1-a
Pgu-alt GLT1 553 -- DOD K? IAMGCIMMRK/C SN/CHLNTCPVGI GLT1-a, R1183 of Dha GluSyn
Pst VMA 449 -- DOD G282 SNSDVIIYVG/C HN/CGERGNEMAE VMA-a
Sca-CBS4309 VMA 517 -- DOD G18? SNSDSIIYVG/C HN/CGERGNEMAE VMA-a
Sca-IFO1992 VMA 517 -- DOD G265 SNSDSIIYVG/C HN/CGERGNEMAE VMA-a
Scar VMA 454 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce VMA 454 PI-SceI DOD G283 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce-DH1-1A VMA 454 -- DOD G28? SNSDAIIYVGlC HN/CGERGNEMAE VMA-a
Sce-OUT7091 VMA 454 -- DOD G258 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce-OUT7112 VMA 454 -- DOD G258 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sda VMA 501 -- DOD G18? SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Sex-IFO1128 VMA 499 -- DOD G267? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sja VMA 476 -- DOD G284 SNSDLIVYVG/C HN/CGERGNEMAE VMA-a
Spa VMA 454 -- DOD G269 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sun VMA 414 -- DOD G18? SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Tgl VMA 456 -- DOD G18? SNSDVIIYVG/C HN/CGERGNEMAE VMA-a
Tpr VMA 455 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Vpo VMA 433 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Zba VMA 456 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Zbi VMA 450 -- DOD G18? SNSDAIVYVG/C HN/CGERGNEMAE VMA-a
Zro VMA 450 -- DOD G18? SNSDAIVYVG/C HN/CGERGNEMAE VMA-a
Cba PRP8 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cba PRP8-1 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cba PRP8-2 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cba PRP8-3 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cne PRP8 172 -- None A1530 TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cne PRP8-1 172 -- None A1530 TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Fne PRP8 236 -- None A1530 TWEGLFWEKA/C HN/ SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cne PRP8-2 219 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Cne PRP8-3 171 -- None A? TWEGLFWEKA/C HN/SGFEE----- PRP8-a, same as A1578 in Sce PRP8
Cne PRP8-4 171 -- None A? TWEGLFWEKA/C HN/SGFEE----- PRP8-a, same as A1578 in Sce PRP8
Cne PRP8-5 171 -- None A? TWEGLFWEKA/C HN/SGFEE----- PRP8-a, same as A1578 in Sce PRP8
Cne PRP8-6 171 -- None A? TWEGLFWEKA/C HN/SGFEE----- PRP8-a, same as A1578 in Sce PRP8
Cgl VMA-1 415 -- DOD G276 SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Cgl VMA-2 415 -- DOD G276 SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Kun VMA 414 -- DOD G18? SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Vpo VMA-1 433 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Vpo VMA-2 433 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce VMA-1 454 -- DOD G269 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Kun VMA-1 414 -- DOD G18? SNSDTIIYVG/C HN/CGERGNEMAE VMA-a
Kex VMA 499 -- DOD G267? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Kex VMA-1 502 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce VMA-2 454 PI-SceI DOD G283 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce VMA-3 454 -- DOD G258 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce VMA-4 454 -- DOD G258 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Sce VMA-5 586 -- DOD G269 -------------------- KS/CGNNAGARIC VMA-a
Spa VMA-1 586 -- DOD G269 ------------------- KS/CGNNAGARIC VMA-a
Sce VMA-6 572 -- DOD G258 -------------- -------------- VMA-a
Lel VMA-1 421 -- DOD G279 SNSDVIIYVG/C HN/CGERGNEMAE VMA-a
Lel VMA-2 421 -- DOD G279 SNSDVIIYVG/C HN/CGERGNEMAE VMA-a
Cne PRP8-7 172 -- None A1530 TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a
Sce VMA-7 454 PI-SceI DOD G283 SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Cba PRP8-4 170 -- None A? TWEGLFWEKA/C HN/SGFEESMKNK PRP8-a, same as A1578 in Sce PRP8
Pgu GLT1-a 553 -- DOD K1186 IAMGCIMMRK/C AN/CHLNTCPVGI GLT1-a
Pgu GLT1-b 553 -- DOD K1186 IAMGCIMMRK/C AN/CHLNTCPVGI GLT1-a
Vpo VMA-1 433 -- DOD G18? SNSDAIIYVG/C HN/CGERGNEMAE VMA-a
Spa VMA-2 586 -- DOD G269 - KS/CRGCCVGEQL VMA-a
Cgl VMA-3 578 -- DOD G18? - YS/CTECCETDAV VMA-a
Cgl VMA-4 464 -- DOD G18? - YS/CTECCETDAV VMA-a
Cgl VMA-5 578 -- DOD G18? - YS/CTECCETDAV VMA-a
Vpo VMA-2 464 -- DOD G269 -- PV/CGSHCEKEQP VMA-a
Vpo VMA-3 464 -- DOD G269 - PV/CGSHCEKEQP VMA-a
Sce VMA-8 465 -- DOD G269 -- KS/CRGCCVGEQL VMA-a
Sce VMA-9 465 -- DOD G269 - KS/CRGCCVGEQH VMA-a
Sce VMA-10 465 -- DOD G269 - KS/CRGCCVGEQH VMA-a
Spa VMA-3 465 -- DOD G269 - KS/CRGCCVGEQL VMA-a
Sba VMA 465 -- DOD G269 - KS/CGGYCEGEQP VMA-a

 

 

 

Please send comments and suggestions to curator at: curator@ibibiosolutions.com.


Developed at the Bioinformatics Research Laboratory
of IBI Biosolutions Pvt. Ltd.

Copyright © IBI Biosolutions Pvt. Ltd.. (2008).
For problems or questions regarding this Web site contact us at: curator@ibibiosolutions.com

Disclaimer:
IBI Databases and associated information are protected by copyright. This server and its associated data and services are for academic, non-commercial use only. The IBI has no liability for the use of results, data or information which have been provided through this server. Neither the use for commercial purposes nor the redistribution of IBI database files to third parties nor the distribution of parts of files or derivative products to any third parties is permitted. Commercial users may contact the IBI BIosolutions Pvt. Ltd.