search about people linkage internships contact

Genes Under Study

Picture of Arabidopsis plants growing on petri plateThe original list of genes being targeted for analysis by our project contained 4455 nuclear genes that were identified as likely to encode plastid-targeted proteins. We have adopted an inclusive approach to predicting plastid targeting to minimize false negatives, recognizing the likelihood of false positives. The 28,952 sequences in the TIGR A.thaliana proteome (release 5) were processed by TargetP, a sensitive and specific prediction algorithm (Emanuelsson et al., 2000; Richly and Leister, 2004). TargetP identified 3,996 protein-coding genes as chloroplast-targeted, with 273 ambiguously predicted as either mitochondrial or chloroplastic. Because such ambiguity is a common failing of prediction algorithms, these sequences are included in our gene list. 186 additional sequences are reliably validated as chloroplast-targeted but are missed by TargetP.

In the spring of 2007 we added 426 new genes to the target list. These genes are from stromal proteomics data from Klaas van Wijk at Cornell University and annotation made available from the SubCellular Proteomic Database (SUBA) at the Plant Energy Biology Centre of Excellence in Australia.

In the second half of 2007 we modified the gene list to incorporate changes to the annotation of the Arabidopsis genome due to release of TAIR7. This version was also updated by removing lists of genes annotated as producing embryo lethal genes when mutated. The current list of target genes can be accessed by clicking here.

Most plastid outer envelope membrane proteins do not carry predictable plastid-targeting sequences. Therefore this project will not attempt a comprehensive characterization of genes encoding proteins that are targeted to the outer envelope membrane. We will include selected proteins already demonstrated in the plastid outer envelope membrane and are directly related to ongoing work in our labs.

Heazlewood JL, Verboom RE, Tonti-Filippini J, Small I and Millar AH. (2007) SUBA: the Arabidopsis Subcellular Database. Nucleic Acids Res. 35, D213-D218 (link to PUBMED)

Heazlewood JL, Tonti-Filippini J, Verboom RE, Millar AH. (2005) Combining experimental and predicted datasets for determination of the subcellular location of proteins in Arabidopsis. Plant Physiol. 139, 598-609. (link to PUBMED)

Emanuelsson, O., Nielsen, H., Brunak, S. & von Heijne, G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J.Molec. Biol. 300, 1005-1016 (2000).

Richly, E. and Leister, D. An improved prediction of chloroplast proteins reveals diversities and commonalities in the chloroplast proteomes of Arabidopsis and rice. Gene 329, 11-6 (2004).

 

Previous lists of target genes can be accessed by clicking here.

Chloroplast 2010 is a collaborative Arabidopsis functional genomics project funded by a National Science Foundation
grant to Michigan State University. Did you find a bug or an error? Please let us know.