Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Genome Researcharrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
Genome Research
Article
Data sources: UnpayWall
Genome Research
Article . 2006 . Peer-reviewed
Data sources: Crossref
Genome Research
Article . 2006
versions View all 2 versions
addClaim

Small open reading frames: Not so small anymore

Authors: Richelle, Sopko; Brenda, Andrews;

Small open reading frames: Not so small anymore

Abstract

Today, nearly 10 years after the publication of the complete sequence of the Saccharomyces cerevisiae genome, the total number of genes in this organism is largely considered resolved and currently stands at 5782 (http://www.yeastgenome.org/cache/ genomeSnapshot.html). This gene sequence information has led to construction of numerous yeast strain and plasmid collections, including the two-hybrid (Ito et al. 2000, 2001; Uetz et al. 2000) viable haploid deletion (Winzeler et al. 1999; Giaever et al. 2002), titratable promoter allele (Mnaimneh et al. 2004), and chromosomally tagged green fluorescent protein and TAP fusion libraries (Ghaemmaghami et al. 2003; Huh et al. 2003). Because creation of these collections has relied on the accuracy of gene annotation at the time of library construction, each library has limitations. One major issue for most is the fact that initial annotation of the S. cerevisiae genome included only those regions consisting of at least 100 contiguous codons, and therefore small open reading frames (sORFs) encoding functional proteins were largely missed (Goffeau et al. 1996) and only considered later following detection of expression (Olivas et al. 1997; Velculescu et al. 1997; Kumar et al. 2002; Oshiro et al. 2002; Kessler et al. 2003). As such, the phenotypic consequences of gene disruption of more than half of these sORFs has not been assessed, and sORFs are underrepresented in genomic libraries and other collections. In this issue of Genome Research, Kastenmayer et al. (2006) provide the first systematic analysis of the prevalence of sORFs in a eukaryotic genome. Of the 299 currently recognized sORFs in the S. cerevisiae genome, they discovered that more than half (170) have been annotated since the genome was sequenced. Of these, 12% were identified by homology, and an even greater fraction (74%) was identified by combining both homology and empirical evidence such as detection of a transcribed and translated product. This is extremely significant given the variability in gene models that makes genome comparisons among species difficult and comparison between genera even more complex. The comparison of sORFs is additionally complicated given that a change in a single amino acid can create a large bias when assessing percent conservation. Also, prokaryotic genome sequence comparisons suggest that sORFs may be particularly susceptible to over-annotation (Nielsen and Krogh 2005). The significant apparent conservation of sORFs inspired Kastenmayer et al. to extend their analysis to include the entire list (299) of currently annotated sORFs. Using BLAST and analysis of the HomoloGene database, Kastenmayer et al. established that 184 S. cerevisiae sORFs have potential orthologs in other organisms and likely encode bona fide proteins with conserved functions. Like other large-scale functional genomic studies, this sORF survey affirms the importance of continued confirmation and refinement of genome annotation (for recent review, see Dolinksi and Botstein 2005). To initiate characterization of those 170 sORFs discovered since the completion of the sequencing of the S. cerevisiae genome, Kastenmayer et al. (2006) constructed 140 deletion mutants, bringing the total number of sORF deletion strains to 247 (the rest were previously constructed by the deletion consortium, Winzeler et al. 1999). Examination of these newly constructed strains in an array of conditions uncovered growth phenotypes for 22 mutants, some of which were overlapping. For example, three new sORFs are essential, six mutants had a significant growth defect, and several exhibited sensitivities to a range of cell stressors, including DNA-damaging agents and suboptimal growth conditions. What other information about sORF function might we expect to extract from experiments with the new sORF deletion collection? The standard yeast deletion mutant collection enabled a paradigm shift in yeast genetics—the collection of deletion mutants has been systematically scrutinized for a variety of phenotypes, providing rosters of genes that may contribute to a particular biological process (Scherens and Goffeau 2004). For example, the cell morphology of each mutant strain in the deletion collection has been examined by microscopy, providing the first comprehensive list of morphogenetic defects for any organism (Giaever et al. 2002; Saito et al. 2005). Also, the deletion mutants were designed with molecular “barcodes”—unique DNA sequences that identify each mutant strain—enabling parallel phenotypic analysis in pools of deletion mutants. The fitness contribution of individual strains can then be quantitatively assessed using a barcode oligonucleotide microarray readout (Giaever et al. 2002). The sORF collection has been similarly barcoded, and these mutant strains can now be included in all future functional profiling experiments. Twenty-one of the 247 sORFs (8%) for which deletion mutants exist are essential for haploid viability (including three new sORFs uncovered from the Kastenmayer study). This is much lower than the incidence of essentiality in the entire genome (22%), suggesting (1) condition-dependent essentiality for these sORFs or (2) much genetic redundancy involving sORFs. Given that the sORF collection is arrayed and compatible with methods developed for the larger mutant collection, genetic redundancy can be tackled using methods of automated yeast genetics that have been recently invented (Tong et al. 2004). In particular, the so-called Synthetic Genetic Array or SGA method can be applied to systematically construct sORF double mutants, allowing assessment of genetic background in which sORFs are required for viability. A synthetic lethal interaction network has already been constructed by screening for genetic interactions involving well over 100 mutant strains and the ∼5000 strains in the original deletion mutant collection (Tong et al. 2004). This project revealed that, on average, each gene will have about 26 synthetic lethal interactions. Using this average as a guide, we might expect sORFs to contribute 7774 (26 299) new genetic interactions to the S. cerevisiae genetic interaction network (Tong et al. 3Corresponding author. E-mail Brenda.andrews@utoronto.ca; fax (416) 946-8253. Article and publication are at http://www.genome.org/cgi/doi/10.1101/ gr.4976706. Commentary

Related Organizations
Keywords

Open Reading Frames, Genome, Human, Humans, Saccharomyces cerevisiae, Genome, Fungal

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    21
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Top 10%
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Top 10%
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
21
Average
Top 10%
Top 10%
bronze