Characterization of new proteins found by analysis of short open reading frames from the full yeast genome

Yeast. 1997 Nov;13(14):1363-74. doi: 10.1002/(SICI)1097-0061(199711)13:14<1363::AID-YEA182>3.0.CO;2-8.

Abstract

We have analysed short open reading frames (between 150 and 300 base pairs long) of the yeast genome (Saccharomyces cerevisiae) with a two-step strategy. The first step selects a candidate set of open reading frames from the DNA sequence based on statistical evaluation of DNA and protein sequence properties. The second step filters the candidate set by selecting open reading frames with high similarity to other known sequences (from any organism). As a result, we report ten new predicted proteins not present in the current sequence databases. These include a new alcohol dehydrogenase, a protein probably related to the cell cycle, as well as a homolog of the prokaryotic ribosomal protein L36 likely to be a mitochondrial ribosomal protein coded in the nuclear genome. We conclude that the analysis of short open reading frames leads to biologically interesting discoveries, even though the quantitative yield of new proteins is relatively low.

MeSH terms

  • Alcohol Dehydrogenase / genetics
  • Amino Acid Sequence
  • Cell Cycle Proteins / genetics
  • Codon / genetics
  • Evolution, Molecular
  • Frameshift Mutation
  • Fungal Proteins / chemistry*
  • Fungal Proteins / genetics*
  • Genome, Fungal*
  • Molecular Sequence Data
  • Open Reading Frames*
  • Protein Biosynthesis
  • Ribosomal Proteins / genetics
  • Saccharomyces cerevisiae / genetics*
  • Sequence Homology, Amino Acid

Substances

  • Cell Cycle Proteins
  • Codon
  • Fungal Proteins
  • Ribosomal Proteins
  • Alcohol Dehydrogenase