Predicting protein functions from redundancies in large-scale protein interaction networks

Manoj Pratim Samanta; Shoudan Liang

doi:10.1073/pnas.2132527100

Predicting protein functions from redundancies in large-scale protein interaction networks

Proc Natl Acad Sci U S A. 2003 Oct 28;100(22):12579-83. doi: 10.1073/pnas.2132527100. Epub 2003 Oct 17.

Authors

Manoj Pratim Samanta¹, Shoudan Liang

Affiliation

¹ NASA Advanced Supercomputing Division, National Aeronautics and Space Administration, Ames Research Center, Moffet Field, CA 94035, USA.

Abstract

Interpreting data from large-scale protein interaction experiments has been a challenging task because of the widespread presence of random false positives. Here, we present a network-based statistical algorithm that overcomes this difficulty and allows us to derive functions of unannotated proteins from large-scale interaction data. Our algorithm uses the insight that if two proteins share significantly larger number of common interaction partners than random, they have close functional associations. Analysis of publicly available data from Saccharomyces cerevisiae reveals >2,800 reliable functional associations, 29% of which involve at least one unannotated protein. By further analyzing these associations, we derive tentative functions for 81 unannotated proteins with high certainty. Our method is not overly sensitive to the false positives present in the data. Even after adding 50% randomly generated interactions to the measured data set, we are able to recover almost all (approximately 89%) of the original associations.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Databases, Protein
Neural Networks, Computer
Probability
Protein Binding
Proteins / chemistry*
Proteins / metabolism*
Reproducibility of Results
Ribosomal Proteins / chemistry
Ribosomal Proteins / classification
Saccharomyces cerevisiae
Saccharomyces cerevisiae Proteins / chemistry*
Saccharomyces cerevisiae Proteins / metabolism

Substances

Proteins
Ribosomal Proteins
Saccharomyces cerevisiae Proteins