Knowledge-driven data integration for the prediction of protein-protein interaction networks

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review


Information about the networks of protein interactions within a cell can greatly increase our understanding of protein function and cellular processes. The advent of high- throughput experimental techniques, as well as large-scale computational prediction models, has greatly increased the volume of genomic data aimed at discovering the functionality of genes and proteins. Individually, these data should be viewed with caution as they are often inaccurate and incomplete. Recent years have seen a growing trend towards the adoption of diverse data integration techniques to support large-scale analysis of protein-protein interaction (PPI) networks. However, current research is mainly focused on data-driven approaches. This paper proposes a knowledge-driven computational framework to support systems-level data integration for the prediction of PPI networks. Based on the incorporation of prior knowledge of the relationship between different “omic” datasets, different likelihood-ratio-based Bayesian models (LR-NB) have been developed to combine the evidence from diverse sources, ranging from co- expression to essentiality to formulate PPI predictions. We demonstrate improvements in the PPI prediction performance. Results are evaluated against Gold Standards, which were derived from the MIPS Complex Catalogue (Saccharomyces cerevisiae) and the Human Protein Reference Database. We also implement a novel analysis of local regions of a Receiver Operating Characteristic curve as less biased and more exact approach to assessing the quality of prediction models. This investigation also provides the basis for new PPI network inference and analysis applications in other model organisms and in specific diseases.
Original languageEnglish
Title of host publicationUnknown Host Publication
Number of pages1
Publication statusPublished (in print/issue) - Jul 2008
Event16th Annual International Conference Intelligent Systems for Molecular Biology -
Duration: 1 Jul 2008 → …


Conference16th Annual International Conference Intelligent Systems for Molecular Biology
Period1/07/08 → …


Dive into the research topics of 'Knowledge-driven data integration for the prediction of protein-protein interaction networks'. Together they form a unique fingerprint.

Cite this