Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum. / Ratmann, Oliver; Jørgensen, Ole; Hinkley, Trevor; Stumpf, Michael; Richardson, Sylvia; Wiuf, Carsten.

In: PLOS Computational Biology, Vol. 3, No. 11, 01.11.2007, p. 2266-2278.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Ratmann, O, Jørgensen, O, Hinkley, T, Stumpf, M, Richardson, S & Wiuf, C 2007, 'Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum', PLOS Computational Biology, vol. 3, no. 11, pp. 2266-2278. https://doi.org/10.1371/journal.pcbi.0030230

APA

Ratmann, O., Jørgensen, O., Hinkley, T., Stumpf, M., Richardson, S., & Wiuf, C. (2007). Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum. PLOS Computational Biology, 3(11), 2266-2278. https://doi.org/10.1371/journal.pcbi.0030230

Vancouver

Ratmann O, Jørgensen O, Hinkley T, Stumpf M, Richardson S, Wiuf C. Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum. PLOS Computational Biology. 2007 Nov 1;3(11):2266-2278. https://doi.org/10.1371/journal.pcbi.0030230

Author

Ratmann, Oliver ; Jørgensen, Ole ; Hinkley, Trevor ; Stumpf, Michael ; Richardson, Sylvia ; Wiuf, Carsten. / Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum. In: PLOS Computational Biology. 2007 ; Vol. 3, No. 11. pp. 2266-2278.

Bibtex

@article{245b3463785d478bbfbd58d381f45de9,
title = "Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum",
abstract = "Gene duplication with subsequent interaction divergence is one of the primary driving forces in the evolution of genetic systems. Yet little is known about the precise mechanisms and the role of duplication divergence in the evolution of protein networks from the prokaryote and eukaryote domains. We developed a novel, model-based approach for Bayesian inference on biological network data that centres on approximate Bayesian computation, or likelihood-free inference. Instead of computing the intractable likelihood of the protein network topology, our method summarizes key features of the network and, based on these, uses a MCMC algorithm to approximate the posterior distribution of the model parameters. This allowed us to reliably fit a flexible mixture model that captures hallmarks of evolution by gene duplication and subfunctionalization to protein interaction network data of Helicobacter pylori and Plasmodium falciparum. The 80% credible intervals for the duplication-divergence component are [0.64, 0.98] for H. pylori and [0.87, 0.99] for P. falciparum. The remaining parameter estimates are not inconsistent with sequence data. An extensive sensitivity analysis showed that incompleteness of PIN data does not largely affect the analysis of models of protein network evolution, and that the degree sequence alone barely captures the evolutionary footprints of protein networks relative to other statistics. Our likelihood-free inference approach enables a fully Bayesian analysis of a complex and highly stochastic system that is otherwise intractable at present. Modelling the evolutionary history of PIN data, it transpires that only the simultaneous analysis of several global aspects of protein networks enables credible and consistent inference to be made from available datasets. Our results indicate that gene duplication has played a larger part in the network evolution of the eukaryote than in the prokaryote, and suggests that single gene duplications with immediate divergence alone may explain more than 60% of biological network data in both domains.",
author = "Oliver Ratmann and Ole J{\o}rgensen and Trevor Hinkley and Michael Stumpf and Sylvia Richardson and Carsten Wiuf",
year = "2007",
month = nov,
day = "1",
doi = "10.1371/journal.pcbi.0030230",
language = "English",
volume = "3",
pages = "2266--2278",
journal = "P L o S Computational Biology (Online)",
issn = "1553-734X",
publisher = "Public Library of Science",
number = "11",

}

RIS

TY - JOUR

T1 - Using likelihood-free inference to compare evolutionary dynamics of the protein networks of H. pylori and P. falciparum

AU - Ratmann, Oliver

AU - Jørgensen, Ole

AU - Hinkley, Trevor

AU - Stumpf, Michael

AU - Richardson, Sylvia

AU - Wiuf, Carsten

PY - 2007/11/1

Y1 - 2007/11/1

N2 - Gene duplication with subsequent interaction divergence is one of the primary driving forces in the evolution of genetic systems. Yet little is known about the precise mechanisms and the role of duplication divergence in the evolution of protein networks from the prokaryote and eukaryote domains. We developed a novel, model-based approach for Bayesian inference on biological network data that centres on approximate Bayesian computation, or likelihood-free inference. Instead of computing the intractable likelihood of the protein network topology, our method summarizes key features of the network and, based on these, uses a MCMC algorithm to approximate the posterior distribution of the model parameters. This allowed us to reliably fit a flexible mixture model that captures hallmarks of evolution by gene duplication and subfunctionalization to protein interaction network data of Helicobacter pylori and Plasmodium falciparum. The 80% credible intervals for the duplication-divergence component are [0.64, 0.98] for H. pylori and [0.87, 0.99] for P. falciparum. The remaining parameter estimates are not inconsistent with sequence data. An extensive sensitivity analysis showed that incompleteness of PIN data does not largely affect the analysis of models of protein network evolution, and that the degree sequence alone barely captures the evolutionary footprints of protein networks relative to other statistics. Our likelihood-free inference approach enables a fully Bayesian analysis of a complex and highly stochastic system that is otherwise intractable at present. Modelling the evolutionary history of PIN data, it transpires that only the simultaneous analysis of several global aspects of protein networks enables credible and consistent inference to be made from available datasets. Our results indicate that gene duplication has played a larger part in the network evolution of the eukaryote than in the prokaryote, and suggests that single gene duplications with immediate divergence alone may explain more than 60% of biological network data in both domains.

AB - Gene duplication with subsequent interaction divergence is one of the primary driving forces in the evolution of genetic systems. Yet little is known about the precise mechanisms and the role of duplication divergence in the evolution of protein networks from the prokaryote and eukaryote domains. We developed a novel, model-based approach for Bayesian inference on biological network data that centres on approximate Bayesian computation, or likelihood-free inference. Instead of computing the intractable likelihood of the protein network topology, our method summarizes key features of the network and, based on these, uses a MCMC algorithm to approximate the posterior distribution of the model parameters. This allowed us to reliably fit a flexible mixture model that captures hallmarks of evolution by gene duplication and subfunctionalization to protein interaction network data of Helicobacter pylori and Plasmodium falciparum. The 80% credible intervals for the duplication-divergence component are [0.64, 0.98] for H. pylori and [0.87, 0.99] for P. falciparum. The remaining parameter estimates are not inconsistent with sequence data. An extensive sensitivity analysis showed that incompleteness of PIN data does not largely affect the analysis of models of protein network evolution, and that the degree sequence alone barely captures the evolutionary footprints of protein networks relative to other statistics. Our likelihood-free inference approach enables a fully Bayesian analysis of a complex and highly stochastic system that is otherwise intractable at present. Modelling the evolutionary history of PIN data, it transpires that only the simultaneous analysis of several global aspects of protein networks enables credible and consistent inference to be made from available datasets. Our results indicate that gene duplication has played a larger part in the network evolution of the eukaryote than in the prokaryote, and suggests that single gene duplications with immediate divergence alone may explain more than 60% of biological network data in both domains.

UR - http://www.scopus.com/inward/record.url?scp=36949017519&partnerID=8YFLogxK

U2 - 10.1371/journal.pcbi.0030230

DO - 10.1371/journal.pcbi.0030230

M3 - Journal article

C2 - 18052538

AN - SCOPUS:36949017519

VL - 3

SP - 2266

EP - 2278

JO - P L o S Computational Biology (Online)

JF - P L o S Computational Biology (Online)

SN - 1553-734X

IS - 11

ER -

ID: 203904457