Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.

Abstract:

:Sampling across tree space is one of the major challenges in Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) algorithms. Standard MCMC tree moves consider small random perturbations of the topology, and select from candidate trees at random or based on the distance between the old and new topologies. MCMC algorithms using such moves tend to get trapped in tree space, making them slow in finding the globally most probable trees (known as "convergence") and in estimating the correct proportions of the different types of them (known as "mixing"). Here, we introduce a new class of moves, which propose trees based on their parsimony scores. The proposal distribution derived from the parsimony scores is a quickly computable albeit rough approximation of the conditional posterior distribution over candidate trees. We demonstrate with simulations that parsimony-guided moves correctly sample the uniform distribution of topologies from the prior. We then evaluate their performance against standard moves using six challenging empirical data sets, for which we were able to obtain accurate reference estimates of the posterior using long MCMC runs, a mix of topology proposals, and Metropolis coupling. On these data sets, ranging in size from 357 to 934 taxa and from 1740 to 5681 sites, we find that single chains using parsimony-guided moves usually converge an order of magnitude faster than chains using standard moves. They also exhibit better mixing, that is, they cover the most probable trees more quickly. Our results show that tree moves based on quick and dirty estimates of the posterior probability can significantly outperform standard moves. Future research will have to show to what extent the performance of such moves can be improved further by finding better ways of approximating the posterior probability, taking the trade-off between accuracy and speed into account. [Bayesian phylogenetic inference; MCMC; parsimony; tree proposal.].

journal_name

Syst Biol

journal_title

Systematic biology

authors

Zhang C,Huelsenbeck JP,Ronquist F

doi

10.1093/sysbio/syaa002

subject

Has Abstract

pub_date

2020-09-01 00:00:00

pages

1016-1032

issue

5

eissn

1063-5157

issn

1076-836X

pii

5716338

journal_volume

69

pub_type

杂志文章
  • Incomplete Lineage Sorting in Mammalian Phylogenomics.

    abstract::The impact of incomplete lineage sorting (ILS) on phylogenetic conflicts among genes, and the related issue of whether to account for ILS in species tree reconstruction, are matters of intense controversy. Here, focusing on full-genome data in placental mammals, we empirically test two assumptions underlying current u...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw082

    authors: Scornavacca C,Galtier N

    更新日期:2017-01-01 00:00:00

  • How the worm got its pharynx: phylogeny, classification and Bayesian assessment of character evolution in Acoela.

    abstract::Acoela are marine microscopic worms currently thought to be the sister taxon of all other bilaterians. Acoels have long been used as models in evolutionary scenarios, and generalized conclusions about acoel and bilaterian ancestral features are frequently drawn from studies of single acoel species. There is no extensi...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr073

    authors: Jondelius U,Wallberg A,Hooge M,Raikova OI

    更新日期:2011-12-01 00:00:00

  • Phylogenomics of piranhas and pacus (Serrasalmidae) uncovers how dietary convergence and parallelism obfuscate traditional morphological taxonomy.

    abstract::The Amazon and neighboring South American river basins harbor the world's most diverse assemblages of freshwater fishes. One of the most prominent South American fish families is the Serrasalmidae (pacus and piranhas), found in nearly every continental basin. Serrasalmids are keystone ecological taxa, being some of th...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa065

    authors: Kolmann MA,Hughes LC,Hernandez LP,Arcila D,Betancur-R R,Sabaj MH,López-Fernández H,Ortí G

    更新日期:2020-08-12 00:00:00

  • Congruence and conflict in the higher-level phylogenetics of squamate reptiles: an expanded phylogenomic perspective.

    abstract::Genome-scale data have the potential to clarify phylogenetic relationships across the tree of life, but have also revealed extensive gene tree conflict. This seeming paradox, whereby larger datasets both increase statistical confidence and uncover significant discordance, suggests that understanding sources of conflic...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa054

    authors: Singhal S,Colston TJ,Grundler MR,Smith SA,Costa GC,Colli GR,Moritz C,Pyron RA,Rabosky DL

    更新日期:2020-07-18 00:00:00

  • Radiation of extant cetaceans driven by restructuring of the oceans.

    abstract::The remarkable fossil record of whales and dolphins (Cetacea) has made them an exemplar of macroevolution. Although their overall adaptive transition from terrestrial to fully aquatic organisms is well known, this is not true for the radiation of modern whales. Here, we explore the diversification of extant cetaceans ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp060

    authors: Steeman ME,Hebsgaard MB,Fordyce RE,Ho SY,Rabosky DL,Nielsen R,Rahbek C,Glenner H,Sørensen MV,Willerslev E

    更新日期:2009-12-01 00:00:00

  • Transformationalism, taxism, and developmental biology in systematics.

    abstract::Issues concerning transformational and taxic comparisons are central to understanding the impact of the recent proliferation of molecular developmental data on evolutionary biology. More importantly, an understanding of taxism and transformationalism in comparative biology is critical to assessing the impact of the re...

    journal_title:Systematic biology

    pub_type: 杂志文章,评审

    doi:10.1080/10635150050207366

    authors: Bang R,DeSalle R,Wheeler W

    更新日期:2000-03-01 00:00:00

  • The effect of phylogeny on interspecific body shape variation in darters (Pisces: Percidae).

    abstract::We conducted a geometric morphometric analysis of interspecific body shape variation among representatives of 31 species of darters (Pisces: Percidae) to determine whether there is evidence of a phylogenetic effect in body shape variation. Cartesian transformation grids representing relative shape differences of indiv...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150390197019

    authors: Guill JM,Heins DC,Hood CS

    更新日期:2003-08-01 00:00:00

  • How Should Genes and Taxa be Sampled for Phylogenomic Analyses with Missing Data? An Empirical Study in Iguanian Lizards.

    abstract::Targeted sequence capture is becoming a widespread tool for generating large phylogenomic data sets to address difficult phylogenetic problems. However, this methodology often generates data sets in which increasing the number of taxa and loci increases amounts of missing data. Thus, a fundamental (but still unresolve...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syv058

    authors: Streicher JW,Schulte JA 2nd,Wiens JJ

    更新日期:2016-01-01 00:00:00

  • Untangling complex histories of genome mergings in high polyploids.

    abstract::Polyploidy, the duplication of entire genomes, plays a major role in plant evolution. In allopolyploids, genome duplication is associated with hybridization between two or more divergent genomes. Successive hybridization and polyploidization events can build up species complexes of allopolyploids with complicated netw...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150701424553

    authors: Brysting AK,Oxelman B,Huber KT,Moulton V,Brochmann C

    更新日期:2007-06-01 00:00:00

  • Testing for Independence between Evolutionary Processes.

    abstract::Evolutionary events co-occurring along phylogenetic trees usually point to complex adaptive phenomena, sometimes implicating epistasis. While a number of methods have been developed to account for co-occurrence of events on the same internal or external branch of an evolutionary tree, there is a need to account for th...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw004

    authors: Behdenna A,Pothier J,Abby SS,Lambert A,Achaz G

    更新日期:2016-09-01 00:00:00

  • Modeling Phylogenetic Biome Shifts on a Planet with a Past.

    abstract::The spatial distribution of biomes has changed considerably over deep time, so the geographical opportunity for an evolutionary lineage to shift into a new biome may depend on how the availability and connectivity of biomes has varied temporally. To better understand how lineages shift between biomes in space and time...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa045

    authors: Landis M,Edwards EJ,Donoghue MJ

    更新日期:2021-01-01 00:00:00

  • Optimal selection of gene and ingroup taxon sampling for resolving phylogenetic relationships.

    abstract::A controversial topic that underlies much of phylogenetic experimental design is the relative utility of increased taxonomic versus character sampling. Conclusions about the relative utility of adding characters or taxa to a current phylogenetic study have subtly hinged upon the appropriateness of the rate of evolutio...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syq025

    authors: Townsend JP,Lopez-Giraldez F

    更新日期:2010-07-01 00:00:00

  • Tracing the decay of the historical signal in biological sequence data.

    abstract::Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invaria...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150490503035

    authors: Ho SY,Jermiin L

    更新日期:2004-08-01 00:00:00

  • Biogeographic interpretation of splits graphs: least squares optimization of branch lengths.

    abstract::Although most often used to represent phylogenetic uncertainty, network methods are also potentially useful for describing the phylogenetic complexity expected to characterize recent species radiations. One network method with particular advantages in this context is split decomposition. However, in its standard imple...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150590906046

    authors: Winkworth R,Bryant D,Lockhart P,Havell D,Moulton V

    更新日期:2005-02-01 00:00:00

  • A novel test for host-symbiont codivergence indicates ancient origin of fungal endophytes in grasses.

    abstract::Significant phylogenetic codivergence between plant or animal hosts (H) and their symbionts or parasites (P) indicates the importance of their interactions on evolutionary time scales. However, valid and realistic methods to test for codivergence are not fully developed. One of the systems where possible codivergence ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150802172184

    authors: Schardl CL,Craven KD,Speakman S,Stromberg A,Lindstrom A,Yoshida R

    更新日期:2008-06-01 00:00:00

  • Partial sequence homogenization in the 5S multigene families may generate sequence chimeras and spurious results in phylogenetic reconstructions.

    abstract::Multigene families have provided opportunities for evolutionary biologists to assess molecular evolution processes and phylogenetic reconstructions at deep and shallow systematic levels. However, the use of these markers is not free of technical and analytical challenges. Many evolutionary studies that used the nuclea...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syt101

    authors: Galián JA,Rosato M,Rosselló JA

    更新日期:2014-03-01 00:00:00

  • Phylogeny imbalance: taxonomic level matters.

    abstract::Two lines of evidence indicate that the degree of symmetry in phylogenetic topologies differs at different hierarchical levels. First, in a set of 61 phylogenies with superspecific taxa as their terminals, trees were on average more unbalanced (asymmetric) when the species richness of terminals was considered than whe...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150290102546

    authors: Purvis A,Agapow PM

    更新日期:2002-12-01 00:00:00

  • Integration of Anatomy Ontologies and Evo-Devo Using Structured Markov Models Suggests a New Framework for Modeling Discrete Phenotypic Traits.

    abstract::Modeling discrete phenotypic traits for either ancestral character state reconstruction or morphology-based phylogenetic inference suffers from ambiguities of character coding, homology assessment, dependencies, and selection of adequate models. These drawbacks occur because trait evolution is driven by two key proces...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syz005

    authors: Tarasov S

    更新日期:2019-09-01 00:00:00

  • Rumbling Orchids: How To Assess Divergent Evolution Between Chloroplast Endosymbionts and the Nuclear Host.

    abstract::Phylogenetic relationships inferred from multilocus organellar and nuclear DNA data are often difficult to resolve because of evolutionary conflicts among gene trees. However, conflicting or "outlier" associations (i.e., linked pairs of "operational terminal units" in two phylogenies) among these data sets often provi...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syv070

    authors: Pérez-Escobar OA,Balbuena JA,Gottschling M

    更新日期:2016-01-01 00:00:00

  • Split support and split conflict randomization tests in phylogenetic inference.

    abstract::Randomization tests allow the formulation and statistical testing of null hypotheses about the quality of entire data sets or the quality of fit between the data and particular phylogenetic hypotheses. Randomization tests of phylogenetic hypotheses based on the concepts of split support and split conflict are describe...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351598260662

    authors: Wilkinson M

    更新日期:1998-12-01 00:00:00

  • Bacterial species and speciation.

    abstract::Bacteria are profoundly different from eukaryotes in their patterns of genetic exchange. Nevertheless, ecological diversity is organized in the same way across all of life: individual organisms fall into more less discrete clusters on the basis of their phenotypic, ecological, and DNA sequence characteristics. Each se...

    journal_title:Systematic biology

    pub_type: 杂志文章,评审

    doi:10.1080/10635150118398

    authors: Cohan FM

    更新日期:2001-08-01 00:00:00

  • Phylogenetic relationships of fig wasps pollinating functionally dioecious Ficus based on mitochondrial DNA sequences and morphology.

    abstract::The obligate mutualism between pollinating fig wasps in the family Agaonidae (Hymenoptera: Chalcidoidea) and Ficus species (Moraceae) is often regarded as an example of co-evolution but little is known about the history of the interaction, and understanding the origin of functionally dioecious fig pollination has been...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:

    authors: Weiblen GD

    更新日期:2001-04-01 00:00:00

  • Molecular and morphological phylogenetics of weevils (coleoptera, curculionoidea): do niche shifts accompany diversification?

    abstract::The main goals of this study were to provide a robust phylogeny for the families of the superfamily Curculionoidea, to discover relationships and major natural groups within the family Curculionidae, and to clarify the evolution of larval habits and host-plant associations in weevils to analyze their role in weevil di...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150290102465

    authors: Marvaldi AE,Sequeira AS,O'Brien CW,Farrell BD

    更新日期:2002-10-01 00:00:00

  • Identifying hidden rate changes in the evolution of a binary morphological character: the evolution of plant habit in campanulid angiosperms.

    abstract::The growth of phylogenetic trees in scope and in size is promising from the standpoint of understanding a wide variety of evolutionary patterns and processes. With trees comprised of larger, older, and globally distributed clades, it is likely that the lability of a binary character will differ significantly among lin...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syt034

    authors: Beaulieu JM,O'Meara BC,Donoghue MJ

    更新日期:2013-09-01 00:00:00

  • Inferring and validating horizontal gene transfer events using bipartition dissimilarity.

    abstract::Horizontal gene transfer (HGT) is one of the main mechanisms driving the evolution of microorganisms. Its accurate identification is one of the major challenges posed by reticulate evolution. In this article, we describe a new polynomial-time algorithm for inferring HGT events and compare 3 existing and 1 new tree com...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp103

    authors: Boc A,Philippe H,Makarenkov V

    更新日期:2010-03-01 00:00:00

  • Hylid frog phylogeny and sampling strategies for speciose clades.

    abstract::How should characters and taxa be sampled to resolve efficiently the phylogeny of ancient and highly speciose groups? We addressed this question empirically in the treefrog family Hylidae, which contains > 800 species and may be nonmonophyletic with respect to other anuran families. We sampled 81 species (54 hylids an...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150500234625

    authors: Wiens JJ,Fetzner JW,Parkinson CL,Reeder TW

    更新日期:2005-10-01 00:00:00

  • Perils of paralogy: using HSP70 genes for inferring organismal phylogenies.

    abstract::Conserved genes have found their way into the mainstream of molecular systematics. Many of these genes are members of multigene families. A difficulty with using single genes of multigene families for phylogenetic inference is that genes from one species may be paralogous to those from another taxon. We focus attentio...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150290069995

    authors: Martin AP,Burg TM

    更新日期:2002-08-01 00:00:00

  • Large-scale phylogenies and measuring the performance of phylogenetic estimators.

    abstract::Performance measures of phylogenetic estimation methods such as accuracy, consistency, and power are an attempt at summarizing an ensemble of a given estimator's behavior. These summaries characterize an ensemble behavior with a single number, leading to a variety of definitions. In particular, the relationships betwe...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351598261021

    authors: Kim J

    更新日期:1998-03-01 00:00:00

  • Phylogenomics, Origin and Diversification of Anthozoans (Phylum Cnidaria).

    abstract::Anthozoan cnidarians (corals and sea anemones) include some of the world's most important foundation species, capable of building massive reef complexes that support entire ecosystems. Although previous molecular phylogenetic analyses have revealed widespread homoplasy of the morphological characters traditionally use...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa103

    authors: McFadden CS,Quattrini AM,Brugler MR,Cowman PF,Dueñas LF,Kitahara MV,Paz-García DA,Reimer JD,Rodríguez E

    更新日期:2021-01-28 00:00:00

  • Multilocus Phylogeny of the Afrotropical Freshwater Crab Fauna Reveals Historical Drainage Connectivity and Transoceanic Dispersal Since the Eocene.

    abstract::Phylogenetic reconstruction, divergence time estimations and ancestral range estimation were undertaken for 66% of the Afrotropical freshwater crab fauna (Potamonautidae) based on four partial DNA loci (12S rRNA, 16S rRNA, cytochrome oxidase one [COI], and histone 3). The present study represents the most comprehensiv...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syv011

    authors: Daniels SR,Phiri EE,Klaus S,Albrecht C,Cumberlidge N

    更新日期:2015-07-01 00:00:00