Efficient exploration of the space of reconciled gene trees.

Abstract:

Gene trees record the combination of gene-level events, such as duplication, transfer and loss (DTL), and species-level events, such as speciation and extinction. Gene tree-species tree reconciliation methods model these processes by drawing gene trees into the species tree using a series of gene and species-level events. The reconstruction of gene trees based on sequence alone almost always involves choosing between statistically equivalent or weakly distinguishable relationships that could be much better resolved based on a putative species tree. To exploit this potential for accurate reconstruction of gene trees, the space of reconciled gene trees must be explored according to a joint model of sequence evolution and gene tree-species tree reconciliation. Here we present amalgamated likelihood estimation (ALE), a probabilistic approach to exhaustively explore all reconciled gene trees that can be amalgamated as a combination of clades observed in a sample of gene trees. We implement the ALE approach in the context of a reconciliation model (Szöllősi et al. 2013), which allows for the DTL of genes. We use ALE to efficiently approximate the sum of the joint likelihood over amalgamations and to find the reconciled gene tree that maximizes the joint likelihood among all such trees. We demonstrate using simulations that gene trees reconstructed using the joint likelihood are substantially more accurate than those reconstructed using sequence alone. Using realistic gene tree topologies, branch lengths, and alignment sizes, we demonstrate that ALE produces more accurate gene trees even if the model of sequence evolution is greatly simplified. Finally, examining 1099 gene families from 36 cyanobacterial genomes, we find that joint likelihood-based inference results in a striking reduction in apparent phylogenetic discord, with 24%, 59%, and 46% reductions in the mean numbers of duplications, transfers, and losses per gene family, respectively.
The open source implementation of ALE is available from https://github.com/ssolo/ALE.git.
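
The idea of amalgamating clades observed in a sample of gene trees can be illustrated with a small sketch. The code below is not taken from the ALE implementation; it assumes rooted binary trees written as nested tuples and scores a tree by conditional clade probabilities (the product, over internal nodes, of how often a clade was split in a given way divided by how often the clade was seen). The function names `clades`, `count_sample`, and `ccp` are hypothetical and chosen only for this illustration.

```python
from collections import Counter

def clades(tree):
    """Return (leaf set, list of splits) for a rooted binary tree.

    A tree is a leaf name (str) or a pair (left, right) of trees.
    Each split is (parent clade, frozenset of its two child clades).
    """
    if isinstance(tree, str):
        return frozenset([tree]), []
    left, right = tree
    lset, lsplits = clades(left)
    rset, rsplits = clades(right)
    parent = lset | rset
    split = (parent, frozenset([lset, rset]))
    return parent, lsplits + rsplits + [split]

def count_sample(sample):
    """Count how often each clade and each split occurs in the sample."""
    clade_counts, split_counts = Counter(), Counter()
    for tree in sample:
        _, splits = clades(tree)
        for parent, children in splits:
            clade_counts[parent] += 1
            split_counts[(parent, children)] += 1
    return clade_counts, split_counts

def ccp(tree, clade_counts, split_counts):
    """Conditional clade probability of a tree given the sample counts."""
    _, splits = clades(tree)
    p = 1.0
    for parent, children in splits:
        if clade_counts[parent] == 0:
            return 0.0  # the tree uses a clade never observed in the sample
        p *= split_counts[(parent, children)] / clade_counts[parent]
    return p

# Two sampled trees that agree on the root split but differ within each half.
t1 = ((("A", "B"), "C"), (("D", "E"), "F"))
t2 = ((("A", "C"), "B"), (("D", "F"), "E"))
clade_counts, split_counts = count_sample([t1, t2])

# A tree absent from the sample, built from the left half of t1
# and the right half of t2.
novel = ((("A", "B"), "C"), (("D", "F"), "E"))
# A tree containing the clade {A, D}, which never occurs in the sample.
unseen = ((("A", "D"), "C"), (("B", "E"), "F"))

print(ccp(t1, clade_counts, split_counts))
print(ccp(novel, clade_counts, split_counts))
print(ccp(unseen, clade_counts, split_counts))
```

In this toy sample the tree `novel` never appears, yet it receives the same nonzero score as the sampled trees because every one of its clades was observed; a tree with an unobserved clade scores zero. This is the sense in which a modest sample of trees can stand in for a much larger set of amalgamated trees.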

journal_name

Syst Biol

journal_title

Systematic biology

authors

Szöllősi GJ,Rosikiewicz W,Boussau B,Tannier E,Daubin V

doi

10.1093/sysbio/syt054

subject

Has Abstract

pub_date

2013-11-01 00:00:00

pages

901-12

issue

6

eissn

1076-836X

issn

1063-5157

pii

syt054

journal_volume

62

pub_type

Journal Article
  • Phylogeny of Eunicida (Annelida) and exploring data congruence using a partition addition bootstrap alteration (PABA) approach.

    abstract::Even though relationships within Annelida are poorly understood, Eunicida is one of only a few major annelid lineages well supported by morphology. The seven recognized eunicid families possess sclerotized jaws that include mandibles and a maxillary apparatus. The maxillary apparatuses vary in shape and number of elem...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150500354910

    authors: Struck TH,Purschke G,Halanych KM

    update_date:2006-02-01 00:00:00

  • The Phylogeny of Rickettsia Using Different Evolutionary Signatures: How Tree-Like is Bacterial Evolution?

    abstract::Rickettsia is a genus of intracellular bacteria whose hosts and transmission strategies are both impressively diverse, and this is reflected in a highly dynamic genome. Some previous studies have described the evolutionary history of Rickettsia as non-tree-like, due to incongruity between phylogenetic reconstructions ...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syv084

    authors: Murray GG,Weinert LA,Rhule EL,Welch JJ

    update_date:2016-03-01 00:00:00

  • Novel versus unsupported clades: assessing the qualitative support for clades in MRP supertrees.

    abstract::Matrix representation with parsimony (MRP) supertree construction has been criticized because the supertree may specify clades that are contradicted by every source tree contributing to it. Such unsupported clades may also occur using other supertree methods; however, their incidence is largely unknown. In this study,...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:

    authors: Bininda-Emonds OR

    update_date:2003-12-01 00:00:00

  • Evolutionary rates, divergence dates, and the performance of mitochondrial genes in Bayesian phylogenetic analysis.

    abstract::The mitochondrial genome is one of the most frequently used loci in phylogenetic and phylogeographic analyses, and it is becoming increasingly possible to sequence and analyze this genome in its entirety from diverse taxa. However, sequencing the entire genome is not always desirable or feasible. Which genes should be...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150500541672

    authors: Mueller RL

    update_date:2006-04-01 00:00:00

  • Phylogeny and biogeography of dolichoderine ants: effects of data partitioning and relict taxa on historical inference.

    abstract::Ants (Hymenoptera: Formicidae) are conspicuous organisms in most terrestrial ecosystems, often attaining high levels of abundance and diversity. In this study, we investigate the evolutionary history of a major clade of ants, the subfamily Dolichoderinae, whose species frequently achieve ecological dominance in ant co...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syq012

    authors: Ward PS,Brady SG,Fisher BL,Schultz TR

    update_date:2010-05-01 00:00:00

  • Measuring Stratigraphic Congruence Across Trees, Higher Taxa, and Time.

    abstract::The congruence between the order of cladistic branching and the first appearance dates of fossil lineages can be quantified using a variety of indices. Good matching is a prerequisite for the accurate time calibration of trees, while the distribution of congruence indices across large samples of cladograms has underpi...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syw039

    authors: O'Connor A,Wills MA

    update_date:2016-09-01 00:00:00

  • Swapping Birth and Death: Symmetries and Transformations in Phylodynamic Models.

    abstract::Stochastic birth-death models provide the foundation for studying and simulating evolutionary trees in phylodynamics. A curious feature of such models is that they exhibit fundamental symmetries when the birth and death rates are interchanged. In this article, we first provide intuitive reasons for these known transfo...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syz039

    authors: Stadler T,Steel M

    update_date:2019-09-01 00:00:00

  • Maximum Likelihood Implementation of an Isolation-with-Migration Model for Three Species.

    abstract::We develop a maximum likelihood (ML) method for estimating migration rates between species using genomic sequence data. A species tree is used to accommodate the phylogenetic relationships among three species, allowing for migration between the two sister species, while the third species is used as an out-group. A Mar...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syw063

    authors: Dalquen DA,Zhu T,Yang Z

    update_date:2017-05-01 00:00:00

  • Robustness of compound Dirichlet priors for Bayesian inference of branch lengths.

    abstract::We modified the phylogenetic program MrBayes 3.1.2 to incorporate the compound Dirichlet priors for branch lengths proposed recently by Rannala, Zhu, and Yang (2012. Tail paradox, partial identifiability and influential priors in Bayesian branch length inference. Mol. Biol. Evol. 29:325-335.) as a solution to the prob...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/sys030

    authors: Zhang C,Rannala B,Yang Z

    update_date:2012-10-01 00:00:00

  • Mitochondrial DNA rates and biogeography in European newts (genus Euproctus).

    abstract::Sequence divergence for segments of three mitochondrial DNA (mtDNA) genes encoding the 12S and 16S ribosomal RNA and cytochrome b was examined in newts belonging to the genus Euproctus (E. asper, E. montanus, E. platycephalus) and in three other species belonging to the same family (Salamandridae), Triturus carnifex, ...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/46.1.126

    authors: Caccone A,Milinkovitch MC,Sbordoni V,Powell JR

    update_date:1997-03-01 00:00:00

  • Hylid frog phylogeny and sampling strategies for speciose clades.

    abstract::How should characters and taxa be sampled to resolve efficiently the phylogeny of ancient and highly speciose groups? We addressed this question empirically in the treefrog family Hylidae, which contains > 800 species and may be nonmonophyletic with respect to other anuran families. We sampled 81 species (54 hylids an...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150500234625

    authors: Wiens JJ,Fetzner JW,Parkinson CL,Reeder TW

    update_date:2005-10-01 00:00:00

  • Tracing the decay of the historical signal in biological sequence data.

    abstract::Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invaria...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150490503035

    authors: Ho SY,Jermiin L

    update_date:2004-08-01 00:00:00

  • Effect of nonindependent substitution on phylogenetic accuracy.

    abstract::All current phylogenetic methods assume that DNA substitutions are independent among sites. However, ample empirical evidence suggests that the process of substitution is not independent but is, in fact, temporally and spatially correlated. The robustness of several commonly used phylogenetic methods to the assumption...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/106351599260319

    authors: Huelsenbeck JP,Nielsen R

    update_date:1999-06-01 00:00:00

  • Arrival and diversification of caviomorph rodents and platyrrhine primates in South America.

    abstract::Platyrrhine primates and caviomorph rodents are clades of mammals that colonized South America during its period of isolation from the other continents, between 100 and 3 million years ago (Mya). Until now, no molecular study investigated the timing of the South American colonization by these two lineages with the sam...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150500481390

    authors: Poux C,Chevret P,Huchon D,de Jong WW,Douzery EJ

    update_date:2006-04-01 00:00:00

  • Exploration of Plastid Phylogenomic Conflict Yields New Insights into the Deep Relationships of Leguminosae.

    abstract::Phylogenomic analyses have helped resolve many recalcitrant relationships in the angiosperm tree of life, yet phylogenetic resolution of the backbone of the Leguminosae, one of the largest and most economically and ecologically important families, remains poor due to generally limited molecular data and incomplete tax...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syaa013

    authors: Zhang R,Wang YH,Jin JJ,Stull GW,Bruneau A,Cardoso D,De Queiroz LP,Moore MJ,Zhang SD,Chen SY,Wang J,Li DZ,Yi TS

    update_date:2020-07-01 00:00:00

  • Repeated evolution of dioecy from monoecy in Siparunaceae (Laurales).

    abstract::Siparunaceae comprise Glossocalyx with one species in West Africa and Siparuna with 65 species in the neotropics; all have unisexual flowers, and 15 species are monoecious, 50 dioecious. Parsimony and maximum likelihood analyses of combined nuclear ribosomal ITS and chloroplast trnL-trnF intergenic spacer sequences yi...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/106351501753328820

    authors: Renner SS,Won H

    update_date:2001-09-01 00:00:00

  • Multiple cophylogenetic analyses reveal frequent cospeciation between pelecaniform birds and Pectinopygus lice.

    abstract::Lice in the genus Pectinopygus parasitize a single order of birds (Pelecaniformes). To examine the degree of congruence between the phylogenies of 17 Pectinopygus species and their pelecaniform hosts, sequences from mitochondrial 12S rRNA, 16S rRNA, COI, and nuclear wingless and EF1-alpha genes (2290 nucleotides) and ...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150701311370

    authors: Hughes J,Kennedy M,Johnson KP,Palma RL,Page RD

    update_date:2007-04-01 00:00:00

  • Testing hybridization hypotheses based on incongruent gene trees.

    abstract::Hybridization is an important evolutionary mechanism in plants and has been increasingly documented in animals. Difficulty in reconstruction of reticulate evolution, however, has been a long-standing problem in phylogenetics. Consequently, hybrid speciation may play a major role in causing topological incongruence bet...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635159950127321

    authors: Sang T,Zhong Y

    update_date:2000-09-01 00:00:00

  • Multiple data sets, high homoplasy, and the phylogeny of softshell turtles (Testudines: Trionychidae).

    abstract::We present a phylogenetic hypothesis and novel, rank-free classification for all extant species of softshell turtles (Testudines:Trionychidae). Our data set included DNA sequence data from two mitochondrial protein-coding genes and an approximately 1-kb nuclear intron for 23 of 26 recognized species, and 59 previously ...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150490503053

    authors: Engstrom TN,Shaffer HB,McCord WP

    update_date:2004-10-01 00:00:00

  • Deep phylogeographic structure and environmental differentiation in the carnivorous plant Sarracenia alata.

    abstract::We collected ~29 kb of sequence data using Roche 454 pyrosequencing in order to estimate the timing and pattern of diversification in the carnivorous pitcher plant Sarracenia alata. Utilizing modified protocols for reduced representation library construction, we generated sequence data from 86 individuals across 10 po...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/sys048

    authors: Zellmer AJ,Hanes MM,Hird SM,Carstens BC

    update_date:2012-10-01 00:00:00

  • Biogeography explains cophylogenetic patterns in toucan chewing lice.

    abstract::Historically, comparisons of host and parasite phylogenies have concentrated on cospeciation. However, many of these comparisons have demonstrated that the phylogenies of hosts and parasites are seldom completely congruent, suggesting that phenomena other than cospeciation play an important role in the evolution of ho...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150490265085

    authors: Weckstein JD

    update_date:2004-02-01 00:00:00

  • Evolution of a RNA polymerase gene family in Silene (Caryophyllaceae)-incomplete concerted evolution and topological congruence among paralogues.

    abstract::Four low-copy nuclear DNA intron regions from the second largest subunits of the RNA polymerase gene family (RPA2, RPB2, RPD2a, and RPD2b), the internal transcribed spacers (ITSs) from the nuclear ribosomal regions, and the rps16 intron from the chloroplast were sequenced and used in a phylogenetic analysis of 29 spec...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150490888840

    authors: Popp M,Oxelman B

    update_date:2004-12-01 00:00:00

  • Whole Genome Shotgun Phylogenomics Resolves the Pattern and Timing of Swallowtail Butterfly Evolution.

    abstract::Evolutionary relationships have remained unresolved in many well-studied groups, even though advances in next-generation sequencing and analysis, using approaches such as transcriptomics, anchored hybrid enrichment, or ultraconserved elements, have brought systematics to the brink of whole genome phylogenomics. Recent...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syz030

    authors: Allio R,Scornavacca C,Nabholz B,Clamens AL,Sperling FA,Condamine FL

    update_date:2020-01-01 00:00:00

  • Congruence and conflict in the higher-level phylogenetics of squamate reptiles: an expanded phylogenomic perspective.

    abstract::Genome-scale data have the potential to clarify phylogenetic relationships across the tree of life, but have also revealed extensive gene tree conflict. This seeming paradox, whereby larger datasets both increase statistical confidence and uncover significant discordance, suggests that understanding sources of conflic...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syaa054

    authors: Singhal S,Colston TJ,Grundler MR,Smith SA,Costa GC,Colli GR,Moritz C,Pyron RA,Rabosky DL

    update_date:2020-07-18 00:00:00

  • Paleontology, genomics, and combined-data phylogenetics: can molecular data improve phylogeny estimation for fossil taxa?

    abstract::The genomics revolution offers great promise for resolving the phylogeny of living taxa, but does it offer any benefits for reconstructing relationships among extinct (fossil) taxa? Superficially, the answer would seem to be "no," given that molecular data cannot be obtained for most fossil taxa. However, because foss...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syp012

    authors: Wiens JJ

    update_date:2009-02-01 00:00:00

  • A Skull Might Lie: Modeling Ancestral Ranges and Diet from Genes and Shape of Tree Squirrels.

    abstract::Tropical forests of Central and South America represent hotspots of biological diversity. Tree squirrels of the tribe Sciurini are an excellent model system for the study of tropical biodiversity as these squirrels disperse exceptional distances, and after colonizing the tropics of the Central and South America, they ...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syv054

    authors: Pečnerová P,Moravec JC,Martínková N

    update_date:2015-11-01 00:00:00

  • The Ascomycota tree of life: a phylum-wide phylogeny clarifies the origin and evolution of fundamental reproductive and ecological traits.

    abstract::We present a 6-gene, 420-species maximum-likelihood phylogeny of Ascomycota, the largest phylum of Fungi. This analysis is the most taxonomically complete to date with species sampled from all 15 currently circumscribed classes. A number of superclass-level nodes that have previously evaded resolution and were unnamed...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syp020

    authors: Schoch CL,Sung GH,López-Giráldez F,Townsend JP,Miadlikowska J,Hofstetter V,Robbertse B,Matheny PB,Kauff F,Wang Z,Gueidan C,Andrie RM,Trippe K,Ciufetti LM,Wynns A,Fraker E,Hodkinson BP,Bonito G,Groenewald JZ,Arzanlou

    update_date:2009-04-01 00:00:00

  • The inference of gene trees with species trees.

    abstract::This article reviews the various models that have been used to describe the relationships between gene trees and species trees. Molecular phylogeny has focused mainly on improving models for the reconstruction of gene trees based on sequence alignments. Yet, most phylogeneticists seek to reveal the history of species....

    journal_title:Systematic biology

    pub_type: Journal Article, Review

    doi:10.1093/sysbio/syu048

    authors: Szöllősi GJ,Tannier E,Daubin V,Boussau B

    update_date:2015-01-01 00:00:00

  • How Should Genes and Taxa be Sampled for Phylogenomic Analyses with Missing Data? An Empirical Study in Iguanian Lizards.

    abstract::Targeted sequence capture is becoming a widespread tool for generating large phylogenomic data sets to address difficult phylogenetic problems. However, this methodology often generates data sets in which increasing the number of taxa and loci increases amounts of missing data. Thus, a fundamental (but still unresolve...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1093/sysbio/syv058

    authors: Streicher JW,Schulte JA 2nd,Wiens JJ

    update_date:2016-01-01 00:00:00

  • Untangling complex histories of genome mergings in high polyploids.

    abstract::Polyploidy, the duplication of entire genomes, plays a major role in plant evolution. In allopolyploids, genome duplication is associated with hybridization between two or more divergent genomes. Successive hybridization and polyploidization events can build up species complexes of allopolyploids with complicated netw...

    journal_title:Systematic biology

    pub_type: Journal Article

    doi:10.1080/10635150701424553

    authors: Brysting AK,Oxelman B,Huber KT,Moulton V,Brochmann C

    update_date:2007-06-01 00:00:00