How Should Genes and Taxa be Sampled for Phylogenomic Analyses with Missing Data? An Empirical Study in Iguanian Lizards.

Abstract:

:Targeted sequence capture is becoming a widespread tool for generating large phylogenomic data sets to address difficult phylogenetic problems. However, this methodology often generates data sets in which increasing the number of taxa and loci increases amounts of missing data. Thus, a fundamental (but still unresolved) question is whether sampling should be designed to maximize sampling of taxa or genes, or to minimize the inclusion of missing data cells. Here, we explore this question for an ancient, rapid radiation of lizards, the pleurodont iguanians. Pleurodonts include many well-known clades (e.g., anoles, basilisks, iguanas, and spiny lizards) but relationships among families have proven difficult to resolve strongly and consistently using traditional sequencing approaches. We generated up to 4921 ultraconserved elements with sampling strategies including 16, 29, and 44 taxa, from 1179 to approximately 2.4 million characters per matrix and approximately 30% to 60% total missing data. We then compared mean branch support for interfamilial relationships under these 15 different sampling strategies for both concatenated (maximum likelihood) and species tree (NJst) approaches (after showing that mean branch support appears to be related to accuracy). We found that both approaches had the highest support when including loci with up to 50% missing taxa (matrices with ~40-55% missing data overall). Thus, our results show that simply excluding all missing data may be highly problematic as the primary guiding principle for the inclusion or exclusion of taxa and genes. The optimal strategy was somewhat different for each approach, a pattern that has not been shown previously. For concatenated analyses, branch support was maximized when including many taxa (44) but fewer characters (1.1 million). For species-tree analyses, branch support was maximized with minimal taxon sampling (16) but many loci (4789 of 4921). We also show that the choice of these sampling strategies can be critically important for phylogenomic analyses, since some strategies lead to demonstrably incorrect inferences (using the same method) that have strong statistical support. Our preferred estimate provides strong support for most interfamilial relationships in this important but phylogenetically challenging group.

journal_name

Syst Biol

journal_title

Systematic biology

authors

Streicher JW,Schulte JA 2nd,Wiens JJ

doi

10.1093/sysbio/syv058

subject

Has Abstract

pub_date

2016-01-01 00:00:00

pages

128-45

issue

1

eissn

1063-5157

issn

1076-836X

pii

syv058

journal_volume

65

pub_type

杂志文章
  • Monophyly, Taxon Sampling, and the Nature of Ranks in the Classification of Orb-Weaving Spiders (Araneae: Araneoidea).

    abstract::We address some of the taxonomic and classification changes proposed by Kuntner et al. (2019) in a comparative study on the evolution of sexual size dimorphism in nephiline spiders. Their proposal to recircumscribe araneids and to rank the subfamily Nephilinae as a family is fundamentally flawed as it renders the fami...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syz043

    authors: Kallal RJ,Dimitrov D,Arnedo MA,Giribet G,Hormiga G

    更新日期:2020-03-01 00:00:00

  • Testing congruence in phylogenomic analysis.

    abstract::Phylogenomic analyses of large sets of genes or proteins have the potential to revolutionize our understanding of the tree of life. However, problems arise because estimated phylogenies from individual loci often differ because of different histories, systematic bias, or stochastic error. We have developed Concaterpil...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150801910436

    authors: Leigh JW,Susko E,Baumgartner M,Roger AJ

    更新日期:2008-02-01 00:00:00

  • An Integrated Model of Phenotypic Trait Changes and Site-Specific Sequence Evolution.

    abstract::Recent years have seen a constant rise in the availability of trait data, including morphological features, ecological preferences, and life history characteristics. These phenotypic data provide means to associate genomic regions with phenotypic attributes, thus allowing the identification of phenotypic traits associ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syx032

    authors: Levy Karin E,Wicke S,Pupko T,Mayrose I

    更新日期:2017-11-01 00:00:00

  • Multistate characters and diet shifts: evolution of Erotylidae (Coleoptera).

    abstract::The dominance of angiosperms has played a direct role in the diversification of insects, especially Coleoptera. The shift to angiosperm feeding from other diets is likely to have increased the rate of speciation in Phytophaga. However, Phytophaga is only one of many hyperdiverse lineages of beetles and studies of host...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150701211844

    authors: Leschen RA,Buckley TR

    更新日期:2007-02-01 00:00:00

  • Large-scale phylogenies and measuring the performance of phylogenetic estimators.

    abstract::Performance measures of phylogenetic estimation methods such as accuracy, consistency, and power are an attempt at summarizing an ensemble of a given estimator's behavior. These summaries characterize an ensemble behavior with a single number, leading to a variety of definitions. In particular, the relationships betwe...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351598261021

    authors: Kim J

    更新日期:1998-03-01 00:00:00

  • Placing paleopolyploidy in relation to taxon divergence: a phylogenetic analysis in legumes using 39 gene families.

    abstract::Young polyploid events are easily diagnosed by various methods, but older polyploid events become increasingly difficult to identify as chromosomal rearrangements, tandem gene or partial chromosome duplications, changes in substitution rates among duplicated genes, pseudogenization or locus loss, and interlocus intera...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150590945359

    authors: Pfeil BE,Schlueter JA,Shoemaker RC,Doyle JJ

    更新日期:2005-06-01 00:00:00

  • Bayesian tests of topology hypotheses with an example from diving beetles.

    abstract::We review Bayesian approaches to model testing in general and to the assessment of topological hypotheses in particular. We show that the standard way of setting up Bayes factor tests of the monophyly of a group, or the placement of a sample sequence in a known reference tree, can be misleading. The reason for this is...

    journal_title:Systematic biology

    pub_type: 杂志文章,评审

    doi:10.1093/sysbio/syt029

    authors: Bergsten J,Nilsson AN,Ronquist F

    更新日期:2013-09-01 00:00:00

  • Testing hybridization hypotheses based on incongruent gene trees.

    abstract::Hybridization is an important evolutionary mechanism in plants and has been increasingly documented in animals. Difficulty in reconstruction of reticulate evolution, however, has been a long-standing problem in phylogenetics. Consequently, hybrid speciation may play a major role in causing topological incongruence bet...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635159950127321

    authors: Sang T,Zhong Y

    更新日期:2000-09-01 00:00:00

  • Cryptic Patterns of Speciation in Cryptic Primates: Microendemic Mouse Lemurs and the Multispecies Coalescent.

    abstract::Mouse lemurs (Microcebus) are a radiation of morphologically cryptic primates distributed throughout Madagascar for which the number of recognized species has exploded in the past two decades. This taxonomic revision has prompted understandable concern that there has been substantial oversplitting in the mouse lemur c...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa053

    authors: Poelstra J,Salmona J,Tiley GP,Schüßler D,Blanco MB,Andriambeloson JB,Bouchez O,Campbell CR,Etter PD,Hohenlohe PA,Hunnicutt KE,Iribar A,Johnson EA,Kappeler PM,Larsen PA,Manzi S,Ralison JM,Randrianambinina B,Rasoloariso

    更新日期:2020-07-08 00:00:00

  • Efficient exploration of the space of reconciled gene trees.

    abstract::Gene trees record the combination of gene-level events, such as duplication, transfer and loss (DTL), and species-level events, such as speciation and extinction. Gene tree-species tree reconciliation methods model these processes by drawing gene trees into the species tree using a series of gene and species-level eve...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syt054

    authors: Szöllõsi GJ,Rosikiewicz W,Boussau B,Tannier E,Daubin V

    更新日期:2013-11-01 00:00:00

  • Why the phylogenetic regression appears robust to tree misspecification.

    abstract::The phylogenetic comparative method uses estimates of evolutionary relationships to explicitly model the covariance structure of interspecific data. By accounting for common ancestry, the coevolution between 2 or more traits, as a response to one another or to environmental variables, can be studied without confoundin...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syq098

    authors: Stone EA

    更新日期:2011-05-01 00:00:00

  • Is congruence between data partitions a reliable predictor of phylogenetic accuracy? Empirically testing an iterative procedure for choosing among phylogenetic methods.

    abstract::The relationship between phylogenetic accuracy and congruence between data partitions collected from the same taxa was explored for mitochondrial DNA sequences from two well-supported vertebrate phylogenies. An iterative procedure was adopted whereby accuracy, phylogenetic signal, and congruence were measured before a...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/46.3.464

    authors: Cunningham CW

    更新日期:1997-09-01 00:00:00

  • How the worm got its pharynx: phylogeny, classification and Bayesian assessment of character evolution in Acoela.

    abstract::Acoela are marine microscopic worms currently thought to be the sister taxon of all other bilaterians. Acoels have long been used as models in evolutionary scenarios, and generalized conclusions about acoel and bilaterian ancestral features are frequently drawn from studies of single acoel species. There is no extensi...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr073

    authors: Jondelius U,Wallberg A,Hooge M,Raikova OI

    更新日期:2011-12-01 00:00:00

  • Comparison of methods for species-tree inference in the sawfly genus Neodiprion (Hymenoptera: Diprionidae).

    abstract::Conifer-feeding sawflies in the genus Neodiprion provide an excellent opportunity to investigate the origin and maintenance of barriers to reproduction, but obtaining a phylogenetic estimate for comparative studies of Neodiprion speciation has proved difficult. Specifically, nonmonophyly within and discordance between...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150802580949

    authors: Linnen CR,Farrell BD

    更新日期:2008-12-01 00:00:00

  • Toward an integrated system of clade names.

    abstract::Although the proposition that higher taxa should correspond to clades is widely accepted, current nomenclature does not distinguish clearly between different clades in nested series. In particular, the same name is often applied to a total clade, its crown clade, and clades originating with various nodes, branches, an...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150701656378

    authors: de Queiroz K

    更新日期:2007-12-01 00:00:00

  • Bacterial species and speciation.

    abstract::Bacteria are profoundly different from eukaryotes in their patterns of genetic exchange. Nevertheless, ecological diversity is organized in the same way across all of life: individual organisms fall into more less discrete clusters on the basis of their phenotypic, ecological, and DNA sequence characteristics. Each se...

    journal_title:Systematic biology

    pub_type: 杂志文章,评审

    doi:10.1080/10635150118398

    authors: Cohan FM

    更新日期:2001-08-01 00:00:00

  • Novel versus unsupported clades: assessing the qualitative support for clades in MRP supertrees.

    abstract::Matrix representation with parsimony (MRP) supertree construction has been criticized because the supertree may specify clades that are contradicted by every source tree contributing to it. Such unsupported clades may also occur using other supertree methods; however, their incidence is largely unknown. In this study,...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:

    authors: Bininda-Emonds OR

    更新日期:2003-12-01 00:00:00

  • Molecular phylogenetics and evolution of maternal care in Membracine treehoppers.

    abstract::The treehopper subfamily Membracinae (Insecta: Hemiptera: Membracidae) comprises the majority of genera and species diversity in the New World tropics. These treehoppers exhibit a wide range of social behaviors, making them an excellent group for studying patterns of social evolution in insects. However, to date the t...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150490445869

    authors: Lin CP,Danforth BN,Wood TK

    更新日期:2004-06-01 00:00:00

  • More characters or more taxa for a robust phylogeny--case study from the coffee family (Rubiaceae).

    abstract::Using different data sets mainly from the plant family Rubiaceae, but in parts also from the Apocynaceae, Asteraceae, Lardizabalaceae, Saxifragaceae, and Solanaceae, we have investigated the effect of number of characters, number of taxa, and kind of data on bootstrap values within phylogenetic trees. The percentage o...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351599260085

    authors: Bremer B,Jansen RK,Oxelman B,Backlund M,Lantz H,Kim KJ

    更新日期:1999-09-01 00:00:00

  • Simultaneously mapping and superimposing landmark configurations with parsimony as optimality criterion.

    abstract::All methods proposed to date for mapping landmark configurations on a phylogenetic tree start from an alignment generated by methods that make no use of phylogenetic information, usually by superimposing all configurations against a consensus configuration. In order to properly interpret differences between landmark c...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr119

    authors: Catalano SA,Goloboff PA

    更新日期:2012-05-01 00:00:00

  • Multilocus Phylogeny of the Afrotropical Freshwater Crab Fauna Reveals Historical Drainage Connectivity and Transoceanic Dispersal Since the Eocene.

    abstract::Phylogenetic reconstruction, divergence time estimations and ancestral range estimation were undertaken for 66% of the Afrotropical freshwater crab fauna (Potamonautidae) based on four partial DNA loci (12S rRNA, 16S rRNA, cytochrome oxidase one [COI], and histone 3). The present study represents the most comprehensiv...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syv011

    authors: Daniels SR,Phiri EE,Klaus S,Albrecht C,Cumberlidge N

    更新日期:2015-07-01 00:00:00

  • Diversification, Introgression, and Rampant Cytonuclear Discordance in Rocky Mountains Chipmunks (Sciuridae: Tamias).

    abstract::Evidence from natural systems suggests that hybridization between animal species is more common than traditionally thought, but the overall contribution of introgression to standing genetic variation within species remains unclear for most animal systems. Here, we use targeted exon-capture to sequence thousands of nuc...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa085

    authors: Sarver BAJ,Herrera ND,Sneddon D,Hunter SS,Settles ML,Kronenberg Z,Demboski JR,Good JM,Sullivan J

    更新日期:2021-01-07 00:00:00

  • The comparative method is not macroevolution: across-species evidence for within-species process.

    abstract::It is common for studies that employ the comparative method for the study of adaptation, i.e. documentation of potentially adaptive across-species patterns of trait-environment or trait-trait correlation, to be designated as "macroevolutionary." Authors are justified in using "macroevolution" in this way by appeal to ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa086

    authors: Olson ME

    更新日期:2021-01-07 00:00:00

  • The effect of phylogeny on interspecific body shape variation in darters (Pisces: Percidae).

    abstract::We conducted a geometric morphometric analysis of interspecific body shape variation among representatives of 31 species of darters (Pisces: Percidae) to determine whether there is evidence of a phylogenetic effect in body shape variation. Cartesian transformation grids representing relative shape differences of indiv...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150390197019

    authors: Guill JM,Heins DC,Hood CS

    更新日期:2003-08-01 00:00:00

  • Increased congruence does not necessarily indicate increased phylogenetic accuracy--the behavior of the incongruence length difference test in mixed-model analyses.

    abstract::Comprehensive phylogenetic analyses utilize data from distinct sources, including nuclear, mitochondrial, and plastid molecular sequences and morphology. Such heterogeneous datasets are likely to require distinct models of analysis, given the different histories of mutational biases operating on these characters. The ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351502753475853

    authors: Dowton M,Austin AD

    更新日期:2002-02-01 00:00:00

  • The inference of gene trees with species trees.

    abstract::This article reviews the various models that have been used to describe the relationships between gene trees and species trees. Molecular phylogeny has focused mainly on improving models for the reconstruction of gene trees based on sequence alignments. Yet, most phylogeneticists seek to reveal the history of species....

    journal_title:Systematic biology

    pub_type: 杂志文章,评审

    doi:10.1093/sysbio/syu048

    authors: Szöllősi GJ,Tannier E,Daubin V,Boussau B

    更新日期:2015-01-01 00:00:00

  • Measuring Stratigraphic Congruence Across Trees, Higher Taxa, and Time.

    abstract::The congruence between the order of cladistic branching and the first appearance dates of fossil lineages can be quantified using a variety of indices. Good matching is a prerequisite for the accurate time calibration of trees, while the distribution of congruence indices across large samples of cladograms has underpi...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw039

    authors: O'Connor A,Wills MA

    更新日期:2016-09-01 00:00:00

  • Congruence and conflict in the higher-level phylogenetics of squamate reptiles: an expanded phylogenomic perspective.

    abstract::Genome-scale data have the potential to clarify phylogenetic relationships across the tree of life, but have also revealed extensive gene tree conflict. This seeming paradox, whereby larger datasets both increase statistical confidence and uncover significant discordance, suggests that understanding sources of conflic...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa054

    authors: Singhal S,Colston TJ,Grundler MR,Smith SA,Costa GC,Colli GR,Moritz C,Pyron RA,Rabosky DL

    更新日期:2020-07-18 00:00:00

  • Phylogeny of Eunicida (Annelida) and exploring data congruence using a partition addition bootstrap alteration (PABA) approach.

    abstract::Even though relationships within Annelida are poorly understood, Eunicida is one of only a few major annelid lineages well supported by morphology. The seven recognized eunicid families possess sclerotized jaws that include mandibles and a maxillary apparatus. The maxillary apparatuses vary in shape and number of elem...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150500354910

    authors: Struck TH,Purschke G,Halanych KM

    更新日期:2006-02-01 00:00:00

  • Testing for Independence between Evolutionary Processes.

    abstract::Evolutionary events co-occurring along phylogenetic trees usually point to complex adaptive phenomena, sometimes implicating epistasis. While a number of methods have been developed to account for co-occurrence of events on the same internal or external branch of an evolutionary tree, there is a need to account for th...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw004

    authors: Behdenna A,Pothier J,Abby SS,Lambert A,Achaz G

    更新日期:2016-09-01 00:00:00