Optimal selection of gene and ingroup taxon sampling for resolving phylogenetic relationships.

Abstract:

:A controversial topic that underlies much of phylogenetic experimental design is the relative utility of increased taxonomic versus character sampling. Conclusions about the relative utility of adding characters or taxa to a current phylogenetic study have subtly hinged upon the appropriateness of the rate of evolution of the characters added for resolution of the phylogeny in question. Clearly, the addition of characters evolving at optimal rates will have much greater impact upon accurate phylogenetic analysis than will the addition of characters with an inappropriate rate of evolution. Development of practical analytical predictions of the asymptotic impact of adding additional taxa would complement computational investigations of the relative utility of these two methods of expanding acquired data. Accordingly, we here formulate a measure of the phylogenetic informativeness of the additional sampling of character states from a new taxon added to the canonical phylogenetic quartet. We derive the optimal rate of evolution for characters assessed in taxa to be sampled and a metric of informativeness based on the rate of evolution of the characters assessed in the new taxon and the distance of the new taxon from the internode of interest. Calculation of the informativeness per base pair of additional character sampling for included taxa versus additional character sampling for novel taxa can be used to estimate cost-effectiveness and optimal efficiency of phylogenetic experimental design. The approach requires estimation of rates of evolution of individual sites based on an alignment of genes orthologous to those to be sequenced, which may be identified in a well-established clade of sister taxa or of related taxa diverging at a deeper phylogenetic scale. Some approximate idea of the potential phylogenetic relationships of taxa to be sequenced is also desirable, such as may be obtained from ribosomal RNA sequence alone. Application to the solution of recalcitrant unresolved nodes in an otherwise well-known phylogeny is the most obvious application. We validate the theory by analysis of its predictions regarding the phylogenetic informativeness for taxon addition of 46 amino acid alignments of 21 fungal taxa. Gene and taxon sampling according to the theory herein and following a "deepest ingroup" heuristic are shown to provide significantly improved resolution of specified deep internodes.

journal_name

Syst Biol

journal_title

Systematic biology

authors

Townsend JP,Lopez-Giraldez F

doi

10.1093/sysbio/syq025

subject

Has Abstract

pub_date

2010-07-01 00:00:00

pages

446-57

issue

4

eissn

1063-5157

issn

1076-836X

pii

syq025

journal_volume

59

pub_type

杂志文章
  • Cryptic Patterns of Speciation in Cryptic Primates: Microendemic Mouse Lemurs and the Multispecies Coalescent.

    abstract::Mouse lemurs (Microcebus) are a radiation of morphologically cryptic primates distributed throughout Madagascar for which the number of recognized species has exploded in the past two decades. This taxonomic revision has prompted understandable concern that there has been substantial oversplitting in the mouse lemur c...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa053

    authors: Poelstra J,Salmona J,Tiley GP,Schüßler D,Blanco MB,Andriambeloson JB,Bouchez O,Campbell CR,Etter PD,Hohenlohe PA,Hunnicutt KE,Iribar A,Johnson EA,Kappeler PM,Larsen PA,Manzi S,Ralison JM,Randrianambinina B,Rasoloariso

    更新日期:2020-07-08 00:00:00

  • Phylogeny of frogs of the Physalaemus pustulosus species group, with an examination of data incongruence.

    abstract::Characters derived from advertisement calls, morphology, allozymes, and the sequences of the small subunit of the mitochondrial ribosomal gene (12S) and the cytochrome oxidase I (COI) mitochondrial gene were used to estimate the phylogeny of frogs of the Physalaemus pustulosus group (Leptodactylidae). The combinabilit...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351598260932

    authors: Cannatella DC,Hillis DM,Chippindale PT,Weigt L,Rand AS,Ryan MJ

    更新日期:1998-06-01 00:00:00

  • Bias in tree searches and its consequences for measuring group supports.

    abstract::When doing a bootstrap analysis with a single tree saved per pseudoreplicate, biased search algorithms may influence support values more than actual properties of the data set. Two methods commonly used for finding phylogenetic trees consist of randomizing the input order of species in multiple addition sequences foll...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syu051

    authors: Goloboff PA,Simmons MP

    更新日期:2014-11-01 00:00:00

  • Molecular and morphological phylogenetics of weevils (coleoptera, curculionoidea): do niche shifts accompany diversification?

    abstract::The main goals of this study were to provide a robust phylogeny for the families of the superfamily Curculionoidea, to discover relationships and major natural groups within the family Curculionidae, and to clarify the evolution of larval habits and host-plant associations in weevils to analyze their role in weevil di...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150290102465

    authors: Marvaldi AE,Sequeira AS,O'Brien CW,Farrell BD

    更新日期:2002-10-01 00:00:00

  • Multilocus phylogenetics of a rapid radiation in the genus Thomomys (Rodentia: Geomyidae).

    abstract::Species complexes undergoing rapid radiation present a challenge in molecular systematics because of the possibility that ancestral polymorphism is retained in component gene trees. Coalescent theory has demonstrated that gene trees often fail to match lineage trees when taxon divergence times are less than the ancest...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150802044011

    authors: Belfiore NM,Liu L,Moritz C

    更新日期:2008-04-01 00:00:00

  • Partial sequence homogenization in the 5S multigene families may generate sequence chimeras and spurious results in phylogenetic reconstructions.

    abstract::Multigene families have provided opportunities for evolutionary biologists to assess molecular evolution processes and phylogenetic reconstructions at deep and shallow systematic levels. However, the use of these markers is not free of technical and analytical challenges. Many evolutionary studies that used the nuclea...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syt101

    authors: Galián JA,Rosato M,Rosselló JA

    更新日期:2014-03-01 00:00:00

  • Radiation of extant cetaceans driven by restructuring of the oceans.

    abstract::The remarkable fossil record of whales and dolphins (Cetacea) has made them an exemplar of macroevolution. Although their overall adaptive transition from terrestrial to fully aquatic organisms is well known, this is not true for the radiation of modern whales. Here, we explore the diversification of extant cetaceans ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp060

    authors: Steeman ME,Hebsgaard MB,Fordyce RE,Ho SY,Rabosky DL,Nielsen R,Rahbek C,Glenner H,Sørensen MV,Willerslev E

    更新日期:2009-12-01 00:00:00

  • Integration of Anatomy Ontologies and Evo-Devo Using Structured Markov Models Suggests a New Framework for Modeling Discrete Phenotypic Traits.

    abstract::Modeling discrete phenotypic traits for either ancestral character state reconstruction or morphology-based phylogenetic inference suffers from ambiguities of character coding, homology assessment, dependencies, and selection of adequate models. These drawbacks occur because trait evolution is driven by two key proces...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syz005

    authors: Tarasov S

    更新日期:2019-09-01 00:00:00

  • Bayesian phylogenetic analysis of combined data.

    abstract::The recent development of Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) techniques has facilitated the exploration of parameter-rich evolutionary models. At the same time, stochastic models have become more realistic (and complex) and have been extended to new types of data, such as morphology....

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150490264699

    authors: Nylander JA,Ronquist F,Huelsenbeck JP,Nieves-Aldrey JL

    更新日期:2004-02-01 00:00:00

  • Measuring Stratigraphic Congruence Across Trees, Higher Taxa, and Time.

    abstract::The congruence between the order of cladistic branching and the first appearance dates of fossil lineages can be quantified using a variety of indices. Good matching is a prerequisite for the accurate time calibration of trees, while the distribution of congruence indices across large samples of cladograms has underpi...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw039

    authors: O'Connor A,Wills MA

    更新日期:2016-09-01 00:00:00

  • Southern hemisphere biogeography inferred by event-based models: plant versus animal patterns.

    abstract::The Southern Hemisphere has traditionally been considered as having a fundamentally vicariant history. The common trans-Pacific disjunctions are usually explained by the sequential breakup of the supercontinent Gondwana during the last 165 million years, causing successive division of an ancestral biota. However, rece...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150490423430

    authors: Sanmartín I,Ronquist F

    更新日期:2004-04-01 00:00:00

  • Predicting total global species richness using rates of species description and estimates of taxonomic effort.

    abstract::We found that trends in the rate of description of 580,000 marine and terrestrial species, in the taxonomically authoritative World Register of Marine Species and Catalogue of Life databases, were similar until the 1950s. Since then, the relative number of marine to terrestrial species described per year has increased...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr080

    authors: Costello MJ,Wilson S,Houlding B

    更新日期:2012-10-01 00:00:00

  • Swapping Birth and Death: Symmetries and Transformations in Phylodynamic Models.

    abstract::Stochastic birth-death models provide the foundation for studying and simulating evolutionary trees in phylodynamics. A curious feature of such models is that they exhibit fundamental symmetries when the birth and death rates are interchanged. In this article, we first provide intuitive reasons for these known transfo...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syz039

    authors: Stadler T,Steel M

    更新日期:2019-09-01 00:00:00

  • Hylid frog phylogeny and sampling strategies for speciose clades.

    abstract::How should characters and taxa be sampled to resolve efficiently the phylogeny of ancient and highly speciose groups? We addressed this question empirically in the treefrog family Hylidae, which contains > 800 species and may be nonmonophyletic with respect to other anuran families. We sampled 81 species (54 hylids an...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150500234625

    authors: Wiens JJ,Fetzner JW,Parkinson CL,Reeder TW

    更新日期:2005-10-01 00:00:00

  • A novel test for host-symbiont codivergence indicates ancient origin of fungal endophytes in grasses.

    abstract::Significant phylogenetic codivergence between plant or animal hosts (H) and their symbionts or parasites (P) indicates the importance of their interactions on evolutionary time scales. However, valid and realistic methods to test for codivergence are not fully developed. One of the systems where possible codivergence ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150802172184

    authors: Schardl CL,Craven KD,Speakman S,Stromberg A,Lindstrom A,Yoshida R

    更新日期:2008-06-01 00:00:00

  • The Ascomycota tree of life: a phylum-wide phylogeny clarifies the origin and evolution of fundamental reproductive and ecological traits.

    abstract::We present a 6-gene, 420-species maximum-likelihood phylogeny of Ascomycota, the largest phylum of Fungi. This analysis is the most taxonomically complete to date with species sampled from all 15 currently circumscribed classes. A number of superclass-level nodes that have previously evaded resolution and were unnamed...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp020

    authors: Schoch CL,Sung GH,López-Giráldez F,Townsend JP,Miadlikowska J,Hofstetter V,Robbertse B,Matheny PB,Kauff F,Wang Z,Gueidan C,Andrie RM,Trippe K,Ciufetti LM,Wynns A,Fraker E,Hodkinson BP,Bonito G,Groenewald JZ,Arzanlou

    更新日期:2009-04-01 00:00:00

  • Incomplete Lineage Sorting in Mammalian Phylogenomics.

    abstract::The impact of incomplete lineage sorting (ILS) on phylogenetic conflicts among genes, and the related issue of whether to account for ILS in species tree reconstruction, are matters of intense controversy. Here, focusing on full-genome data in placental mammals, we empirically test two assumptions underlying current u...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw082

    authors: Scornavacca C,Galtier N

    更新日期:2017-01-01 00:00:00

  • Using supermatrices for phylogenetic inquiry: an example using the sedges.

    abstract::In this article, we use supermatrix data-mining methods to reconstruct a large, highly inclusive phylogeny of Cyperaceae from nucleotide data available on GenBank. We explore the properties of these trees and their utility for phylogenetic inference, and show that even the highly incomplete alignments characteristic o...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/sys088

    authors: Hinchliff CE,Roalson EH

    更新日期:2013-03-01 00:00:00

  • Bayes estimators for phylogenetic reconstruction.

    abstract::Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet, most reconstruction methods like maximum likelihood (ML) do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tr...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr021

    authors: Huggins PM,Li W,Haws D,Friedrich T,Liu J,Yoshida R

    更新日期:2011-07-01 00:00:00

  • Untangling complex histories of genome mergings in high polyploids.

    abstract::Polyploidy, the duplication of entire genomes, plays a major role in plant evolution. In allopolyploids, genome duplication is associated with hybridization between two or more divergent genomes. Successive hybridization and polyploidization events can build up species complexes of allopolyploids with complicated netw...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150701424553

    authors: Brysting AK,Oxelman B,Huber KT,Moulton V,Brochmann C

    更新日期:2007-06-01 00:00:00

  • Fitting nonstationary general-time-reversible models to obtain edge-lengths and frequencies for the barry-hartigan model.

    abstract::Among models of nucleotide evolution, the Barry and Hartigan (BH) model (also known as the General Markov Model) is very flexible as it allows separate arbitrary substitution matrices along edges. For a given tree, the estimates of the BH model are a set of joint probability matrices, each giving the pairwise frequenc...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/sys046

    authors: Zou L,Susko E,Field C,Roger AJ

    更新日期:2012-12-01 00:00:00

  • Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.

    abstract::Sampling across tree space is one of the major challenges in Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) algorithms. Standard MCMC tree moves consider small random perturbations of the topology, and select from candidate trees at random or based on the distance between the old and new topolog...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa002

    authors: Zhang C,Huelsenbeck JP,Ronquist F

    更新日期:2020-09-01 00:00:00

  • Restriction-Site-Associated DNA Sequencing Reveals a Cryptic Viburnum Species on the North American Coastal Plain.

    abstract::Species are the starting point for most studies of ecology and evolution, but the proper circumscription of species can be extremely difficult in morphologically variable lineages, and there are still few convincing examples of molecularly informed species delimitation in plants. Here, we focus on the Viburnum nudum c...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syy084

    authors: Spriggs EL,Eaton DAR,Sweeney PW,Schlutius C,Edwards EJ,Donoghue MJ

    更新日期:2019-03-01 00:00:00

  • Phylogenomics of parasitic and non-parasitic lice (Insecta: Psocodea): Combining sequence data and Exploring compositional bias solutions in Next Generation Datasets.

    abstract::The insect order Psocodea is a diverse lineage comprising both parasitic (Phthiraptera) and non-parasitic members (Psocoptera). The extreme age and ecological diversity of the group may be associated with major genomic changes, such as base compositional biases expected to affect phylogenetic inference. Divergent morp...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa075

    authors: de Moya RS,Yoshizawa K,Walden KKO,Sweet AD,Dietrich CH,Johnson KP

    更新日期:2020-09-26 00:00:00

  • Paleontology, genomics, and combined-data phylogenetics: can molecular data improve phylogeny estimation for fossil taxa?

    abstract::The genomics revolution offers great promise for resolving the phylogeny of living taxa, but does it offer any benefits for reconstructing relationships among extinct (fossil) taxa? Superficially, the answer would seem to be "no," given that molecular data cannot be obtained for most fossil taxa. However, because foss...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp012

    authors: Wiens JJ

    更新日期:2009-02-01 00:00:00

  • Phylogenetic relationships of fig wasps pollinating functionally dioecious Ficus based on mitochondrial DNA sequences and morphology.

    abstract::The obligate mutualism between pollinating fig wasps in the family Agaonidae (Hymenoptera: Chalcidoidea) and Ficus species (Moraceae) is often regarded as an example of co-evolution but little is known about the history of the interaction, and understanding the origin of functionally dioecious fig pollination has been...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:

    authors: Weiblen GD

    更新日期:2001-04-01 00:00:00

  • A Total-Evidence Dated Phylogeny of Echinoidea Combining Phylogenomic and Paleontological Data.

    abstract::Phylogenomic and paleontological data constitute complementary resources for unravelling the phylogenetic relationships and divergence times of lineages, yet few studies have attempted to fully integrate them. Several unique properties of echinoids (sea urchins) make them especially useful for such synthetizing approa...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa069

    authors: Koch NM,Thompson JR

    更新日期:2020-09-03 00:00:00

  • Tracing the temporal and spatial origins of island endemics in the Mediterranean region: a case study from the citrus family (Ruta L., Rutaceae).

    abstract::Understanding the origin of island endemics is a central task of historical biogeography. Recent methodological advances provide a rigorous framework to determine the relative contribution of different biogeographic processes (e.g., vicariance, land migration, long-distance dispersal) to the origin of island endemics....

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syq046

    authors: Salvo G,Ho SY,Rosenbaum G,Ree R,Conti E

    更新日期:2010-12-01 00:00:00

  • Nomenclature for the Nameless: A Proposal for an Integrative Molecular Taxonomy of Cryptic Diversity Exemplified by Planktonic Foraminifera.

    abstract::Investigations of biodiversity, biogeography, and ecological processes rely on the identification of "species" as biologically significant, natural units of evolution. In this context, morphotaxonomy only provides an adequate level of resolution if reproductive isolation matches morphological divergence. In many group...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw031

    authors: Morard R,Escarguel G,Weiner AK,André A,Douady CJ,Wade CM,Darling KF,Ujiié Y,Seears HA,Quillévéré F,de Garidel-Thoron T,de Vargas C,Kucera M

    更新日期:2016-09-01 00:00:00

  • Testing congruence in phylogenomic analysis.

    abstract::Phylogenomic analyses of large sets of genes or proteins have the potential to revolutionize our understanding of the tree of life. However, problems arise because estimated phylogenies from individual loci often differ because of different histories, systematic bias, or stochastic error. We have developed Concaterpil...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150801910436

    authors: Leigh JW,Susko E,Baumgartner M,Roger AJ

    更新日期:2008-02-01 00:00:00