Taxon influence index: assessing taxon-induced incongruities in phylogenetic inference.

Abstract:

:Understanding the evolutionary history of species is at the core of molecular evolution and is done using several inference methods. The critical issue is to quantify the uncertainty of the inference. The posterior probabilities in Bayesian phylogenetic inference and the bootstrap values in frequentist approaches measure the variability of the estimates due to the sampling of sites from genes and the sampling of genes from genomes. However, they do not measure the uncertainty due to taxon sampling. Taxa that experienced molecular homoplasy, recent selection, a spur of evolution, and so forth may disrupt the inference and cause incongruences in the estimated phylogeny. We define a taxon influence index to assess the influence of each taxon on the phylogeny. We found that although most taxa have a weak influence on the phylogeny, a small fraction of influential taxa strongly alter it even in clades only loosely related to them. We conclude that highly influential taxa should be given special attention and sampling them more thoroughly can lead to more dependable phylogenies.

journal_name

Syst Biol

journal_title

Systematic biology

authors

Mariadassou M,Bar-Hen A,Kishino H

doi

10.1093/sysbio/syr129

subject

Has Abstract

pub_date

2012-03-01 00:00:00

pages

337-45

issue

2

eissn

1063-5157

issn

1076-836X

pii

syr129

journal_volume

61

pub_type

杂志文章
  • Fitting nonstationary general-time-reversible models to obtain edge-lengths and frequencies for the barry-hartigan model.

    abstract::Among models of nucleotide evolution, the Barry and Hartigan (BH) model (also known as the General Markov Model) is very flexible as it allows separate arbitrary substitution matrices along edges. For a given tree, the estimates of the BH model are a set of joint probability matrices, each giving the pairwise frequenc...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/sys046

    authors: Zou L,Susko E,Field C,Roger AJ

    更新日期:2012-12-01 00:00:00

  • Toward an integrated system of clade names.

    abstract::Although the proposition that higher taxa should correspond to clades is widely accepted, current nomenclature does not distinguish clearly between different clades in nested series. In particular, the same name is often applied to a total clade, its crown clade, and clades originating with various nodes, branches, an...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150701656378

    authors: de Queiroz K

    更新日期:2007-12-01 00:00:00

  • An Integrated Model of Phenotypic Trait Changes and Site-Specific Sequence Evolution.

    abstract::Recent years have seen a constant rise in the availability of trait data, including morphological features, ecological preferences, and life history characteristics. These phenotypic data provide means to associate genomic regions with phenotypic attributes, thus allowing the identification of phenotypic traits associ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syx032

    authors: Levy Karin E,Wicke S,Pupko T,Mayrose I

    更新日期:2017-11-01 00:00:00

  • Phylogenomics, Origin and Diversification of Anthozoans (Phylum Cnidaria).

    abstract::Anthozoan cnidarians (corals and sea anemones) include some of the world's most important foundation species, capable of building massive reef complexes that support entire ecosystems. Although previous molecular phylogenetic analyses have revealed widespread homoplasy of the morphological characters traditionally use...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa103

    authors: McFadden CS,Quattrini AM,Brugler MR,Cowman PF,Dueñas LF,Kitahara MV,Paz-García DA,Reimer JD,Rodríguez E

    更新日期:2021-01-28 00:00:00

  • Tracing the temporal and spatial origins of island endemics in the Mediterranean region: a case study from the citrus family (Ruta L., Rutaceae).

    abstract::Understanding the origin of island endemics is a central task of historical biogeography. Recent methodological advances provide a rigorous framework to determine the relative contribution of different biogeographic processes (e.g., vicariance, land migration, long-distance dispersal) to the origin of island endemics....

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syq046

    authors: Salvo G,Ho SY,Rosenbaum G,Ree R,Conti E

    更新日期:2010-12-01 00:00:00

  • Quantification of homoplasy for nucleotide transitions and transversions and a reexamination of assumptions in weighted phylogenetic analysis.

    abstract::Nucleotide transitions are frequently down-weighted relative to transversions in phylogenetic analysis. This is based on the assumption that transitions, by virtue of their greater evolutionary rate, exhibit relatively more homoplasy and are therefore less reliable phylogenetic characters. Relative amounts of homoplas...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351500750049734

    authors: Broughton RE,Stanley SE,Durrett RT

    更新日期:2000-12-01 00:00:00

  • Multiple data sets, high homoplasy, and the phylogeny of softshell turtles (Testudines: Trionychidae).

    abstract::We present a phylogenetic hypothesis and novel, rank-free classification for all extant species of softshell turtles (Testudines:Trionychidae). Our data set included DNA sequence data from two mitochondrial protein-coding genes and a approximately 1-kb nuclear intron for 23 of 26 recognized species, and 59 previously ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150490503053

    authors: Engstrom TN,Shaffer HB,McCord WP

    更新日期:2004-10-01 00:00:00

  • Southern hemisphere biogeography inferred by event-based models: plant versus animal patterns.

    abstract::The Southern Hemisphere has traditionally been considered as having a fundamentally vicariant history. The common trans-Pacific disjunctions are usually explained by the sequential breakup of the supercontinent Gondwana during the last 165 million years, causing successive division of an ancestral biota. However, rece...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150490423430

    authors: Sanmartín I,Ronquist F

    更新日期:2004-04-01 00:00:00

  • Paleontology, genomics, and combined-data phylogenetics: can molecular data improve phylogeny estimation for fossil taxa?

    abstract::The genomics revolution offers great promise for resolving the phylogeny of living taxa, but does it offer any benefits for reconstructing relationships among extinct (fossil) taxa? Superficially, the answer would seem to be "no," given that molecular data cannot be obtained for most fossil taxa. However, because foss...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp012

    authors: Wiens JJ

    更新日期:2009-02-01 00:00:00

  • Gene Tree Discordance Causes Apparent Substitution Rate Variation.

    abstract::Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampant in genome-wide da...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw018

    authors: Mendes FK,Hahn MW

    更新日期:2016-07-01 00:00:00

  • Maximizing phylogenetic diversity in biodiversity conservation: Greedy solutions to the Noah's Ark problem.

    abstract::The Noah's Ark Problem (NAP) is a comprehensive cost-effectiveness methodology for biodiversity conservation that was introduced by Weitzman (1998) and utilizes the phylogenetic tree containing the taxa of interest to assess biodiversity. Given a set of taxa, each of which has a particular survival probability that ca...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150600873876

    authors: Hartmann K,Steel M

    更新日期:2006-08-01 00:00:00

  • Placing paleopolyploidy in relation to taxon divergence: a phylogenetic analysis in legumes using 39 gene families.

    abstract::Young polyploid events are easily diagnosed by various methods, but older polyploid events become increasingly difficult to identify as chromosomal rearrangements, tandem gene or partial chromosome duplications, changes in substitution rates among duplicated genes, pseudogenization or locus loss, and interlocus intera...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150590945359

    authors: Pfeil BE,Schlueter JA,Shoemaker RC,Doyle JJ

    更新日期:2005-06-01 00:00:00

  • The inference of gene trees with species trees.

    abstract::This article reviews the various models that have been used to describe the relationships between gene trees and species trees. Molecular phylogeny has focused mainly on improving models for the reconstruction of gene trees based on sequence alignments. Yet, most phylogeneticists seek to reveal the history of species....

    journal_title:Systematic biology

    pub_type: 杂志文章,评审

    doi:10.1093/sysbio/syu048

    authors: Szöllősi GJ,Tannier E,Daubin V,Boussau B

    更新日期:2015-01-01 00:00:00

  • Testing congruence in phylogenomic analysis.

    abstract::Phylogenomic analyses of large sets of genes or proteins have the potential to revolutionize our understanding of the tree of life. However, problems arise because estimated phylogenies from individual loci often differ because of different histories, systematic bias, or stochastic error. We have developed Concaterpil...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150801910436

    authors: Leigh JW,Susko E,Baumgartner M,Roger AJ

    更新日期:2008-02-01 00:00:00

  • A novel test for host-symbiont codivergence indicates ancient origin of fungal endophytes in grasses.

    abstract::Significant phylogenetic codivergence between plant or animal hosts (H) and their symbionts or parasites (P) indicates the importance of their interactions on evolutionary time scales. However, valid and realistic methods to test for codivergence are not fully developed. One of the systems where possible codivergence ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/10635150802172184

    authors: Schardl CL,Craven KD,Speakman S,Stromberg A,Lindstrom A,Yoshida R

    更新日期:2008-06-01 00:00:00

  • BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics.

    abstract::Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting m...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr100

    authors: Ayres DL,Darling A,Zwickl DJ,Beerli P,Holder MT,Lewis PO,Huelsenbeck JP,Ronquist F,Swofford DL,Cummings MP,Rambaut A,Suchard MA

    更新日期:2012-01-01 00:00:00

  • Species-Level Phylogeny and Polyploid Relationships in Hordeum (Poaceae) Inferred by Next-Generation Sequencing and In Silico Cloning of Multiple Nuclear Loci.

    abstract::Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syv035

    authors: Brassac J,Blattner FR

    更新日期:2015-09-01 00:00:00

  • Phylogenetic Trees and Networks Can Serve as Powerful and Complementary Approaches for Analysis of Genomic Data.

    abstract::Genomic data have had a profound impact on nearly every biological discipline. In systematics and phylogenetics, the thousands of loci that are now being sequenced can be analyzed under the multispecies coalescent model (MSC) to explicitly account for gene tree discordance due to incomplete lineage sorting (ILS). Howe...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syz056

    authors: Blair C,Ané C

    更新日期:2020-05-01 00:00:00

  • Is congruence between data partitions a reliable predictor of phylogenetic accuracy? Empirically testing an iterative procedure for choosing among phylogenetic methods.

    abstract::The relationship between phylogenetic accuracy and congruence between data partitions collected from the same taxa was explored for mitochondrial DNA sequences from two well-supported vertebrate phylogenies. An iterative procedure was adopted whereby accuracy, phylogenetic signal, and congruence were measured before a...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/46.3.464

    authors: Cunningham CW

    更新日期:1997-09-01 00:00:00

  • Bayes estimators for phylogenetic reconstruction.

    abstract::Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet, most reconstruction methods like maximum likelihood (ML) do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tr...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syr021

    authors: Huggins PM,Li W,Haws D,Friedrich T,Liu J,Yoshida R

    更新日期:2011-07-01 00:00:00

  • Repeated evolution of dioecy from monoecy in Siparunaceae (Laurales).

    abstract::Siparunaceae comprise Glossocalyx with one species in West Africa and Siparuna with 65 species in the neotropics; all have unisexual flowers, and 15 species are monoecious, 50 dioecious. Parsimony and maximum likelihood analyses of combined nuclear ribosomal ITS and chloroplast trnL-trnF intergenic spacer sequences yi...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351501753328820

    authors: Renner SS,Won H

    更新日期:2001-09-01 00:00:00

  • Split support and split conflict randomization tests in phylogenetic inference.

    abstract::Randomization tests allow the formulation and statistical testing of null hypotheses about the quality of entire data sets or the quality of fit between the data and particular phylogenetic hypotheses. Randomization tests of phylogenetic hypotheses based on the concepts of split support and split conflict are describe...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351598260662

    authors: Wilkinson M

    更新日期:1998-12-01 00:00:00

  • Robustness of compound Dirichlet priors for Bayesian inference of branch lengths.

    abstract::We modified the phylogenetic program MrBayes 3.1.2 to incorporate the compound Dirichlet priors for branch lengths proposed recently by Rannala, Zhu, and Yang (2012. Tail paradox, partial identifiability and influential priors in Bayesian branch length inference. Mol. Biol. Evol. 29:325-335.) as a solution to the prob...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/sys030

    authors: Zhang C,Rannala B,Yang Z

    更新日期:2012-10-01 00:00:00

  • More characters or more taxa for a robust phylogeny--case study from the coffee family (Rubiaceae).

    abstract::Using different data sets mainly from the plant family Rubiaceae, but in parts also from the Apocynaceae, Asteraceae, Lardizabalaceae, Saxifragaceae, and Solanaceae, we have investigated the effect of number of characters, number of taxa, and kind of data on bootstrap values within phylogenetic trees. The percentage o...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351599260085

    authors: Bremer B,Jansen RK,Oxelman B,Backlund M,Lantz H,Kim KJ

    更新日期:1999-09-01 00:00:00

  • A Skull Might Lie: Modeling Ancestral Ranges and Diet from Genes and Shape of Tree Squirrels.

    abstract::Tropical forests of Central and South America represent hotspots of biological diversity. Tree squirrels of the tribe Sciurini are an excellent model system for the study of tropical biodiversity as these squirrels disperse exceptional distances, and after colonizing the tropics of the Central and South America, they ...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syv054

    authors: Pečnerová P,Moravec JC,Martínková N

    更新日期:2015-11-01 00:00:00

  • Effect of nonindependent substitution on phylogenetic accuracy.

    abstract::All current phylogenetic methods assume that DNA substitutions are independent among sites. However, ample empirical evidence suggests that the process of substitution is not independent but is, in fact, temporally and spatially correlated. The robustness of several commonly used phylogenetic methods to the assumption...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1080/106351599260319

    authors: Huelsenbeck JP,Nielsen R

    更新日期:1999-06-01 00:00:00

  • New heuristic methods for joint species delimitation and species tree inference.

    abstract::Species delimitation and species tree inference are difficult problems in cases of recent divergence, especially when different loci have different histories. This paper quantifies the difficulty of jointly finding the division of samples to species and estimating a species tree without constraining the possible assig...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syp077

    authors: O'Meara BC

    更新日期:2010-01-01 00:00:00

  • Phylogenetic signal variation in the genomes of Medicago (Fabaceae).

    abstract::Genome-scale data offer the opportunity to clarify phylogenetic relationships that are difficult to resolve with few loci, but they can also identify genomic regions with evolutionary history distinct from that of the species history. We collected whole-genome sequence data from 29 taxa in the legume genus Medicago, t...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syt009

    authors: Yoder JB,Briskine R,Mudge J,Farmer A,Paape T,Steele K,Weiblen GD,Bharti AK,Zhou P,May GD,Young ND,Tiffin P

    更新日期:2013-05-01 00:00:00

  • Phylogenomics of parasitic and non-parasitic lice (Insecta: Psocodea): Combining sequence data and Exploring compositional bias solutions in Next Generation Datasets.

    abstract::The insect order Psocodea is a diverse lineage comprising both parasitic (Phthiraptera) and non-parasitic members (Psocoptera). The extreme age and ecological diversity of the group may be associated with major genomic changes, such as base compositional biases expected to affect phylogenetic inference. Divergent morp...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syaa075

    authors: de Moya RS,Yoshizawa K,Walden KKO,Sweet AD,Dietrich CH,Johnson KP

    更新日期:2020-09-26 00:00:00

  • Nomenclature for the Nameless: A Proposal for an Integrative Molecular Taxonomy of Cryptic Diversity Exemplified by Planktonic Foraminifera.

    abstract::Investigations of biodiversity, biogeography, and ecological processes rely on the identification of "species" as biologically significant, natural units of evolution. In this context, morphotaxonomy only provides an adequate level of resolution if reproductive isolation matches morphological divergence. In many group...

    journal_title:Systematic biology

    pub_type: 杂志文章

    doi:10.1093/sysbio/syw031

    authors: Morard R,Escarguel G,Weiner AK,André A,Douady CJ,Wade CM,Darling KF,Ujiié Y,Seears HA,Quillévéré F,de Garidel-Thoron T,de Vargas C,Kucera M

    更新日期:2016-09-01 00:00:00