Abstract:
:We develop a maximum likelihood (ML) method for estimating migration rates between species using genomic sequence data. A species tree is used to accommodate the phylogenetic relationships among three species, allowing for migration between the two sister species, while the third species is used as an out-group. A Markov chain characterization of the genealogical process of coalescence and migration is used to integrate out the migration histories at each locus analytically, whereas Gaussian quadrature is used to integrate over the coalescent times on each genealogical tree numerically. This is an extension of our early implementation of the symmetrical isolation-with-migration model for three species to accommodate arbitrary loci with two or three sequences per locus and to allow asymmetrical migration rates. Our implementation can accommodate tens of thousands of loci, making it feasible to analyze genome-scale data sets to test for gene flow. We calculate the posterior probabilities of gene trees at individual loci to identify genomic regions that are likely to have been transferred between species due to gene flow. We conduct a simulation study to examine the statistical properties of the likelihood ratio test for gene flow between the two in-group species and of the ML estimates of model parameters such as the migration rate. Inclusion of data from a third out-group species is found to increase dramatically the power of the test and the precision of parameter estimation. We compiled and analyzed several genomic data sets from the Drosophila fruit flies. Our analyses suggest no migration from D. melanogaster to D. simulans, and a significant amount of gene flow from D. simulans to D. melanogaster, at the rate of ~0.02 migrant individuals per generation. We discuss the utility of the multispecies coalescent model for species tree estimation, accounting for incomplete lineage sorting and migration.
journal_name
Syst Bioljournal_title
Systematic biologyauthors
Dalquen DA,Zhu T,Yang Zdoi
10.1093/sysbio/syw063subject
Has Abstractpub_date
2017-05-01 00:00:00pages
379-398issue
3eissn
1063-5157issn
1076-836Xpii
syw063journal_volume
66pub_type
杂志文章abstract::Genome-scale data offer the opportunity to clarify phylogenetic relationships that are difficult to resolve with few loci, but they can also identify genomic regions with evolutionary history distinct from that of the species history. We collected whole-genome sequence data from 29 taxa in the legume genus Medicago, t...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syt009
更新日期:2013-05-01 00:00:00
abstract::We present a 6-gene, 420-species maximum-likelihood phylogeny of Ascomycota, the largest phylum of Fungi. This analysis is the most taxonomically complete to date with species sampled from all 15 currently circumscribed classes. A number of superclass-level nodes that have previously evaded resolution and were unnamed...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syp020
更新日期:2009-04-01 00:00:00
abstract::Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invaria...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150490503035
更新日期:2004-08-01 00:00:00
abstract::Targeted sequence capture is becoming a widespread tool for generating large phylogenomic data sets to address difficult phylogenetic problems. However, this methodology often generates data sets in which increasing the number of taxa and loci increases amounts of missing data. Thus, a fundamental (but still unresolve...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syv058
更新日期:2016-01-01 00:00:00
abstract::The Noah's Ark Problem (NAP) is a comprehensive cost-effectiveness methodology for biodiversity conservation that was introduced by Weitzman (1998) and utilizes the phylogenetic tree containing the taxa of interest to assess biodiversity. Given a set of taxa, each of which has a particular survival probability that ca...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150600873876
更新日期:2006-08-01 00:00:00
abstract::Rapidly evolving pathogens, such as viruses and bacteria, accumulate genetic change at a similar timescale over which their epidemiological processes occur, such that, it is possible to make inferences about their infectious spread using phylogenetic time-trees. For this purpose it is necessary to choose a phylodynami...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syy048
更新日期:2019-03-01 00:00:00
abstract::We prove that the slope parameter of the ordinary least squares regression of phylogenetically independent contrasts (PICs) conducted through the origin is identical to the slope parameter of the method of generalized least squares (GLSs) regression under a Brownian motion model of evolution. This equivalence has seve...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syr118
更新日期:2012-05-01 00:00:00
abstract::Penelope-like elements (PLEs) are a relatively little studied class of eukaryotic retroelements, distinguished by the presence of the GIY-YIG endonuclease domain, the ability of some representatives to retain introns, and the similarity of PLE-encoded reverse transcriptases to telomerases. Although these retrotranspos...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150601077683
更新日期:2006-12-01 00:00:00
abstract::Lineage sorting and introgression can lead to incongruence among gene phylogenies, complicating the inference of species trees for large groups of taxa that have recently and rapidly radiated. In addition, it can be difficult to determine which of these processes is responsible for this incongruence. We explore these ...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150600697283
更新日期:2006-06-01 00:00:00
abstract::It is common for studies that employ the comparative method for the study of adaptation, i.e. documentation of potentially adaptive across-species patterns of trait-environment or trait-trait correlation, to be designated as "macroevolutionary." Authors are justified in using "macroevolution" in this way by appeal to ...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syaa086
更新日期:2021-01-07 00:00:00
abstract::Multigene families have provided opportunities for evolutionary biologists to assess molecular evolution processes and phylogenetic reconstructions at deep and shallow systematic levels. However, the use of these markers is not free of technical and analytical challenges. Many evolutionary studies that used the nuclea...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syt101
更新日期:2014-03-01 00:00:00
abstract::Species delimitation and species tree inference are difficult problems in cases of recent divergence, especially when different loci have different histories. This paper quantifies the difficulty of jointly finding the division of samples to species and estimating a species tree without constraining the possible assig...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syp077
更新日期:2010-01-01 00:00:00
abstract::Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet, most reconstruction methods like maximum likelihood (ML) do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tr...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syr021
更新日期:2011-07-01 00:00:00
abstract::Mouse lemurs (Microcebus) are a radiation of morphologically cryptic primates distributed throughout Madagascar for which the number of recognized species has exploded in the past two decades. This taxonomic revision has prompted understandable concern that there has been substantial oversplitting in the mouse lemur c...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syaa053
更新日期:2020-07-08 00:00:00
abstract::Performance measures of phylogenetic estimation methods such as accuracy, consistency, and power are an attempt at summarizing an ensemble of a given estimator's behavior. These summaries characterize an ensemble behavior with a single number, leading to a variety of definitions. In particular, the relationships betwe...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/106351598261021
更新日期:1998-03-01 00:00:00
abstract::Stochastic birth-death models provide the foundation for studying and simulating evolutionary trees in phylodynamics. A curious feature of such models is that they exhibit fundamental symmetries when the birth and death rates are interchanged. In this article, we first provide intuitive reasons for these known transfo...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syz039
更新日期:2019-09-01 00:00:00
abstract::This article reviews the various models that have been used to describe the relationships between gene trees and species trees. Molecular phylogeny has focused mainly on improving models for the reconstruction of gene trees based on sequence alignments. Yet, most phylogeneticists seek to reveal the history of species....
journal_title:Systematic biology
pub_type: 杂志文章,评审
doi:10.1093/sysbio/syu048
更新日期:2015-01-01 00:00:00
abstract::Understanding the evolutionary history of species is at the core of molecular evolution and is done using several inference methods. The critical issue is to quantify the uncertainty of the inference. The posterior probabilities in Bayesian phylogenetic inference and the bootstrap values in frequentist approaches meas...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syr129
更新日期:2012-03-01 00:00:00
abstract::The Cracidae is one of the most endangered and distinctive bird families in the Neotropics, yet the higher relationships among taxa remain uncertain. The molecular phylogeny of its 11 genera was inferred using 10,678 analyzable sites (5,412 from seven different mitochondrial segments and 5,266 sites from four nuclear ...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150290102519
更新日期:2002-12-01 00:00:00
abstract::Among models of nucleotide evolution, the Barry and Hartigan (BH) model (also known as the General Markov Model) is very flexible as it allows separate arbitrary substitution matrices along edges. For a given tree, the estimates of the BH model are a set of joint probability matrices, each giving the pairwise frequenc...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/sys046
更新日期:2012-12-01 00:00:00
abstract::The main goals of this study were to provide a robust phylogeny for the families of the superfamily Curculionoidea, to discover relationships and major natural groups within the family Curculionidae, and to clarify the evolution of larval habits and host-plant associations in weevils to analyze their role in weevil di...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150290102465
更新日期:2002-10-01 00:00:00
abstract::We collected ~29 kb of sequence data using Roche 454 pyrosequencing in order to estimate the timing and pattern of diversification in the carnivorous pitcher plant Sarracenia alata. Utilizing modified protocols for reduced representation library construction, we generated sequence data from 86 individuals across 10 po...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/sys048
更新日期:2012-10-01 00:00:00
abstract::The genomics revolution offers great promise for resolving the phylogeny of living taxa, but does it offer any benefits for reconstructing relationships among extinct (fossil) taxa? Superficially, the answer would seem to be "no," given that molecular data cannot be obtained for most fossil taxa. However, because foss...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syp012
更新日期:2009-02-01 00:00:00
abstract::Previous phylogenetic analyses of tetrapod 18S ribosomal RNA (rRNA) sequences support the grouping of birds with mammals, whereas other molecular data, and morphological and paleontological data favor the grouping of birds with crocodiles. The 18S rRNA gene has consequently been considered odd, serving as "definitive ...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150390196948
更新日期:2003-06-01 00:00:00
abstract::How should characters and taxa be sampled to resolve efficiently the phylogeny of ancient and highly speciose groups? We addressed this question empirically in the treefrog family Hylidae, which contains > 800 species and may be nonmonophyletic with respect to other anuran families. We sampled 81 species (54 hylids an...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150500234625
更新日期:2005-10-01 00:00:00
abstract::The evolutionary history of gains and losses of vegetative reproductive propagules (soredia) in Porpidia s.l., a group of lichen-forming ascomycetes, was clarified using Bayesian Markov chain Monte Carlo (MCMC) approaches to monophyly tests and a combined MCMC and maximum likelihood approach to ancestral character sta...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1080/10635150600697465
更新日期:2006-06-01 00:00:00
abstract::Evolutionary relationships have remained unresolved in many well-studied groups, even though advances in next-generation sequencing and analysis, using approaches such as transcriptomics, anchored hybrid enrichment, or ultraconserved elements, have brought systematics to the brink of whole genome phylogenomics. Recent...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syz030
更新日期:2020-01-01 00:00:00
abstract::The spatial distribution of biomes has changed considerably over deep time, so the geographical opportunity for an evolutionary lineage to shift into a new biome may depend on how the availability and connectivity of biomes has varied temporally. To better understand how lineages shift between biomes in space and time...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syaa045
更新日期:2021-01-01 00:00:00
abstract::In recent years, there has been controversy whether multidimensional data such as geometric morphometric data or information on gene expression can be used for estimating phylogenies. This study uses simulations of evolution in multidimensional phenotype spaces to address this question and to identify specific factors...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syaa003
更新日期:2020-09-01 00:00:00
abstract::We found that trends in the rate of description of 580,000 marine and terrestrial species, in the taxonomically authoritative World Register of Marine Species and Catalogue of Life databases, were similar until the 1950s. Since then, the relative number of marine to terrestrial species described per year has increased...
journal_title:Systematic biology
pub_type: 杂志文章
doi:10.1093/sysbio/syr080
更新日期:2012-10-01 00:00:00