Improving phylogenetic inference with a semiempirical amino acid substitution model.

Abstract:

Amino acid substitution matrices describe the rates at which amino acids are replaced during evolution. In contrast to nucleotide or codon models, amino acid substitution matrices are in general parameterless and empirically estimated, probably because there is no obvious parametrization for amino acid substitutions. Principal component analysis has previously been used to improve codon substitution models by empirically finding the most relevant parameters. Here, we apply the same method to amino acid substitution matrices, leading to a semiempirical substitution model that can adjust the transition rates to the protein sequences under investigation. Our new model almost invariably achieves the best likelihood values in large-scale comparisons with established amino acid substitution models (JTT, WAG, and LG). For longer alignments in particular, these likelihood gains are considerably larger than what could be expected from simply having more parameters. The application of our model differs from that of mixture models (such as UL2 or UL3), as we optimize one rate matrix per alignment, whereas mixture models apply the variation per alignment site. This makes our model computationally more efficient, while the performance is comparable to that of UL3. Applied to the phylogenetic problem of the origin of placental mammals, our new model and the UL3 mixture model are the only ones of the tested models that cluster Afrotheria and Xenarthra into a clade called Atlantogenata, which agrees with recent findings using more sophisticated phylogenetic methods.
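
The idea of a semiempirical rate matrix can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say which matrix representation the principal component analysis is run on, so the choice of log-exchangeabilities, the random placeholder training data, the number of components, and the names `build_rate_matrix` and `transition_probs` are all assumptions made for illustration. The sketch runs PCA on a set of flattened exchangeability vectors, keeps the first two components as free parameters, and builds one reversible 20-state rate matrix for a given pair of weights.

```python
import numpy as np

def build_rate_matrix(mean_log_exch, components, weights, freqs):
    """Reversible rate matrix from a mean vector plus weighted principal components.

    Assumption: the PCA is done on log-exchangeabilities (upper triangle, flattened).
    """
    n = len(freqs)
    upper = np.triu_indices(n, k=1)
    log_exch = mean_log_exch + weights @ components  # one value per amino acid pair
    S = np.zeros((n, n))
    S[upper] = np.exp(log_exch)                      # exp keeps exchangeabilities positive
    S = S + S.T                                      # symmetric exchangeabilities
    Q = S * freqs[np.newaxis, :]                     # q_ij = s_ij * pi_j for i != j
    np.fill_diagonal(Q, -Q.sum(axis=1))              # rows sum to zero
    rate = -np.dot(freqs, np.diag(Q))                # expected substitutions per unit time
    return Q / rate                                  # scale to one substitution per unit time

def transition_probs(Q, freqs, t):
    """P(t) = exp(Q t), computed through the symmetric form of a reversible Q."""
    sqrt_pi = np.sqrt(freqs)
    B = Q * sqrt_pi[:, np.newaxis] / sqrt_pi[np.newaxis, :]
    B = (B + B.T) / 2.0                              # remove rounding asymmetry
    evals, evecs = np.linalg.eigh(B)
    exp_B = (evecs * np.exp(evals * t)) @ evecs.T
    return exp_B * sqrt_pi[np.newaxis, :] / sqrt_pi[:, np.newaxis]

# Placeholder "training set": 50 random log-exchangeability vectors (NOT real data).
rng = np.random.default_rng(0)
n_states = 20
n_pairs = n_states * (n_states - 1) // 2
training = rng.normal(size=(50, n_pairs))

# PCA via SVD of the centered data; rows of vt are the principal directions.
mean_vec = training.mean(axis=0)
_, _, vt = np.linalg.svd(training - mean_vec, full_matrices=False)
components = vt[:2]                                  # keep two free parameters

freqs = rng.dirichlet(np.ones(n_states))             # placeholder equilibrium frequencies
weights = np.array([0.5, -0.3])                      # would be optimized per alignment
Q = build_rate_matrix(mean_vec, components, weights, freqs)
P = transition_probs(Q, freqs, 0.1)
```

In a likelihood setting, `weights` would be optimized once per alignment together with branch lengths, in contrast to a mixture model, which varies the matrix per alignment site.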

journal_name

Mol Biol Evol

authors

Zoller S,Schneider A

doi

10.1093/molbev/mss229

subject

Has Abstract

pub_date

2013-02-01 00:00:00

pages

469-79

issue

2

eissn

1537-1719

issn

0737-4038

pii

mss229

journal_volume

30

pub_type

Journal Article
  • Colocality to Cofunctionality: Eukaryotic Gene Neighborhoods as a Resource for Function Discovery.

    abstract::Diverging from the classic paradigm of random gene order in eukaryotes, gene proximity can be leveraged to systematically identify functionally related gene neighborhoods in eukaryotes, utilizing techniques pioneered in bacteria. Current methods of identifying gene neighborhoods typically rely on sequence similarity t...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msaa221

    authors: Foflonker F,Blaby-Haas CE

    update_date:2021-01-23 00:00:00

  • Ancestry-Specific Analyses Reveal Differential Demographic Histories and Opposite Selective Pressures in Modern South Asian Populations.

    abstract::Genetic variation in contemporary South Asian populations follows a northwest to southeast decreasing cline of shared West Eurasian ancestry. A growing body of ancient DNA evidence is being used to build increasingly more realistic models of demographic changes in the last few thousand years. Through high-quality mode...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msz037

    authors: Yelmen B,Mondal M,Marnetto D,Pathak AK,Montinaro F,Gallego Romero I,Kivisild T,Metspalu M,Pagani L

    update_date:2019-08-01 00:00:00

  • Sequence evolution in bacterial endosymbionts having extreme base compositions.

    abstract::A major limitation on ability to reconstruct bacterial evolution is the lack of dated ancestors that might be used to evaluate and calibrate molecular clocks. Vertically transmitted symbionts that have cospeciated with animal hosts offer a firm basis for calibrating sequence evolution in bacteria, since fossils of the...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a026071

    authors: Clark MA,Moran NA,Baumann P

    update_date:1999-11-01 00:00:00

  • Parallel genetic and phenotypic evolution of DNA superhelicity in experimental populations of Escherichia coli.

    abstract::DNA supercoiling is the master function that interconnects chromosome structure and global gene transcription. This function has recently been shown to be under strong selection in Escherichia coli. During the evolution of 12 initially identical populations propagated in a defined environment for 20,000 generations, p...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msq099

    authors: Crozat E,Winkworth C,Gaffé J,Hallin PF,Riley MA,Lenski RE,Schneider D

    update_date:2010-09-01 00:00:00

  • Layers of evolvability in a bacteriophage life history trait.

    abstract::Functional redundancy in genomes arises from genes with overlapping functions, allowing phenotypes to persist after gene knockouts. Evolutionary redundancy or evolvability of a genome is one step removed, in that functional redundancy is absent but the genome has the potential to evolve to restore a lost phenotype. Ex...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msp037

    authors: Heineman RH,Bull JJ,Molineux IJ

    update_date:2009-06-01 00:00:00

  • Kinesin-related genes from diplomonad, sponge, amphioxus, and cyclostomes: divergence pattern of kinesin family and evolution of giardial membrane-bounded organella.

    abstract::To understand the question of whether divergence of eukaryotic genes by gene duplications and domain shufflings proceeded gradually or intermittently during evolution, we have cloned and sequenced Giardia lamblia cDNAs encoding kinesins and kinesin-related proteins and have obtained 13 kinesin-related cDNAs, some of w...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a004215

    authors: Iwabe N,Miyata T

    update_date:2002-09-01 00:00:00

  • A highly conserved nuclear gene for low-level phylogenetics: elongation factor-1 alpha recovers morphology-based tree for heliothine moths.

    abstract::Molecular systematists need increased access to nuclear genes. Highly conserved, low copy number protein-encoding nuclear genes have attractive features for phylogenetic inference but have heretofore been applied mostly to very ancient divergences. By virtue of their synonymous substitutions, such genes should contain...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a040244

    authors: Cho S,Mitchell A,Regier JC,Mitter C,Poole RW,Friedlander TP,Zhao S

    update_date:1995-07-01 00:00:00

  • Isolation of homeodomain-leucine zipper genes from the moss Physcomitrella patens and the evolution of homeodomain-leucine zipper genes in land plants.

    abstract::Homeobox genes encode transcription factors involved in many aspects of developmental processes. The homeodomain-leucine zipper (HD-Zip) genes, which are characterized by the presence of both a homeodomain and a leucine zipper motif, form a clade within the homeobox superfamily and were previously reported only from v...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a003828

    authors: Sakakibara K,Nishiyama T,Kato M,Hasebe M

    update_date:2001-04-01 00:00:00

  • Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree.

    abstract::The relative efficiencies of the maximum parsimony (MP) and distance-matrix methods in obtaining the correct tree (topology) were studied by using computer simulation. The distance-matrix methods examined are the neighbor-joining, distance-Wagner, Tateno et al. modified Farris, Faith, and Li methods. In the computer s...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a040497

    authors: Sourdis J,Nei M

    update_date:1988-05-01 00:00:00

  • The genealogy of a sequence subject to purifying selection at multiple sites.

    abstract::We investigate the effect of purifying selection at multiple sites on both the shape of the genealogy and the distribution of mutations on the tree. We find that the primary effect of purifying selection on a genealogy is to shift the distribution of mutations on the tree, whereas the shape of the tree remains largely...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a004199

    authors: Williamson S,Orive ME

    update_date:2002-08-01 00:00:00

  • Short repetitive sequences in green algal mitochondrial genomes: potential roles in mitochondrial genome evolution.

    abstract::Current data on green algal mitochondrial genomes suggest an unexpected dichotomy within the group with respect to genome structure, organization, and sequence affiliations. The present study suggests that there is a correlation between this dichotomy on one hand and the differences in the abundance, base composition,...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a025972

    authors: Nedelcu AM,Lee RW

    update_date:1998-06-01 00:00:00

  • Empirical assessment of RAD sequencing for interspecific phylogeny.

    abstract::Next-generation sequencing opened up new possibilities in phylogenetics; however, choosing an appropriate method of sample preparation remains challenging. Here, we demonstrate that restriction-site-associated DNA sequencing (RAD-seq) generates useful data for phylogenomics. Analysis of our RAD library using current b...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msu063

    authors: Cruaud A,Gautier M,Galan M,Foucaud J,Sauné L,Genson G,Dubois E,Nidelet S,Deuve T,Rasplus JY

    update_date:2014-05-01 00:00:00

  • From DNA to fitness differences: sequences and structures of adaptive variants of Colias phosphoglucose isomerase (PGI).

    abstract::Colias eurytheme butterflies display extensive allozyme polymorphism in the enzyme phosphoglucose isomerase (PGI). Earlier studies on biochemical and fitness effects of these genotypes found evidence of strong natural selection maintaining this polymorphism in the wild. Here we analyze the molecular features of this p...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msj062

    authors: Wheat CW,Watt WB,Pollock DD,Schulte PM

    update_date:2006-03-01 00:00:00

  • Myxosporea (Myxozoa, Cnidaria) Lack DNA Cytosine Methylation.

    abstract::DNA cytosine methylation is central to many biological processes, including regulation of gene expression, cellular differentiation, and development. This DNA modification is conserved across animals, having been found in representatives of sponges, ctenophores, cnidarians, and bilaterians, and with very few known ins...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msaa214

    authors: Kyger R,Luzuriaga-Neira A,Layman T,Milkewitz Sandberg TO,Singh D,Huchon D,Peri S,Atkinson SD,Bartholomew JL,Yi SV,Alvarez-Ponce D

    update_date:2021-01-23 00:00:00

  • Molecular evolution of cytochrome c oxidase: rate variation among subunit VIa isoforms.

    abstract::Cytochrome c oxidase (COX) consists of 13 subunits, 3 encoded in the mitochondrial genome and 10 in the nucleus. Little is known of the role of the nuclear-encoded subunits, some of which exhibit tissue-specific isoforms. Subunit VIa is unique in having tissue-specific isoforms in all mammalian species examined. We ex...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a025798

    authors: Schmidt TR,Jaradat SA,Goodman M,Lomax MI,Grossman LI

    update_date:1997-06-01 00:00:00

  • Ignoring heterozygous sites biases phylogenomic estimates of divergence times: implications for the evolutionary history of Microtus voles.

    abstract::Phylogenetic reconstruction of the evolutionary history of closely related organisms may be difficult because of the presence of unsorted lineages and of a relatively high proportion of heterozygous sites that are usually not handled well by phylogenetic programs. Genomic data may provide enough fixed polymorphisms to...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/mst271

    authors: Lischer HE,Excoffier L,Heckel G

    update_date:2014-04-01 00:00:00

  • Proteomic Analysis of Histones H2A/H2B and Variant Hv1 in Tetrahymena thermophila Reveals an Ancient Network of Chaperones.

    abstract::Epigenetic information, which can be passed on independently of the DNA sequence, is stored in part in the form of histone posttranslational modifications and specific histone variants. Although complexes necessary for deposition have been identified for canonical and variant histones, information regarding the chroma...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msz039

    authors: Ashraf K,Nabeel-Shah S,Garg J,Saettone A,Derynck J,Gingras AC,Lambert JP,Pearlman RE,Fillingham J

    update_date:2019-05-01 00:00:00

  • IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

    abstract::Large phylogenomics data sets require fast tree inference methods, especially for maximum-likelihood (ML) phylogenies. Fast programs exist, but due to inherent heuristics to find optimal trees, it is not clear whether the best tree is found. Thus, there is need for additional approaches that employ different search st...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msu300

    authors: Nguyen LT,Schmidt HA,von Haeseler A,Minh BQ

    update_date:2015-01-01 00:00:00

  • Comparison of site-specific rate-inference methods for protein sequences: empirical Bayesian methods are superior.

    abstract::The degree to which an amino acid site is free to vary is strongly dependent on its structural and functional importance. An amino acid that plays an essential role is unlikely to change over evolutionary time. Hence, the evolutionary rate at an amino acid site is indicative of how conserved this site is and, in turn,...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msh194

    authors: Mayrose I,Graur D,Ben-Tal N,Pupko T

    update_date:2004-09-01 00:00:00

  • Molecular evolution of FLORICAULA/LEAFY orthologs in the Andropogoneae (Poaceae).

    abstract::Members of the grass family (Poaceae) exhibit a broad range of inflorescence structures and other morphologies, making the grasses an interesting model system for studying the evolution of development. Here we present an analysis of the molecular evolution of FLORICAULA/LEAFY-like genes, which are important developmen...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msi095

    authors: Bomblies K,Doebley JF

    update_date:2005-04-01 00:00:00

  • sppIDer: A Species Identification Tool to Investigate Hybrid Genomes with High-Throughput Sequencing.

    abstract::The genomics era has expanded our knowledge about the diversity of the living world, yet harnessing high-throughput sequencing data to investigate alternative evolutionary trajectories, such as hybridization, is still challenging. Here we present sppIDer, a pipeline for the characterization of interspecies hybrids and...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msy166

    authors: Langdon QK,Peris D,Kyle B,Hittinger CT

    update_date:2018-11-01 00:00:00

  • Predicting mammalian SINE subfamily activity from A-tail length.

    abstract::Based on previous observations that newly inserted LINEs and SINEs have particularly long 3' A-tails, which shorten rapidly during evolutionary time, we have analyzed the rat and mouse genomes for evidence of recently inserted SINEs and LINEs. We find that the youngest predicted subfamilies of rodent identifier (ID) e...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msh225

    authors: Odom GL,Robichaux JL,Deininger PL

    update_date:2004-11-01 00:00:00

  • The use of chloroplast DNA to resolve plant phylogenies: noncoding versus rbcL sequences.

    abstract::Direct sequencing of polymerase chain reaction products is now an expanding area of plant systematics and evolution. Within angiosperms the rbcL gene has been widely sequenced and used for inferring plant phylogenies at higher taxonomic levels. Unfortunately rbcL does not usually contain enough information to resolve ...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a040157

    authors: Gielly L,Taberlet P

    update_date:1994-09-01 00:00:00

  • The complete mitochondrial genome of Tupaia belangeri and the phylogenetic affiliation of Scandentia to other eutherian orders.

    abstract::The complete mitochondrial genome of Tupaia belangeri, a representative of the eutherian order Scandentia, was determined and compared with full-length mitochondrial sequences of other eutherian orders described to date. The complete mitochondrial genome is 16,754 nt in length, with no obvious deviation from the gene...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a026417

    authors: Schmitz J,Ohme M,Zischler H

    update_date:2000-09-01 00:00:00

  • Maximum likelihood implementation of an isolation-with-migration model with three species for testing speciation with gene flow.

    abstract::We implement an isolation with migration model for three species, with migration occurring between two closely related species while an out-group species is used to provide further information concerning gene trees and model parameters. The model is implemented in the likelihood framework for analyzing multilocus geno...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/mss118

    authors: Zhu T,Yang Z

    update_date:2012-10-01 00:00:00

  • Genome-Wide Analysis in Brazilians Reveals Highly Differentiated Native American Genome Regions.

    abstract::Despite its population, geographic size, and emerging economic importance, disproportionately little genome-scale research exists into genetic factors that predispose Brazilians to disease, or the population genetics of risk. After identification of suitable proxy populations and careful analysis of tri-continental ad...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msw249

    authors: Mychaleckyj JC,Havt A,Nayak U,Pinkerton R,Farber E,Concannon P,Lima AA,Guerrant RL

    update_date:2017-03-01 00:00:00

  • A role for selection in regulating the evolutionary emergence of disease-causing and other coding CAG repeats in humans and mice.

    abstract::The evolutionary expansion of CAG repeats in human triplet expansion disease genes is intriguing because of their deleterious phenotype. In the past, this expansion has been suggested to reflect a broad genomewide expansion of repeats, which would imply that mutational and evolutionary processes acting on repeats diff...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a003873

    authors: Hancock JM,Worthey EA,Santibáñez-Koref MF

    update_date:2001-06-01 00:00:00

  • Genomic Evidence for Adaptive Inversion Clines in Drosophila melanogaster.

    abstract::Clines in chromosomal inversion polymorphisms, presumably driven by climatic gradients, are common but there is surprisingly little evidence for selection acting on them. Here we address this long-standing issue in Drosophila melanogaster by using diagnostic single nucleotide polymorphism (SNP) markers to estimate inver...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msw016

    authors: Kapun M,Fabian DK,Goudet J,Flatt T

    update_date:2016-05-01 00:00:00

  • Phylogenetic position of phylum Nemertini, inferred from 18S rRNA sequences: molecular data as a test of morphological character homology.

    abstract::Partial 18S rRNA sequence of the nemertine Cerebratulus lacteus was obtained and compared with those of coelomate metazoans and acoelomate platyhelminths to test whether nemertines share a most recent common ancestor with the platyhelminths, as traditionally has been implied, or whether nemertines lie within a protost...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/oxfordjournals.molbev.a040716

    authors: Turbeville JM,Field KG,Raff RA

    update_date:1992-03-01 00:00:00

  • How meaningful are Bayesian support values?

    abstract::In this study, we used an empirical example based on 100 mitochondrial genomes from higher teleost fishes to compare the accuracy of parsimony-based jackknife values with Bayesian support values. Phylogenetic analyses of 366 partitions, using differential taxon and character sampling from the entire data matrix of 100...

    journal_title:Molecular biology and evolution

    pub_type: Journal Article

    doi:10.1093/molbev/msh014

    authors: Simmons MP,Pickett KM,Miya M

    update_date:2004-01-01 00:00:00