A class frequency mixture model that adjusts for site-specific amino acid frequencies and improves inference of protein phylogeny.

Abstract:

BACKGROUND: Widely used substitution models for proteins, such as the Jones-Taylor-Thornton (JTT) or Whelan and Goldman (WAG) models, are based on empirical amino acid interchange matrices estimated from databases of protein alignments that incorporate the average amino acid frequencies of the data set under examination (e.g., JTT + F). Variation in the evolutionary process between sites is typically modelled by a rates-across-sites distribution such as the gamma (Gamma) distribution. However, sites in proteins also vary in the kinds of amino acid interchanges that are favoured, a feature that is ignored by standard empirical substitution matrices. Here we examine the degree to which the pattern of evolution at sites differs from that expected based on empirical amino acid substitution models and evaluate the impact of these deviations on phylogenetic estimation.

RESULTS: We analyzed 21 large protein alignments with two statistical tests designed to detect deviation of site-specific amino acid distributions from data simulated under the standard empirical substitution model: JTT + F + Gamma. We found that the number of states at a given site is, on average, smaller and the frequencies of these states are less uniform than expected based on a JTT + F + Gamma substitution model. With a four-taxon example, we show that phylogenetic estimation under the JTT + F + Gamma model is seriously biased by a long-branch attraction artefact if the data are simulated under a model utilizing the observed site-specific amino acid frequencies from an alignment. Principal components analyses indicate the existence of at least four major site-specific frequency classes in these 21 protein alignments. Using a mixture model with these four separate classes of site-specific state frequencies plus a fifth class of global frequencies (the JTT + cF + Gamma model), significant improvements in model fit for real data sets can be achieved. This simple mixture model also reduces the long-branch attraction problem, as shown by simulations and analyses of a real phylogenomic data set.

CONCLUSION: Protein families display site-specific evolutionary dynamics that are ignored by standard protein phylogenetic models. Accurate estimation of protein phylogenies requires models that accommodate the heterogeneity in the evolutionary process across sites. To this end, we have implemented a class frequency mixture model (cF) in a freely available program called QmmRAxML for phylogenetic estimation.
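
The two site-level quantities mentioned in the results (how many distinct amino acids occur at a site, and how uniform their frequencies are) can be illustrated with a minimal sketch. This is not the authors' test procedure: the function name `site_summaries`, the toy alignment, and the use of Shannon entropy as the uniformity measure are all assumptions made for illustration only.

```python
from collections import Counter
from math import log

def site_summaries(alignment):
    """Return (number of distinct states, Shannon entropy in nats) per column.

    Hypothetical helper: entropy stands in for "uniformity" here as an
    assumption; the abstract does not specify the statistics actually used.
    """
    gap_chars = {"-", "?", "X"}  # treated as missing data (assumption)
    n_cols = len(alignment[0])
    summaries = []
    for col in range(n_cols):
        # Count observed amino acids in this column, skipping missing data.
        counts = Counter(seq[col] for seq in alignment if seq[col] not in gap_chars)
        total = sum(counts.values())
        if total == 0:
            summaries.append((0, 0.0))
            continue
        # Lower entropy means the observed frequencies are less uniform.
        entropy = -sum((c / total) * log(c / total) for c in counts.values())
        summaries.append((len(counts), entropy))
    return summaries

# Toy alignment of four equal-length sequences (made-up data).
toy_alignment = ["MKLV", "MKIV", "MRLV", "MKLA"]
summaries = site_summaries(toy_alignment)
```

In an analysis along the lines described above, such per-site summaries from a real alignment would be compared with the same summaries computed on data simulated under the reference model; that comparison step is not shown here.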

journal_name

BMC Evol Biol

journal_title

BMC evolutionary biology

authors

Wang HC,Li K,Susko E,Roger AJ

doi

10.1186/1471-2148-8-331

subject

Has Abstract

pub_date

2008-12-16 00:00:00

pages

331

issn

1471-2148

pii

1471-2148-8-331

journal_volume

8

pub_type

Journal Article
  • Origin and diversification of the basic helix-loop-helix gene family in metazoans: insights from comparative genomics.

    abstract:BACKGROUND:Molecular and genetic analyses conducted in model organisms such as Drosophila and vertebrates have provided a wealth of information about how networks of transcription factors control the proper development of these species. Much less is known, however, about the evolutionary origin of these elaborated net...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-7-33

    authors: Simionato E,Ledent V,Richards G,Thomas-Chollier M,Kerner P,Coornaert D,Degnan BM,Vervoort M

    update_date:2007-03-02 00:00:00

  • Support for the reproductive ground plan hypothesis of social evolution and major QTL for ovary traits of Africanized worker honey bees (Apis mellifera L.).

    abstract:BACKGROUND:The reproductive ground plan hypothesis of social evolution suggests that reproductive controls of a solitary ancestor have been co-opted during social evolution, facilitating the division of labor among social insect workers. Despite substantial empirical support, the generality of this hypothesis is not un...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-11-95

    authors: Graham AM,Munday MD,Kaftanoglu O,Page RE Jr,Amdam GV,Rueppell O

    update_date:2011-04-13 00:00:00

  • Detecting the molecular scars of evolution in the Mycobacterium tuberculosis complex by analyzing interrupted coding sequences.

    abstract:BACKGROUND:Computer-assisted analyses have shown that all bacterial genomes contain a small percentage of open reading frames with a frameshift or in-frame stop codon. We report here a comparative analysis of these interrupted coding sequences (ICDSs) in six isolates of M. tuberculosis, two of M. bovis and one of M. afr...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-8-78

    authors: Deshayes C,Perrodou E,Euphrasie D,Frapy E,Poch O,Bifani P,Lecompte O,Reyrat JM

    update_date:2008-03-06 00:00:00

  • Barcoding success as a function of phylogenetic relatedness in Viburnum, a clade of woody angiosperms.

    abstract:BACKGROUND:The chloroplast genes matK and rbcL have been proposed as a "core" DNA barcode for identifying plant species. Published estimates of successful species identification using these loci (70-80%) may be inflated because they may have involved comparisons among distantly related species within target genera. To ...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-12-73

    authors: Clement WL,Donoghue MJ

    update_date:2012-05-30 00:00:00

  • Fused eco29kIR- and M genes coding for a fully functional hybrid polypeptide as a model of molecular evolution of restriction-modification systems.

    abstract:BACKGROUND:The discovery of restriction endonucleases and modification DNA methyltransferases, key instruments of genetic engineering, opened a new era of molecular biology through development of the recombinant DNA technology. Today, the number of potential proteins assigned to type II restriction enzymes alone is bey...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-11-35

    authors: Mokrishcheva ML,Solonin AS,Nikitin DV

    update_date:2011-02-03 00:00:00

  • Accelerated evolutionary rates in tropical and oceanic parmelioid lichens (Ascomycota).

    abstract:BACKGROUND:The rate of nucleotide substitutions is not constant across the Tree of Life, and departures from a molecular clock have been commonly reported. Within parmelioid lichens, the largest group of macrolichens, large discrepancies in branch lengths between clades were found in previous studies. Using an extended...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-8-257

    authors: Lumbsch HT,Hipp AL,Divakar PK,Blanco O,Crespo A

    update_date:2008-09-22 00:00:00

  • Timeframe of speciation inferred from secondary contact zones in the European tree frog radiation (Hyla arborea group).

    abstract:BACKGROUND:Hybridization between incipient species is expected to become progressively limited as their genetic divergence increases and reproductive isolation proceeds. Amphibian radiations and their secondary contact zones are useful models to infer the timeframes of speciation, but empirical data from natural system...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-015-0385-2

    authors: Dufresnes C,Brelsford A,Crnobrnja-Isailović J,Tzankov N,Lymberakis P,Perrin N

    update_date:2015-08-08 00:00:00

  • Evolution of plant senescence.

    abstract:BACKGROUND:Senescence is integral to the flowering plant life-cycle. Senescence-like processes occur also in non-angiosperm land plants, algae and photosynthetic prokaryotes. Increasing numbers of genes have been assigned functions in the regulation and execution of angiosperm senescence. At the same time there has bee...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-9-163

    authors: Thomas H,Huang L,Young M,Ougham H

    update_date:2009-07-14 00:00:00

  • Acetylcholinesterase alterations reveal the fitness cost of mutations conferring insecticide resistance.

    abstract:BACKGROUND:Insecticide resistance is now common in insects due to the frequent use of chemicals to control them, which provides a useful tool to study the adaptation of eukaryotic genome to new environments. Although numerous potential mutations may provide high level of resistance, only few alleles are found in insect...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-4-5

    authors: Shi MA,Lougarre A,Alies C,Frémaux I,Tang ZH,Stojan J,Fournier D

    update_date:2004-02-06 00:00:00

  • Patterns of kinesin evolution reveal a complex ancestral eukaryote with a multifunctional cytoskeleton.

    abstract:BACKGROUND:The genesis of the eukaryotes was a pivotal event in evolution and was accompanied by the acquisition of numerous new cellular features including compartmentalization by cytoplasmic organelles, mitosis and meiosis, and ciliary motility. Essential for the development of these features was the tubulin cytoskel...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-10-110

    authors: Wickstead B,Gull K,Richards TA

    update_date:2010-04-27 00:00:00

  • The explosive radiation of Cheirolophus (Asteraceae, Cardueae) in Macaronesia.

    abstract:BACKGROUND:Considered a biodiversity hotspot, the Canary Islands have been the key subjects of numerous evolutionary studies concerning a large variety of organisms. The genus Cheirolophus (Asteraceae) represents one of the largest plant radiations in the Canarian archipelago. In contrast, only a few species occur in t...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-14-118

    authors: Vitales D,Garnatje T,Pellicer J,Vallès J,Santos-Guerra A,Sanmartín I

    update_date:2014-06-02 00:00:00

  • Phylogenetic analysis of the vertebrate excitatory/neutral amino acid transporter (SLC1/EAAT) family reveals lineage specific subfamilies.

    abstract:BACKGROUND:The composition and expression of vertebrate gene families is shaped by species specific gene loss in combination with a number of gene and genome duplication events (R1, R2 in all vertebrates, R3 in teleosts) and depends on the ecological and evolutionary context. In this study we analyzed the evolutionary ...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-10-117

    authors: Gesemann M,Lesslauer A,Maurer CM,Schönthaler HB,Neuhauss SC

    update_date:2010-04-29 00:00:00

  • Genetic structure and bio-climatic modeling support allopatric over parapatric speciation along a latitudinal gradient.

    abstract:BACKGROUND:Four of the five species of Telopea (Proteaceae) are distributed in a latitudinal replacement pattern on the south-eastern Australian mainland. In similar circumstances, a simple allopatric speciation model that identifies the origins of genetic isolation within temporal geographic separation is considered a...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-12-149

    authors: Rossetto M,Allen CB,Thurlby KA,Weston PH,Milner ML

    update_date:2012-08-20 00:00:00

  • A phylogenomic profile of hemerythrins, the nonheme diiron binding respiratory proteins.

    abstract:BACKGROUND:Hemerythrins are the non-heme, diiron binding respiratory proteins of brachiopods, priapulids and sipunculans; they are also found in annelids and bacteria, where their functions have not been fully elucidated. RESULTS:A search for putative Hrs in the genomes of 43 archaea, 444 bacteria and 135 eukaryotes,...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-8-244

    authors: Bailly X,Vanin S,Chabasse C,Mizuguchi K,Vinogradov SN

    update_date:2008-09-02 00:00:00

  • Population structure and plumage polymorphism: The intraspecific evolutionary relationships of a polymorphic raptor, Buteo jamaicensis harlani.

    abstract:BACKGROUND:Phenotypic and molecular genetic data often provide conflicting patterns of intraspecific relationships confounding phylogenetic inference, particularly among birds where a variety of environmental factors may influence plumage characters. Among diurnal raptors, the taxonomic relationship of Buteo jamaicensi...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-10-224

    authors: Hull JM,Mindell DP,Talbot SL,Kay EH,Hoekstra HE,Ernest HB

    update_date:2010-07-22 00:00:00

  • The influence of body size and net diversification rate on molecular evolution during the radiation of animal phyla.

    abstract:BACKGROUND:Molecular clock dates, which place the origin of animal phyla deep in the Precambrian, have been used to reject the hypothesis of a rapid evolutionary radiation of animal phyla supported by the fossil record. One possible explanation of the discrepancy is the potential for fast substitution rates early in th...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-7-95

    authors: Fontanillas E,Welch JJ,Thomas JA,Bromham L

    update_date:2007-06-26 00:00:00

  • The repertoire of G protein-coupled receptors in the sea squirt Ciona intestinalis.

    abstract:BACKGROUND:G protein-coupled receptors (GPCRs) constitute a large family of integral transmembrane receptor proteins that play a central role in signal transduction in eukaryotes. The genome of the protochordate Ciona intestinalis has a compact size with an ancestral complement of many diversified gene families of vert...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-8-129

    authors: Kamesh N,Aradhyam GK,Manoj N

    update_date:2008-05-01 00:00:00

  • Natural selection drove metabolic specialization of the chromatophore in Paulinella chromatophora.

    abstract:BACKGROUND:Genome degradation of host-restricted mutualistic endosymbionts has been attributed to inactivating mutations and genetic drift while genes coding for host-relevant functions are conserved by purifying selection. Unlike their free-living relatives, the metabolism of mutualistic endosymbionts and endosymbiont...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-017-0947-6

    authors: Valadez-Cano C,Olivares-Hernández R,Resendis-Antonio O,DeLuna A,Delaye L

    update_date:2017-04-14 00:00:00

  • Reticulate phylogeny of gastropod-shell-breeding cichlids from Lake Tanganyika--the result of repeated introgressive hybridization.

    abstract:BACKGROUND:The tribe Lamprologini is the major substrate breeding lineage of Lake Tanganyika's cichlid species flock. Among several different life history strategies found in lamprologines, the adaptation to live and breed in empty gastropod shells is probably the most peculiar. Although shell-breeding arose several ti...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-7-7

    authors: Koblmüller S,Duftner N,Sefc KM,Aibara M,Stipacek M,Blanc M,Egger B,Sturmbauer C

    update_date:2007-01-25 00:00:00

  • The first known fossil Uma: ecological evolution and the origins of North American fringe-toed lizards.

    abstract:BACKGROUND:Fossil evidence suggests that extant North American lizard genera (north of Mexico) evolved during the Miocene. Although fossils of the clade Phrynosomatidae (spiny lizards and sand lizards) have been reported, there have been no previously described fossils of the fringe-toed sand lizards (Uma). In the exta...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-019-1501-5

    authors: Scarpetta SG

    update_date:2019-09-06 00:00:00

  • Synonymous site conservation in the HIV-1 genome.

    abstract:BACKGROUND:Synonymous or silent mutations are usually thought to evolve neutrally. However, accumulating recent evidence has demonstrated that silent mutations may destabilize RNA structures or disrupt cis regulatory motifs superimposed on coding sequences. Such observations suggest the existence of stretches of codon ...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-13-164

    authors: Mayrose I,Stern A,Burdelova EO,Sabo Y,Laham-Karam N,Zamostiano R,Bacharach E,Pupko T

    update_date:2013-08-04 00:00:00

  • Cooperators Unite! Assortative linking promotes cooperation particularly for medium sized associations.

    abstract:BACKGROUND:Evolution of cooperative behaviour is widely studied in different models where interaction is heterogeneous, although static among individuals. However, in nature individuals can often recognize each other and choose, besides to cooperate or not, to preferentially associate with or to avoid certain individual...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-10-173

    authors: Kun A,Boza G,Scheuring I

    update_date:2010-06-11 00:00:00

  • Molecular evolution of the vertebrate TLR1 gene family--a complex history of gene duplication, gene conversion, positive selection and co-evolution.

    abstract:BACKGROUND:The Toll-like receptors represent a large superfamily of type I transmembrane glycoproteins, some common to a wide range of species and others are more restricted in their distribution. Most members of the Toll-like receptor superfamily have few paralogues; the exception is the TLR1 gene family with four clo...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-11-149

    authors: Huang Y,Temperley ND,Ren L,Smith J,Li N,Burt DW

    update_date:2011-05-28 00:00:00

  • Chromosome painting in three-toed sloths: a cytogenetic signature and ancestral karyotype for Xenarthra.

    abstract:BACKGROUND:Xenarthra (sloths, armadillos and anteaters) represent one of four currently recognized Eutherian mammal supraorders. Some phylogenomic studies point to the possibility of Xenarthra being at the base of the Eutherian tree, together or not with the supraorder Afrotheria. We performed painting with human autos...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-12-36

    authors: Azevedo NF,Svartman M,Manchester A,de Moraes-Barros N,Stanyon R,Vianna-Morgante AM

    update_date:2012-03-19 00:00:00

  • You don't have the guts: a diverse set of fungi survive passage through Macrotermes bellicosus termite guts.

    abstract:BACKGROUND:Monoculture farming poses significant disease challenges, but fungus-farming termites are able to successfully keep their monoculture crop free from contamination by other fungi. It has been hypothesised that obligate gut passage of all plant substrate used to manure the fungal symbiont is key to accomplish ...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-020-01727-z

    authors: Bos N,Guimaraes L,Palenzuela R,Renelies-Hamilton J,Maccario L,Silue SK,Koné N'A,Poulsen M

    update_date:2020-12-09 00:00:00

  • Environment-dependent microevolution in a Mediterranean pine (Pinus pinaster Aiton).

    abstract:BACKGROUND:A central question for understanding the evolutionary responses of plant species to rapidly changing environments is the assessment of their potential for short-term (in one or a few generations) genetic change. In our study, we consider the case of Pinus pinaster Aiton (maritime pine), a widespread Mediterr...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-014-0200-5

    authors: Alía R,Chambel R,Notivol E,Climent J,González-Martínez SC

    update_date:2014-09-23 00:00:00

  • Reduced alphabet of prebiotic amino acids optimally encodes the conformational space of diverse extant protein folds.

    abstract:BACKGROUND:There is wide agreement that only a subset of the twenty standard amino acids existed prebiotically in sufficient concentrations to form functional polypeptides. We ask how this subset, postulated as {A,D,E,G,I,L,P,S,T,V}, could have formed structures stable enough to found metabolic pathways. Inspired by al...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-019-1464-6

    authors: Solis AD

    update_date:2019-07-30 00:00:00

  • Whole chloroplast genome and gene locus phylogenies reveal the taxonomic placement and relationship of Tripidium (Panicoideae: Andropogoneae) to sugarcane.

    abstract:BACKGROUND:For over 50 years, attempts have been made to introgress agronomically useful traits from Erianthus sect. Ripidium (Tripidium) species into sugarcane based on both genera being part of the 'Saccharum Complex', an interbreeding group of species believed to be involved in the origins of sugarcane. However, rec...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/s12862-019-1356-9

    authors: Lloyd Evans D,Joshi SV,Wang J

    update_date:2019-01-25 00:00:00

  • Phylogeny, structural evolution and functional diversification of the plant PHOSPHATE1 gene family: a focus on Glycine max.

    abstract:BACKGROUND:PHOSPHATE1 (PHO1) gene family members have diverse roles in plant growth and development, and they have been studied in Arabidopsis, rice, and Physcomitrella. However, it has yet to be described in other plants. Therefore, we surveyed the evolutionary patterns of genomes within the plant PHO1 gene family, fo...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-13-103

    authors: He L,Zhao M,Wang Y,Gai J,He C

    update_date:2013-05-24 00:00:00

  • The role of chromosome variation in the speciation of the red brocket deer complex: the study of reproductive isolation in females.

    abstract:BACKGROUND:The red brocket deer, Mazama americana, has at least six distinct karyotypes in different regions of South America that suggest the existence of various species that are today all referred to as M. americana. From an evolutionary perspective, the red brockets are a relatively recent clade that has gone throu...

    journal_title:BMC evolutionary biology

    pub_type: Journal Article

    doi:10.1186/1471-2148-14-40

    authors: Cursino MS,Salviano MB,Abril VV,Zanetti Edos S,Duarte JM

    update_date:2014-03-04 00:00:00