Abstract:
BACKGROUND:Widely used substitution models for proteins, such as the Jones-Taylor-Thornton (JTT) or Whelan and Goldman (WAG) models, are based on empirical amino acid interchange matrices estimated from databases of protein alignments that incorporate the average amino acid frequencies of the data set under examination (e.g JTT + F). Variation in the evolutionary process between sites is typically modelled by a rates-across-sites distribution such as the gamma (Gamma) distribution. However, sites in proteins also vary in the kinds of amino acid interchanges that are favoured, a feature that is ignored by standard empirical substitution matrices. Here we examine the degree to which the pattern of evolution at sites differs from that expected based on empirical amino acid substitution models and evaluate the impact of these deviations on phylogenetic estimation. RESULTS:We analyzed 21 large protein alignments with two statistical tests designed to detect deviation of site-specific amino acid distributions from data simulated under the standard empirical substitution model: JTT+ F + Gamma. We found that the number of states at a given site is, on average, smaller and the frequencies of these states are less uniform than expected based on a JTT + F + Gamma substitution model. With a four-taxon example, we show that phylogenetic estimation under the JTT + F + Gamma model is seriously biased by a long-branch attraction artefact if the data are simulated under a model utilizing the observed site-specific amino acid frequencies from an alignment. Principal components analyses indicate the existence of at least four major site-specific frequency classes in these 21 protein alignments. Using a mixture model with these four separate classes of site-specific state frequencies plus a fifth class of global frequencies (the JTT + cF + Gamma model), significant improvements in model fit for real data sets can be achieved. This simple mixture model also reduces the long-branch attraction problem, as shown by simulations and analyses of a real phylogenomic data set. CONCLUSION:Protein families display site-specific evolutionary dynamics that are ignored by standard protein phylogenetic models. Accurate estimation of protein phylogenies requires models that accommodate the heterogeneity in the evolutionary process across sites. To this end, we have implemented a class frequency mixture model (cF) in a freely available program called QmmRAxML for phylogenetic estimation.
journal_name
BMC Evol Bioljournal_title
BMC evolutionary biologyauthors
Wang HC,Li K,Susko E,Roger AJdoi
10.1186/1471-2148-8-331subject
Has Abstractpub_date
2008-12-16 00:00:00pages
331issn
1471-2148pii
1471-2148-8-331journal_volume
8pub_type
杂志文章abstract:BACKGROUND:Molecular and genetic analyses conducted in model organisms such as Drosophila and vertebrates, have provided a wealth of information about how networks of transcription factors control the proper development of these species. Much less is known, however, about the evolutionary origin of these elaborated net...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-7-33
更新日期:2007-03-02 00:00:00
abstract:BACKGROUND:The reproductive ground plan hypothesis of social evolution suggests that reproductive controls of a solitary ancestor have been co-opted during social evolution, facilitating the division of labor among social insect workers. Despite substantial empirical support, the generality of this hypothesis is not un...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-11-95
更新日期:2011-04-13 00:00:00
abstract:BACKGROUND:Computer-assisted analyses have shown that all bacterial genomes contain a small percentage of open reading frames with a frameshift or in-frame stop codon We report here a comparative analysis of these interrupted coding sequences (ICDSs) in six isolates of M. tuberculosis, two of M. bovis and one of M. afr...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-8-78
更新日期:2008-03-06 00:00:00
abstract:BACKGROUND:The chloroplast genes matK and rbcL have been proposed as a "core" DNA barcode for identifying plant species. Published estimates of successful species identification using these loci (70-80%) may be inflated because they may have involved comparisons among distantly related species within target genera. To ...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-12-73
更新日期:2012-05-30 00:00:00
abstract:BACKGROUND:The discovery of restriction endonucleases and modification DNA methyltransferases, key instruments of genetic engineering, opened a new era of molecular biology through development of the recombinant DNA technology. Today, the number of potential proteins assigned to type II restriction enzymes alone is bey...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-11-35
更新日期:2011-02-03 00:00:00
abstract:BACKGROUND:The rate of nucleotide substitutions is not constant across the Tree of Life, and departures from a molecular clock have been commonly reported. Within parmelioid lichens, the largest group of macrolichens, large discrepancies in branch lengths between clades were found in previous studies. Using an extended...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-8-257
更新日期:2008-09-22 00:00:00
abstract:BACKGROUND:Hybridization between incipient species is expected to become progressively limited as their genetic divergence increases and reproductive isolation proceeds. Amphibian radiations and their secondary contact zones are useful models to infer the timeframes of speciation, but empirical data from natural system...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-015-0385-2
更新日期:2015-08-08 00:00:00
abstract:BACKGROUND:Senescence is integral to the flowering plant life-cycle. Senescence-like processes occur also in non-angiosperm land plants, algae and photosynthetic prokaryotes. Increasing numbers of genes have been assigned functions in the regulation and execution of angiosperm senescence. At the same time there has bee...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-9-163
更新日期:2009-07-14 00:00:00
abstract:BACKGROUND:Insecticide resistance is now common in insects due to the frequent use of chemicals to control them, which provides a useful tool to study the adaptation of eukaryotic genome to new environments. Although numerous potential mutations may provide high level of resistance, only few alleles are found in insect...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-4-5
更新日期:2004-02-06 00:00:00
abstract:BACKGROUND:The genesis of the eukaryotes was a pivotal event in evolution and was accompanied by the acquisition of numerous new cellular features including compartmentalization by cytoplasmic organelles, mitosis and meiosis, and ciliary motility. Essential for the development of these features was the tubulin cytoskel...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-10-110
更新日期:2010-04-27 00:00:00
abstract:BACKGROUND:Considered a biodiversity hotspot, the Canary Islands have been the key subjects of numerous evolutionary studies concerning a large variety of organisms. The genus Cheirolophus (Asteraceae) represents one of the largest plant radiations in the Canarian archipelago. In contrast, only a few species occur in t...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-14-118
更新日期:2014-06-02 00:00:00
abstract:BACKGROUND:The composition and expression of vertebrate gene families is shaped by species specific gene loss in combination with a number of gene and genome duplication events (R1, R2 in all vertebrates, R3 in teleosts) and depends on the ecological and evolutionary context. In this study we analyzed the evolutionary ...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-10-117
更新日期:2010-04-29 00:00:00
abstract:BACKGROUND:Four of the five species of Telopea (Proteaceae) are distributed in a latitudinal replacement pattern on the south-eastern Australian mainland. In similar circumstances, a simple allopatric speciation model that identifies the origins of genetic isolation within temporal geographic separation is considered a...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-12-149
更新日期:2012-08-20 00:00:00
abstract:BACKGROUND:Hemerythrins, are the non-heme, diiron binding respiratory proteins of brachiopods, priapulids and sipunculans; they are also found in annelids and bacteria, where their functions have not been fully elucidated. RESULTS:A search for putative Hrs in the genomes of 43 archaea, 444 bacteria and 135 eukaryotes,...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-8-244
更新日期:2008-09-02 00:00:00
abstract:BACKGROUND:Phenotypic and molecular genetic data often provide conflicting patterns of intraspecific relationships confounding phylogenetic inference, particularly among birds where a variety of environmental factors may influence plumage characters. Among diurnal raptors, the taxonomic relationship of Buteo jamaicensi...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-10-224
更新日期:2010-07-22 00:00:00
abstract:BACKGROUND:Molecular clock dates, which place the origin of animal phyla deep in the Precambrian, have been used to reject the hypothesis of a rapid evolutionary radiation of animal phyla supported by the fossil record. One possible explanation of the discrepancy is the potential for fast substitution rates early in th...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-7-95
更新日期:2007-06-26 00:00:00
abstract:BACKGROUND:G protein-coupled receptors (GPCRs) constitute a large family of integral transmembrane receptor proteins that play a central role in signal transduction in eukaryotes. The genome of the protochordate Ciona intestinalis has a compact size with an ancestral complement of many diversified gene families of vert...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-8-129
更新日期:2008-05-01 00:00:00
abstract:BACKGROUND:Genome degradation of host-restricted mutualistic endosymbionts has been attributed to inactivating mutations and genetic drift while genes coding for host-relevant functions are conserved by purifying selection. Unlike their free-living relatives, the metabolism of mutualistic endosymbionts and endosymbiont...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-017-0947-6
更新日期:2017-04-14 00:00:00
abstract:BACKGROUND:The tribe Lamprologini is the major substrate breeding lineage of Lake Tanganyika's cichlid species flock. Among several different life history strategies found in lamprologines, the adaptation to live and breed in empty gastropod shells is probably the most peculiar. Although shell-breeding arose several ti...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-7-7
更新日期:2007-01-25 00:00:00
abstract:BACKGROUND:Fossil evidence suggests that extant North American lizard genera (north of Mexico) evolved during the Miocene. Although fossils of the clade Phrynosomatidae (spiny lizards and sand lizards) have been reported, there have been no previously described fossils of the fringe-toed sand lizards (Uma). In the exta...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-019-1501-5
更新日期:2019-09-06 00:00:00
abstract:BACKGROUND:Synonymous or silent mutations are usually thought to evolve neutrally. However, accumulating recent evidence has demonstrated that silent mutations may destabilize RNA structures or disrupt cis regulatory motifs superimposed on coding sequences. Such observations suggest the existence of stretches of codon ...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-13-164
更新日期:2013-08-04 00:00:00
abstract:BACKGROUND:Evolution of cooperative behaviour is widely studied in different models where interaction is heterogeneous, although static among individuals. However, in nature individuals can often recognize each other and chose, besides to cooperate or not, to preferentially associate with or to avoid certain individual...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-10-173
更新日期:2010-06-11 00:00:00
abstract:BACKGROUND:The Toll-like receptors represent a large superfamily of type I transmembrane glycoproteins, some common to a wide range of species and others are more restricted in their distribution. Most members of the Toll-like receptor superfamily have few paralogues; the exception is the TLR1 gene family with four clo...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-11-149
更新日期:2011-05-28 00:00:00
abstract:BACKGROUND:Xenarthra (sloths, armadillos and anteaters) represent one of four currently recognized Eutherian mammal supraorders. Some phylogenomic studies point to the possibility of Xenarthra being at the base of the Eutherian tree, together or not with the supraorder Afrotheria. We performed painting with human autos...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-12-36
更新日期:2012-03-19 00:00:00
abstract:BACKGROUND:Monoculture farming poses significant disease challenges, but fungus-farming termites are able to successfully keep their monoculture crop free from contamination by other fungi. It has been hypothesised that obligate gut passage of all plant substrate used to manure the fungal symbiont is key to accomplish ...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-020-01727-z
更新日期:2020-12-09 00:00:00
abstract:BACKGROUND:A central question for understanding the evolutionary responses of plant species to rapidly changing environments is the assessment of their potential for short-term (in one or a few generations) genetic change. In our study, we consider the case of Pinus pinaster Aiton (maritime pine), a widespread Mediterr...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-014-0200-5
更新日期:2014-09-23 00:00:00
abstract:BACKGROUND:There is wide agreement that only a subset of the twenty standard amino acids existed prebiotically in sufficient concentrations to form functional polypeptides. We ask how this subset, postulated as {A,D,E,G,I,L,P,S,T,V}, could have formed structures stable enough to found metabolic pathways. Inspired by al...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-019-1464-6
更新日期:2019-07-30 00:00:00
abstract:BACKGROUND:For over 50 years, attempts have been made to introgress agronomically useful traits from Erianthus sect. Ripidium (Tripidium) species into sugarcane based on both genera being part of the 'Saccharum Complex', an interbreeding group of species believed to be involved in the origins of sugarcane. However, rec...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/s12862-019-1356-9
更新日期:2019-01-25 00:00:00
abstract:BACKGROUND:PHOSPHATE1 (PHO1) gene family members have diverse roles in plant growth and development, and they have been studied in Arabidopsis, rice, and Physcomitrella. However, it has yet to be described in other plants. Therefore, we surveyed the evolutionary patterns of genomes within the plant PHO1 gene family, fo...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-13-103
更新日期:2013-05-24 00:00:00
abstract:BACKGROUND:The red brocket deer, Mazama americana, has at least six distinct karyotypes in different regions of South America that suggest the existence of various species that are today all referred to as M. americana. From an evolutionary perspective, the red brockets are a relatively recent clade that has gone throu...
journal_title:BMC evolutionary biology
pub_type: 杂志文章
doi:10.1186/1471-2148-14-40
更新日期:2014-03-04 00:00:00