Ancestral sequence reconstruction in primate mitochondrial DNA: compositional bias and effect on functional inference.

Abstract:

:Reconstruction of ancestral DNA and amino acid sequences is an important means of inferring information about past evolutionary events. Such reconstructions suggest changes in molecular function and evolutionary processes over the course of evolution and are used to infer adaptation and convergence. Maximum likelihood (ML) is generally thought to provide relatively accurate reconstructed sequences compared to parsimony, but both methods lead to the inference of multiple directional changes in nucleotide frequencies in primate mitochondrial DNA (mtDNA). To better understand this surprising result, as well as to better understand how parsimony and ML differ, we constructed a series of computationally simple "conditional pathway" methods that differed in the number of substitutions allowed per site along each branch, and we also evaluated the entire Bayesian posterior frequency distribution of reconstructed ancestral states. We analyzed primate mitochondrial cytochrome b (Cyt-b) and cytochrome oxidase subunit I (COI) genes and found that ML reconstructs ancestral frequencies that are often more different from tip sequences than are parsimony reconstructions. In contrast, frequency reconstructions based on the posterior ensemble more closely resemble extant nucleotide frequencies. Simulations indicate that these differences in ancestral sequence inference are probably due to deterministic bias caused by high uncertainty in the optimization-based ancestral reconstruction methods (parsimony, ML, Bayesian maximum a posteriori). In contrast, ancestral nucleotide frequencies based on an average of the Bayesian set of credible ancestral sequences are much less biased. The methods involving simpler conditional pathway calculations have slightly reduced likelihood values compared to full likelihood calculations, but they can provide fairly unbiased nucleotide reconstructions and may be useful in more complex phylogenetic analyses than considered here due to their speed and flexibility. To determine whether biased reconstructions using optimization methods might affect inferences of functional properties, ancestral primate mitochondrial tRNA sequences were inferred and helix-forming propensities for conserved pairs were evaluated in silico. For ambiguously reconstructed nucleotides at sites with high base composition variability, ancestral tRNA sequences from Bayesian analyses were more compatible with canonical base pairing than were those inferred by other methods. Thus, nucleotide bias in reconstructed sequences apparently can lead to serious bias and inaccuracies in functional predictions.

journal_name

Mol Biol Evol

authors

Krishnan NM,Seligmann H,Stewart CB,De Koning AP,Pollock DD

doi

10.1093/molbev/msh198

subject

Has Abstract

pub_date

2004-10-01 00:00:00

pages

1871-83

issue

10

eissn

0737-4038

issn

1537-1719

pii

msh198

journal_volume

21

pub_type

杂志文章
  • Proteomics and comparative genomic investigations reveal heterogeneity in evolutionary rate of male reproductive proteins in mice (Mus domesticus).

    abstract::Male reproductive fitness is strongly affected by seminal fluid. In addition to interacting with the female environment, seminal fluid mediates important physiological characteristics of sperm, including capacitation and motility. In mammals, the male reproductive tract shows a striking degree of compartmentalization,...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msp094

    authors: Dean MD,Clark NL,Findlay GD,Karn RC,Yi X,Swanson WJ,MacCoss MJ,Nachman MW

    更新日期:2009-08-01 00:00:00

  • Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree.

    abstract::The relative efficiencies of the maximum parsimony (MP) and distance-matrix methods in obtaining the correct tree (topology) were studied by using computer simulation. The distance-matrix methods examined are the neighbor-joining, distance-Wagner, Tateno et al. modified Farris, Faith, and Li methods. In the computer s...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040497

    authors: Sourdis J,Nei M

    更新日期:1988-05-01 00:00:00

  • Amino Acid metabolism conflicts with protein diversity.

    abstract::The 20 protein-coding amino acids are found in proteomes with different relative abundances. The most abundant amino acid, leucine, is nearly an order of magnitude more prevalent than the least abundant amino acid, cysteine. Amino acid metabolic costs differ similarly, constraining their incorporation into proteins. O...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msu228

    authors: Krick T,Verstraete N,Alonso LG,Shub DA,Ferreiro DU,Shub M,Sánchez IE

    更新日期:2014-11-01 00:00:00

  • Colocality to Cofunctionality: Eukaryotic Gene Neighborhoods as a Resource for Function Discovery.

    abstract::Diverging from the classic paradigm of random gene order in eukaryotes, gene proximity can be leveraged to systematically identify functionally related gene neighborhoods in eukaryotes, utilizing techniques pioneered in bacteria. Current methods of identifying gene neighborhoods typically rely on sequence similarity t...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msaa221

    authors: Foflonker F,Blaby-Haas CE

    更新日期:2021-01-23 00:00:00

  • Efficient implementation of MrBayes on multi-GPU.

    abstract::MrBayes, using Metropolis-coupled Markov chain Monte Carlo (MCMCMC or (MC)(3)), is a popular program for Bayesian inference. As a leading method of using DNA data to infer phylogeny, the (MC)(3) Bayesian algorithm and its improved and parallel versions are now not fast enough for biologists to analyze massive real-wor...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/mst043

    authors: Bao J,Xia H,Zhou J,Liu X,Wang G

    更新日期:2013-06-01 00:00:00

  • Evolutionary history of true crabs (Crustacea: Decapoda: Brachyura) and the origin of freshwater crabs.

    abstract::Crabs of the infra-order Brachyura are one of the most diverse groups of crustaceans with approximately 7,000 described species in 98 families, occurring in marine, freshwater, and terrestrial habitats. The relationships among the brachyuran families are poorly understood due to the high morphological complexity of th...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msu068

    authors: Tsang LM,Schubart CD,Ahyong ST,Lai JC,Au EY,Chan TY,Ng PK,Chu KH

    更新日期:2014-05-01 00:00:00

  • Evolution of the TIR domain-containing adaptors in humans: swinging between constraint and adaptation.

    abstract::Natural selection is expected to act strongly on immune system genes as hosts adapt to novel, diverse, and coevolving pathogens. Population genetic studies of host defense genes with parallel functions in model organisms have revealed distinct evolutionary histories among the different components-receptors, adaptors, ...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msr137

    authors: Fornarino S,Laval G,Barreiro LB,Manry J,Vasseur E,Quintana-Murci L

    更新日期:2011-11-01 00:00:00

  • Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005. Genome-wide search of gene conversions in duplicated genes of mouse and rat.

    abstract::Gene conversion is considered to play important roles in the formation of genomic makeup such as homogenization of multigene families and diversification of alleles. We devised two statistical tests on quartets for detecting gene conversion events. Each "quartet" consists of two pairs of orthologous sequences supposed...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msj093

    authors: Ezawa K,OOta S,Saitou N,SMBE Tri-National Young Investigators.

    更新日期:2006-05-01 00:00:00

  • Performance of a new invariants method on homogeneous and nonhomogeneous quartet trees.

    abstract::An attempt to use phylogenetic invariants for tree reconstruction was made at the end of the 80s and the beginning of the 90s by several researchers (the initial idea due to Lake [1987] and Cavender and Felsenstein [1987]). However, the efficiency of methods based on invariants is still in doubt (Huelsenbeck 1995; Jin...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msl153

    authors: Casanellas M,Fernández-Sánchez J

    更新日期:2007-01-01 00:00:00

  • The robustness of two phylogenetic methods: four-taxon simulations reveal a slight superiority of maximum likelihood over neighbor joining.

    abstract::The robustness (sensitivity to violation of assumptions) of the maximum-likelihood and neighbor-joining methods was examined using simulation. Maximum likelihood and neighbor joining were implemented with Jukes-Cantor, Kimura, and gamma models of DNA substitution. Simulations were performed in which the assumptions of...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040261

    authors: Huelsenbeck JP

    更新日期:1995-09-01 00:00:00

  • Origin of Nogo-A by domain shuffling in an early jawed vertebrate.

    abstract::Unlike mammals, fish are able to regenerate axons in their central nervous system. This difference has been partly attributed to the loss/acquisition of inhibitory proteins during evolution. Nogo-A--the longest isoform of the reticulon4 (rtn4) gene product--is commonly found in mammalian myelin where it acts as a pote...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msq313

    authors: Shypitsyna A,Málaga-Trillo E,Reuter A,Stuermer CA

    更新日期:2011-04-01 00:00:00

  • Evolution of primate ABO blood group genes and their homologous genes.

    abstract::There are three common alleles (A, B, and O) at the human ABO blood group locus. We compared nucleotide sequences of these alleles, and relatively large numbers of nucleotide differences were found among them. These differences correspond to the divergence time of at least a few million years, which is unusually large...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a025776

    authors: Saitou N,Yamamoto F

    更新日期:1997-04-01 00:00:00

  • Patterns of sequence variation in the mitochondrial D-loop region of shrews.

    abstract::Direct sequencing of the mitochondrial displacement loop (D-loop) of shrews (genus Sorex) for the region between the tRNA(Pro) and the conserved sequence block-F revealed variable numbers of 79-bp tandem repeats. These repeats were found in all 19 individuals sequenced, representing three subspecies and one closely re...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040096

    authors: Stewart DT,Baker AJ

    更新日期:1994-01-01 00:00:00

  • QNet: an agglomerative method for the construction of phylogenetic networks from weighted quartets.

    abstract::We present QNet, a method for constructing split networks from weighted quartet trees. QNet can be viewed as a quartet analogue of the distance-based Neighbor-Net (NNet) method for network construction. Just as NNet, QNet works by agglomeratively computing a collection of circular weighted splits of the taxa set which...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msl180

    authors: Grünewald S,Forslund K,Dress A,Moulton V

    更新日期:2007-02-01 00:00:00

  • Theoretical foundation of the minimum-evolution method of phylogenetic inference.

    abstract::The minimum-evolution (ME) method of phylogenetic inference is based on the assumption that the tree with the smallest sum of branch length estimates is most likely to be the true one. In the past this assumption has been used without mathematical proof. Here we present the theoretical basis of this method by showing ...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040056

    authors: Rzhetsky A,Nei M

    更新日期:1993-09-01 00:00:00

  • First-Step Mutations during Adaptation Restore the Expression of Hundreds of Genes.

    abstract::The temporal change of phenotypes during the adaptive process remains largely unexplored, as do the genetic changes that affect these phenotypic changes. Here we focused on three mutations that rose to high frequency in the early stages of adaptation within 12 Escherichia coli populations subjected to thermal stress (...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msv228

    authors: Rodríguez-Verdugo A,Tenaillon O,Gaut BS

    更新日期:2016-01-01 00:00:00

  • Local adaptation and vector-mediated population structure in Plasmodium vivax malaria.

    abstract::Plasmodium vivax in southern Mexico exhibits different infectivities to 2 local mosquito vectors, Anopheles pseudopunctipennis and Anopheles albimanus. Previous work has tied these differences in mosquito infectivity to variation in the central repeat motif of the malaria parasite's circumsporozoite (csp) gene, but su...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msn073

    authors: Joy DA,Gonzalez-Ceron L,Carlton JM,Gueye A,Fay M,McCutchan TF,Su XZ

    更新日期:2008-06-01 00:00:00

  • Rooting the tree of life using nonubiquitous genes.

    abstract::Insertion and deletion (indel)-based analyses have great potential for rooting the tree of life, but their use has been limited because they require ubiquitous sequences that have not been horizontally/laterally transferred. Very few such sequences exist. Here we describe and demonstrate a new algorithm that can use n...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msl140

    authors: Lake JA,Herbold CW,Rivera MC,Servin JA,Skophammer RG

    更新日期:2007-01-01 00:00:00

  • A Working Model of the Deep Relationships of Diverse Modern Human Genetic Lineages Outside of Africa.

    abstract::A major topic of interest in human prehistory is how the large-scale genetic structure of modern populations outside of Africa was established. Demographic models have been developed that capture the relationships among small numbers of populations or within particular geographical regions, but constructing a phylogen...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msw293

    authors: Lipson M,Reich D

    更新日期:2017-04-01 00:00:00

  • Positions of multiple insertions in SSU rDNA of lichen-forming fungi.

    abstract::Lichen-forming fungi, in symbiotic associations with algae, frequently have nuclear small subunit ribosomal DNA (SSU rDNA) longer than the 1,800 nucleotides typical for eukaryotes. The lichen-forming ascomycetous fungus Lecanora dispersa contains insertions at eight distinct positions of its SSU rDNA; the lichen-formi...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040199

    authors: Gargas A,DePriest PT,Taylor JW

    更新日期:1995-03-01 00:00:00

  • MitoFish and MiFish Pipeline: A Mitochondrial Genome Database of Fish with an Analysis Pipeline for Environmental DNA Metabarcoding.

    abstract::Fish mitochondrial genome (mitogenome) data form a fundamental basis for revealing vertebrate evolution and hydrosphere ecology. Here, we report recent functional updates of MitoFish, which is a database of fish mitogenomes with a precise annotation pipeline MitoAnnotator. Most importantly, we describe implementation ...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msy074

    authors: Sato Y,Miya M,Fukunaga T,Sado T,Iwasaki W

    更新日期:2018-06-01 00:00:00

  • Positively selected disease response orthologous gene sets in the cereals identified using Sorghum bicolor L. Moench expression profiles and comparative genomics.

    abstract::Disease response genes (DRGs) diverge under recurrent positive selection as a result of a molecular arms race between hosts and pathogens. Most of these studies were conducted in animals, and few defense genes have been shown to evolve adaptively in plants. To test for adaptation in the molecules mediating disease res...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msp114

    authors: Zamora A,Sun Q,Hamblin MT,Aquadro CF,Kresovich S

    更新日期:2009-09-01 00:00:00

  • In Planta Recapitulation of Isoprene Synthase Evolution from Ocimene Synthases.

    abstract::Isoprene is the most abundant biogenic volatile hydrocarbon compound naturally emitted by plants and plays a major role in atmospheric chemistry. It has been proposed that isoprene synthases (IspS) may readily evolve from other terpene synthases, but this hypothesis has not been experimentally investigated. We isolate...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msx178

    authors: Li M,Xu J,Algarra Alarcon A,Carlin S,Barbaro E,Cappellin L,Velikova V,Vrhovsek U,Loreto F,Varotto C

    更新日期:2017-10-01 00:00:00

  • Phylogenetic Clustering by Linear Integer Programming (PhyCLIP).

    abstract::Subspecies nomenclature systems of pathogens are increasingly based on sequence data. The use of phylogenetics to identify and differentiate between clusters of genetically similar pathogens is particularly prevalent in virology from the nomenclature of human papillomaviruses to highly pathogenic avian influenza (HPAI...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msz053

    authors: Han AX,Parker E,Scholer F,Maurer-Stroh S,Russell CA

    更新日期:2019-07-01 00:00:00

  • Gene Tree Discordance Does Not Explain Away the Temporal Decline of Convergence in Mammalian Protein Sequence Evolution.

    abstract::Several authors reported lower frequencies of protein sequence convergence between more distantly related evolutionary lineages and attributed this trend to epistasis, which renders the acceptable amino acids at a site more different and convergence less likely in more divergent lineages. A recent primate study, howev...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msx109

    authors: Zou Z,Zhang J

    更新日期:2017-07-01 00:00:00

  • Keeping it local: evidence for positive selection in Swedish Arabidopsis thaliana.

    abstract::Detecting positive selection in species with heterogeneous habitats and complex demography is notoriously difficult and prone to statistical biases. The model plant Arabidopsis thaliana exemplifies this problem: In spite of the large amounts of data, little evidence for classic selective sweeps has been found. Moreove...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msu247

    authors: Huber CD,Nordborg M,Hermisson J,Hellmann I

    更新日期:2014-11-01 00:00:00

  • Heterotypy in the N-terminal region of growth/differentiation factor 5 (GDF5) mature protein during teleost evolution.

    abstract::Heterotypy is now recognized as a generative force in the formation of new proteins through modification of existing proteins. We report that heterotypy in the N-terminal region of the mature growth/differentiation factor 5 (GDF5) protein occurred during evolution of teleosts. N-terminal length variation of GDF5 was f...

    journal_title:Molecular biology and evolution

    pub_type: 信件,评审

    doi:10.1093/molbev/msn041

    authors: Fujimura K,Terai Y,Ishiguro N,Miya M,Nishida M,Okada N

    更新日期:2008-05-01 00:00:00

  • Retortamonad flagellates are closely related to diplomonads--implications for the history of mitochondrial function in eukaryote evolution.

    abstract::We present the first molecular phylogenetic examination of the evolutionary position of retortamonads, a group of mitochondrion-lacking flagellates usually found as commensals of the intestinal tracts of vertebrates. Our phylogenies include small subunit ribosomal gene sequences from six retortamonad isolates-four fro...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a004135

    authors: Silberman JD,Simpson AG,Kulda J,Cepicka I,Hampl V,Johnson PJ,Roger AJ

    更新日期:2002-05-01 00:00:00

  • FADS1 and the Timing of Human Adaptation to Agriculture.

    abstract::Variation at the FADS1/FADS2 gene cluster is functionally associated with differences in lipid metabolism and is often hypothesized to reflect adaptation to an agricultural diet. Here, we test the evidence for this relationship using both modern and ancient DNA data. We show that almost all the inhabitants of Europe c...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msy180

    authors: Mathieson S,Mathieson I

    更新日期:2018-12-01 00:00:00

  • Molecular Biology and Evolution of Cancer: From Discovery to Action.

    abstract::Cancer progression is an evolutionary process. During this process, evolving cancer cell populations encounter restrictive ecological niches within the body, such as the primary tumor, circulatory system, and diverse metastatic sites. Efforts to prevent or delay cancer evolution-and progression-require a deep understa...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msz242

    authors: Somarelli JA,Gardner H,Cannataro VL,Gunady EF,Boddy AM,Johnson NA,Fisk JN,Gaffney SG,Chuang JH,Li S,Ciccarelli FD,Panchenko AR,Megquier K,Kumar S,Dornburg A,DeGregori J,Townsend JP

    更新日期:2020-02-01 00:00:00