Evolutionary distances for protein-coding sequences: modeling site-specific residue frequencies.


:Estimation of evolutionary distances from coding sequences must take into account protein-level selection to avoid relative underestimation of longer evolutionary distances. Current modeling of selection via site-to-site rate heterogeneity generally neglects another aspect of selection, namely position-specific amino acid frequencies. These frequencies determine the maximum dissimilarity expected for highly diverged but functionally and structurally conserved sequences, and hence are crucial for estimating long distances. We introduce a codon-level model of coding sequence evolution in which position-specific amino acid frequencies are free parameters. In our implementation, these are estimated from an alignment using methods described previously. We use simulations to demonstrate the importance and feasibility of modeling such behavior; our model produces linear distance estimates over a wide range of distances, while several alternative models underestimate long distances relative to short distances. Site-to-site differences in rates, as well as synonymous/nonsynonymous and first/second/third-codon-position differences, arise as a natural consequence of the site-to-site differences in amino acid frequencies.


Mol Biol Evol


Halpern AL,Bruno WJ




Has Abstract


1998-07-01 00:00:00












  • A genome-wide search for signals of high-altitude adaptation in Tibetans.

    abstract::Genetic studies of Tibetans, an ethnic group with a long-lasting presence on the Tibetan Plateau which is known as the highest plateau in the world, may offer a unique opportunity to understand the biological adaptations of human beings to high-altitude environments. We conducted a genome-wide study of 1,000,000 genet...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Xu S,Li S,Yang Y,Tan J,Lou H,Jin W,Yang L,Pan X,Wang J,Shen Y,Wu B,Wang H,Jin L

    更新日期:2011-02-01 00:00:00

  • Higher ribosomal RNA substitution rates in Bacillariophyceae and Dasycladales than in Mollusca, Echinodermata, and Actinistia-Tetrapoda.

    abstract::Molecular evolutionary rates within two protistan and three metazoan taxa were estimated using divergence times derived from fossil records. The results indicate that the small-subunit rRNA sequences within Dasycladales (Chlorophyta) and Bacillariophyceae evolved at a rate approximately two to three times faster than ...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Sorhannus U

    更新日期:1996-09-01 00:00:00

  • The complete chloroplast and mitochondrial DNA sequence of Ostreococcus tauri: organelle genomes of the smallest eukaryote are examples of compaction.

    abstract::The complete nucleotide sequence of the mt (mitochondrial) and cp (chloroplast) genomes of the unicellular green alga Ostreococcus tauri has been determined. The mt genome assembles as a circle of 44,237 bp and contains 65 genes. With an overall average length of only 42 bp for the intergenic regions, this is the most...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Robbens S,Derelle E,Ferraz C,Wuyts J,Moreau H,Van de Peer Y

    更新日期:2007-04-01 00:00:00

  • Protein function, connectivity, and duplicability in yeast.

    abstract::Protein-protein interaction networks have evolved mainly through connectivity rewiring and gene duplication. However, how protein function influences these processes and how a network grows in time have not been well studied. Using protein-protein interaction data and genomic data from the budding yeast, we first exam...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Prachumwat A,Li WH

    更新日期:2006-01-01 00:00:00

  • Lactate dehydrogenase A as a highly abundant eye lens protein in platypus (Ornithorhynchus anatinus): upsilon (upsilon)-crystallin.

    abstract::Vertebrate eye lenses mostly contain two abundant types of proteins, the alpha-crystallins and the beta/gamma-crystallins. In addition, certain housekeeping enzymes are highly expressed as crystallins in various taxa. We now observed an unusual approximately 41-kd protein that makes up 16% to 18% of the total protein ...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: van Rheede T,Amons R,Stewart N,de Jong WW

    更新日期:2003-06-01 00:00:00

  • A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.

    abstract::Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common p...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Rajan V

    更新日期:2013-03-01 00:00:00

  • Telomeres and longevity: testing an evolutionary hypothesis.

    abstract::Identifying mechanisms that underlie variation in adult survivorship provide insight into the evolution of life history strategies and phenotypic variation in longevity. There is accumulating evidence that shortening telomeres, the protective caps at the ends of chromosomes, play an important role in individual variat...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Haussmann MF,Mauck RA

    更新日期:2008-01-01 00:00:00

  • DNA-DNA hybridization evidence of the rapid rate of muroid rodent DNA evolution.

    abstract::Single-copy nuclear DNAs (scnDNAs) of eight species of arvicoline and six species of murine rodents were compared using DNA-DNA hybridization. The branching pattern derived from the DNA comparisons is congruent with the fossil evidence and supported by comparative biochemical, chromosomal, and morphological studies. T...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Catzeflis FM,Sheldon FH,Ahlquist JE,Sibley CG

    更新日期:1987-05-01 00:00:00

  • Spiking of contemporary human template DNA with ancient DNA extracts induces mutations under PCR and generates nonauthentic mitochondrial sequences.

    abstract::Proof of authenticity is the greatest challenge in palaeogenetic research, and many safeguards have become standard routine in laboratories specialized on ancient DNA research. Here we describe an as-yet unknown source of artifacts that will require special attention in the future. We show that ancient DNA extracts on...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Pusch CM,Bachmann L

    更新日期:2004-05-01 00:00:00

  • Evolution and phylogenetic utility of alignment gaps within intron sequences of three nuclear genes in bumble bees (Bombus).

    abstract::To test whether gaps resulting from sequence alignment contain phylogenetic signal concordant with those of base substitutions, we analyzed the occurrence of indel mutations upon a well-resolved, substitution-based tree for three nuclear genes in bumble bees (Bombus, Apidae: Bombini). The regions analyzed were exon an...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Kawakita A,Sota T,Ascher JS,Ito M,Tanaka H,Kato M

    更新日期:2003-01-01 00:00:00

  • Bayesian estimation of past population dynamics in BEAST 1.10 using the Skygrid coalescent model.

    abstract::Inferring past population dynamics over time from heterochronous molecular sequence data is often achieved using the Bayesian Skygrid model, a non-parametric coalescent model that estimates the effective population size over time. Available in BEAST, a cross-platform program for Bayesian analysis of molecular sequence...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Hill V,Baele G

    更新日期:2019-07-31 00:00:00

  • Evolution of the T1 retroposon family in the Anopheles gambiae complex.

    abstract::The T1 family of retrotransposable elements is interspersed and moderately repeated in five member species of the Anopheles gambiae sibling-species complex and has diverged little since the radiation of the complex. T1 includes two closely related but independent subfamilies, defined by the presence or absence of link...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Besansky NJ

    更新日期:1990-05-01 00:00:00

  • BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data.

    abstract::We propose an improved version of the neighbor-joining (NJ) algorithm of Saitou and Nei. This new algorithm, BIONJ, follows the same agglomerative scheme as NJ, which consists of iteratively picking a pair of taxa, creating a new mode which represents the cluster of these taxa, and reducing the distance matrix by repl...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Gascuel O

    更新日期:1997-07-01 00:00:00

  • G+C content variation along and among Saccharomyces cerevisiae chromosomes.

    abstract::Past analyses of the genome of the yeast Saccharomyces cerevisiae have revealed substantial regional variation in G+C content. Important questions remain, though, as to the origin, nature, significance, and generality of this variation. We conducted an extensive analysis of the yeast genome to try to answer these ques...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Bradnam KR,Seoighe C,Sharp PM,Wolfe KH

    更新日期:1999-05-01 00:00:00

  • Pervasive indels and their evolutionary dynamics after the fish-specific genome duplication.

    abstract::Insertions and deletions (indels) in protein-coding genes are important sources of genetic variation. Their role in creating new proteins may be especially important after gene duplication. However, little is known about how indels affect the divergence of duplicate genes. We here study thousands of duplicate genes in...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Guo B,Zou M,Wagner A

    更新日期:2012-10-01 00:00:00

  • The number of alleles at a microsatellite defines the allele frequency spectrum and facilitates fast accurate estimation of theta.

    abstract::Theoretical work focused on microsatellite variation has produced a number of important results, including the expected distribution of repeat sizes and the expected squared difference in repeat size between two randomly selected samples. However, closed-form expressions for the sampling distribution and frequency spe...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Haasl RJ,Payseur BA

    更新日期:2010-12-01 00:00:00

  • The effect of unequal transversion rates on the accuracy of evolutionary parsimony.

    abstract::Evolutionary parsimony is an easy-to-use method of phylogenetic inference that is based on nucleic acid sequences and that does not require the assumption that evolutionary processes in the various sites on the molecule are identical. It does, however, require a parameter constraint, known as the "balanced transversio...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Navidi WC,Beckett-Lemus L

    更新日期:1992-11-01 00:00:00

  • Population Parameters Underlying an Ongoing Soft Sweep in Southeast Asian Malaria Parasites.

    abstract::Multiple kelch13 alleles conferring artemisinin resistance (ART-R) are currently spreading through Southeast Asian malaria parasite populations, providing a unique opportunity to observe an ongoing soft selective sweep, investigate why resistance alleles have evolved multiple times and determine fundamental population...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Anderson TJ,Nair S,McDew-White M,Cheeseman IH,Nkhoma S,Bilgic F,McGready R,Ashley E,Pyae Phyo A,White NJ,Nosten F

    更新日期:2017-01-01 00:00:00

  • Phylogenetics in Caenorhabditis elegans: an analysis of divergence and outcrossing.

    abstract::This study establishes a phylogenetic framework for the natural geographic isolates of the widely studied nematode species Caenorhabditis elegans. Virtually complete mitochondrial genomes are sequenced from 27 C. elegans natural isolates to characterize mitochondrial divergence patterns and to investigate the evolutio...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Denver DR,Morris K,Thomas WK

    更新日期:2003-03-01 00:00:00

  • Divergence time of porcine reproductive and respiratory syndrome virus subtypes.

    abstract::Porcine reproductive and respiratory syndrome virus (PRRSV) recently emerged in domestic pigs of Western Europe and North America. Although time of emergence was identical on the two continents, genetic composition was markedly different with a clear geographical subtype structure, indicating that subtypes diverged in...

    journal_title:Molecular biology and evolution

    pub_type: 评论,信件


    authors: Forsberg R

    更新日期:2005-11-01 00:00:00

  • Statistical evaluation of the Rodin-Ohno hypothesis: sense/antisense coding of ancestral class I and II aminoacyl-tRNA synthetases.

    abstract::We tested the idea that ancestral class I and II aminoacyl-tRNA synthetases arose on opposite strands of the same gene. We assembled excerpted 94-residue Urgenes for class I tryptophanyl-tRNA synthetase (TrpRS) and class II Histidyl-tRNA synthetase (HisRS) from a diverse group of species, by identifying and catenating...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Chandrasekaran SN,Yardimci GG,Erdogan O,Roach J,Carter CW Jr

    更新日期:2013-07-01 00:00:00

  • Proteomic Analysis of Histones H2A/H2B and Variant Hv1 in Tetrahymena thermophila Reveals an Ancient Network of Chaperones.

    abstract::Epigenetic information, which can be passed on independently of the DNA sequence, is stored in part in the form of histone posttranslational modifications and specific histone variants. Although complexes necessary for deposition have been identified for canonical and variant histones, information regarding the chroma...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Ashraf K,Nabeel-Shah S,Garg J,Saettone A,Derynck J,Gingras AC,Lambert JP,Pearlman RE,Fillingham J

    更新日期:2019-05-01 00:00:00

  • Non-African populations of Drosophila melanogaster have a unique origin.

    abstract::Drosophila melanogaster is widely used as a model in DNA variation studies. Patterns of polymorphism have, however, been affected by the history of this species, which is thought to have recently spread out of Africa to the rest of the world. We analyzed DNA sequence variation in 11 populations, including four contine...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Baudry E,Viginier B,Veuille M

    更新日期:2004-08-01 00:00:00

  • An improved likelihood ratio test for detecting site-specific functional divergence among clades of protein-coding genes.

    abstract::Maximum likelihood codon substitution models have proven useful for studying when and how protein function evolves, but they have recently been criticized on a number of fronts. The strengths and weaknesses of such methods must therefore be identified and improved upon. Here, using simulations, we show that the Clade ...

    journal_title:Molecular biology and evolution

    pub_type: 信件


    authors: Weadick CJ,Chang BS

    更新日期:2012-05-01 00:00:00

  • Analysis of Genetic Variation Indicates DNA Shape Involvement in Purifying Selection.

    abstract::Noncoding DNA sequences, which play various roles in gene expression and regulation, are under evolutionary pressure. Gene regulation requires specific protein-DNA binding events, and our previous studies showed that both DNA sequence and shape readout are employed by transcription factors (TFs) to achieve DNA binding...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Wang X,Zhou T,Wunderlich Z,Maurano MT,DePace AH,Nuzhdin SV,Rohs R

    更新日期:2018-08-01 00:00:00

  • Wake up of transposable elements following Drosophila simulans worldwide colonization.

    abstract::Transposable elements (TEs) make up around 10%-15% of the Drosophila melanogaster genome, but its sibling species Drosophila simulans carries only one third as many such repeat sequences. We do not, however, have an overall view of copy numbers of the various classes of TEs (long terminal repeat [LTR] retrotransposons...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Vieira C,Lepetit D,Dumont S,Biémont C

    更新日期:1999-09-01 00:00:00

  • Nucleolar binding sequences of the ribosomal protein S6e family reside in evolutionary highly conserved peptide clusters.

    abstract::Proteomic analyses of the nucleolus have revealed almost 700 functionally diverse proteins implicated in ribosome biogenesis, nucleolar assembly, and regulation of vital cellular processes. However, this nucleolar inventory has not unveiled a specific consensus motif necessary for nucleolar binding. The ribosomal prot...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Kundu-Michalik S,Bisotti MA,Lipsius E,Bauche A,Kruppa A,Klokow T,Kammler G,Kruppa J

    更新日期:2008-03-01 00:00:00

  • Multilocus analysis of nucleotide variation of Oryza sativa and its wild relatives: severe bottleneck during domestication of rice.

    abstract::Varying degrees of reduction of genetic diversity in crops relative to their wild progenitors occurred during the process of domestication. Such information, however, has not been available for the Asian cultivated rice (Oryza sativa) despite its importance as a staple food and a model organism. To reveal levels and p...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Zhu Q,Zheng X,Luo J,Gaut BS,Ge S

    更新日期:2007-03-01 00:00:00

  • Maximum likelihood implementation of an isolation-with-migration model with three species for testing speciation with gene flow.

    abstract::We implement an isolation with migration model for three species, with migration occurring between two closely related species while an out-group species is used to provide further information concerning gene trees and model parameters. The model is implemented in the likelihood framework for analyzing multilocus geno...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Zhu T,Yang Z

    更新日期:2012-10-01 00:00:00

  • Lokiarchaeota Marks the Transition between the Archaeal and Eukaryotic Selenocysteine Encoding Systems.

    abstract::Selenocysteine (Sec) is the 21st amino acid in the genetic code, inserted in response to UGA codons with the help of RNA structures, the SEC Insertion Sequence (SECIS) elements. The three domains of life feature distinct strategies for Sec insertion in proteins and its utilization. While bacteria and archaea possess s...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章


    authors: Mariotti M,Lobanov AV,Manta B,Santesmasses D,Bofill A,Guigó R,Gabaldón T,Gladyshev VN

    更新日期:2016-09-01 00:00:00