A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.

Abstract:

:Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, automated methods are necessary to handle the much larger data sets being prepared today. In this study, we introduce the concept of subsplits and demonstrate their use in extracting phylogenetic signal from alignments. We design a clustering approach for alignment masking where each cluster contains similar columns-similarity being defined on the basis of compatible subsplits; our approach then identifies noisy clusters and eliminates them. Trees inferred from the columns in the retained clusters are found to be topologically closer to the reference trees. We test our method on numerous standard benchmarks (both synthetic and biological data sets) and compare its performance with other methods of alignment masking. We find that our method can eliminate sites more accurately than other methods, particularly on divergent data, and can improve the topologies of the inferred trees in likelihood-based analyses. Software available upon request from the author.

journal_name

Mol Biol Evol

authors

Rajan V

doi

10.1093/molbev/mss264

subject

Has Abstract

pub_date

2013-03-01 00:00:00

pages

689-712

issue

3

eissn

0737-4038

issn

1537-1719

pii

mss264

journal_volume

30

pub_type

杂志文章
  • The Evolution of Gene Expression Underlying Vision Loss in Cave Animals.

    abstract::Dissecting the evolutionary genetic processes underlying eye reduction and vision loss in obligate cave-dwelling organisms has been a long-standing challenge in evolutionary biology. Independent vision loss events in related subterranean organisms can provide critical insight into these processes as well as into the n...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msy106

    authors: Stern DB,Crandall KA

    更新日期:2018-08-01 00:00:00

  • Neighboring genes show interchromosomal colocalization after their separation.

    abstract::The order of genes on eukaryotic chromosomes is nonrandom. Some neighboring genes show order conservation among species, while some neighboring genes separate during evolution. Here, we investigated whether neighboring genes show interactions after their separation. We found that neighboring gene pairs tend to show in...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msu065

    authors: Dai Z,Xiong Y,Dai X

    更新日期:2014-05-01 00:00:00

  • Estimates of DNA and protein sequence divergence: an examination of some assumptions.

    abstract::Some of the assumptions underlying estimates of DNA and protein sequence divergence are examined. A solution for the variance of these estimates that allows for different mutation rates and different population sizes in each species and for an arbitrary structure in the initial population is obtained. It is shown that...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040303

    authors: Golding GB

    更新日期:1983-12-01 00:00:00

  • Mosaic origin of the heme biosynthesis pathway in photosynthetic eukaryotes.

    abstract::Heme biosynthesis represents one of the most essential metabolic pathways in living organisms, providing the precursors for cytochrome prosthetic groups, photosynthetic pigments, and vitamin B(12). Using genomic data, we have compared the heme pathway in the diatom Thalassiosira pseudonana and the red alga Cyanidiosch...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msi230

    authors: Oborník M,Green BR

    更新日期:2005-12-01 00:00:00

  • Distinct Patterns of Selection in Selenium-Dependent Genes between Land and Aquatic Vertebrates.

    abstract::Selenium (Se), a sparse element on earth, is an essential micronutrient in the vertebrate diet and its intake depends on its content in soils and waters worldwide. Selenium is required due to its function in selenoproteins, which contain selenocysteine (Sec), the 21st amino acid in the genetic code, as one of their co...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msy070

    authors: Sarangi GK,Romagné F,Castellano S

    更新日期:2018-07-01 00:00:00

  • Selection against deleterious LINE-1-containing loci in the human lineage.

    abstract::We compared sex chromosomal and autosomal regions of similar GC contents and found that the human Y chromosome contains nine times as many full-length (FL) ancestral LINE-1 (L1) elements per megabase as do autosomes and that the X chromosome contains three times as many. In addition, both sex chromosomes contain a ca....

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a003893

    authors: Boissinot S,Entezam A,Furano AV

    更新日期:2001-06-01 00:00:00

  • Conditional gene genealogies under strong purifying selection.

    abstract::The ancestral selection graph, conditioned on the allelic types in the sample, is used to obtain a limiting gene genealogical process under strong selection. In an equilibrium, two-allele system with strong selection, neutral gene genealogies are predicted for random samples and for samples containing at most one unfa...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msn209

    authors: Wakeley J

    更新日期:2008-12-01 00:00:00

  • The rhizome of the multidrug-resistant Enterobacter aerogenes genome reveals how new "killer bugs" are created because of a sympatric lifestyle.

    abstract::Here, we sequenced the 5,419,609 bp circular genome of an Enterobacter aerogenes clinical isolate that killed a patient and was resistant to almost all current antibiotics (except gentamicin) commonly used to treat Enterobacterial infections, including colistin. Genomic and phylogenetic analyses explain the discrepanc...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/mss236

    authors: Diene SM,Merhej V,Henry M,El Filali A,Roux V,Robert C,Azza S,Gavory F,Barbe V,La Scola B,Raoult D,Rolain JM

    更新日期:2013-02-01 00:00:00

  • Evidence that the human X chromosome is enriched for male-specific but not female-specific genes.

    abstract::There is increasing evidence that X chromosomes have an unusual complement of genes, especially genes that have sex-specific expression. However, whereas in worm and fly the X chromosome has a dearth of male-specific genes, in mice genes that are uniquely expressed in spermatogonia are especially abundant on the X chr...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msg131

    authors: Lercher MJ,Urrutia AO,Hurst LD

    更新日期:2003-07-01 00:00:00

  • Evolution of closely linked gene pairs in vertebrate genomes.

    abstract::The orientation of closely linked genes in mammalian genomes is not random: there are more head-to-head (h2h) gene pairs than expected. To understand the origin of this enrichment in h2h gene pairs, we have analyzed the phylogenetic distribution of gene pairs separated by less than 600 bp of intergenic DNA (gene duos)...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msn136

    authors: Franck E,Hulsen T,Huynen MA,de Jong WW,Lubsen NH,Madsen O

    更新日期:2008-09-01 00:00:00

  • Molecular evolution of glutamate receptors: a primitive signaling mechanism that existed before plants and animals diverged.

    abstract::We performed a genealogical analysis of the ionotropic glutamate receptor (iGluR) gene family, which includes the animal iGluRs and the newly isolated glutamate receptor-like genes (GLR) of plants discovered in Arabidopsis. Distance measures firmly placed the plant GLR genes within the iGluR clade as opposed to other ...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a026167

    authors: Chiu J,DeSalle R,Lam HM,Meisel L,Coruzzi G

    更新日期:1999-06-01 00:00:00

  • Jumbled genomes: missing Apicomplexan synteny.

    abstract::Whole-genome comparisons provide insight into genome evolution by informing on gene repertoires, gene gains/losses, and genome organization. Most of our knowledge about eukaryotic genome evolution is derived from studies of multicellular model organisms. The eukaryotic phylum Apicomplexa contains obligate intracellula...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msr103

    authors: DeBarry JD,Kissinger JC

    更新日期:2011-10-01 00:00:00

  • Precision and accuracy of divergence time estimates from STR and SNPSTR variation.

    abstract::Inference of intraspecific population divergence patterns typically requires genetic data for molecular markers with relatively high mutation rates. Microsatellites, or short tandem repeat (STR) polymorphisms, have proven informative in many such investigations. These markers are characterized, however, by high levels...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msh212

    authors: Ramakrishnan U,Mountain JL

    更新日期:2004-10-01 00:00:00

  • Organellar RNA editing and plant-specific extensions of pentatricopeptide repeat proteins in jungermanniid but not in marchantiid liverworts.

    abstract::The pyrimidine exchange type of RNA editing in land plant (embryophyte) organelles has largely remained an enigma with respect to its biochemical mechanisms, the underlying specificities, and its raison d'être. Apparently arising with the earliest embryophytes, RNA editing is conspicuously absent in one clade of liver...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msn084

    authors: Rüdinger M,Polsakiewicz M,Knoop V

    更新日期:2008-07-01 00:00:00

  • DNA-DNA hybridization evidence of the rapid rate of muroid rodent DNA evolution.

    abstract::Single-copy nuclear DNAs (scnDNAs) of eight species of arvicoline and six species of murine rodents were compared using DNA-DNA hybridization. The branching pattern derived from the DNA comparisons is congruent with the fossil evidence and supported by comparative biochemical, chromosomal, and morphological studies. T...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a040444

    authors: Catzeflis FM,Sheldon FH,Ahlquist JE,Sibley CG

    更新日期:1987-05-01 00:00:00

  • Adaptive evolution of metabolic pathways in Drosophila.

    abstract::The adaptive significance of enzyme variation has been of central interest in population genetics. Yet, how natural selection operates on enzymes in the larger context of biochemical pathways has not been broadly explored. A basic expectation is that natural selection on metabolic phenotypes will target enzymes that c...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msm057

    authors: Flowers JM,Sezgin E,Kumagai S,Duvernell DD,Matzkin LM,Schmidt PS,Eanes WF

    更新日期:2007-06-01 00:00:00

  • From DNA to fitness differences: sequences and structures of adaptive variants of Colias phosphoglucose isomerase (PGI).

    abstract::Colias eurytheme butterflies display extensive allozyme polymorphism in the enzyme phosphoglucose isomerase (PGI). Earlier studies on biochemical and fitness effects of these genotypes found evidence of strong natural selection maintaining this polymorphism in the wild. Here we analyze the molecular features of this p...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msj062

    authors: Wheat CW,Watt WB,Pollock DD,Schulte PM

    更新日期:2006-03-01 00:00:00

  • The first venomous crustacean revealed by transcriptomics and functional morphology: remipede venom glands express a unique toxin cocktail dominated by enzymes and a neurotoxin.

    abstract::Animal venoms have evolved many times. Venomous species are especially common in three of the four main groups of arthropods (Chelicerata, Myriapoda, and Hexapoda), which together represent tens of thousands of species of venomous spiders, scorpions, centipedes, and hymenopterans. Surprisingly, despite their great div...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/mst199

    authors: von Reumont BM,Blanke A,Richter S,Alvarez F,Bleidorn C,Jenner RA

    更新日期:2014-01-01 00:00:00

  • Phylogenomic test of the hypotheses for the evolutionary origin of eukaryotes.

    abstract::The evolutionary origin of eukaryotes is a question of great interest for which many different hypotheses have been proposed. These hypotheses predict distinct patterns of evolutionary relationships for individual genes of the ancestral eukaryotic genome. The availability of numerous completely sequenced genomes cover...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/mst272

    authors: Rochette NC,Brochier-Armanet C,Gouy M

    更新日期:2014-04-01 00:00:00

  • High rate of large-scale hemizygous deletions in asexually propagating Daphnia: implications for the evolution of sex.

    abstract::The origin and maintenance of sex remains one of the most debated topics in evolutionary biology. Investigations of the molecular genetic consequences of asexuality, such as direct estimation of mutation and recombination rates in asexual lineages, are critical for explaining the prevalence of sex in nature. In this s...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msq199

    authors: Xu S,Omilian AR,Cristescu ME

    更新日期:2011-01-01 00:00:00

  • Persistence of the mitochondrial lineage responsible for the Irish potato famine in extant new world phytophthora infestans.

    abstract::The plant pathogen Phytophthora infestans emerged in Europe in 1845, triggering the Irish potato famine and massive European potato crop losses that continued until effective fungicides were widely employed in the 20th century. Today the pathogen is ubiquitous, with more aggressive and virulent strains surfacing in re...

    journal_title:Molecular biology and evolution

    pub_type: 历史文章,信件

    doi:10.1093/molbev/msu086

    authors: Martin MD,Ho SY,Wales N,Ristaino JB,Gilbert MT

    更新日期:2014-06-01 00:00:00

  • Partial genetic turnover in neandertals: continuity in the East and population replacement in the West.

    abstract::Remarkably little is known about the population-level processes leading up to the extinction of the neandertal. To examine this, we use mitochondrial DNA sequences from 13 neandertal individuals, including a novel sequence from northern Spain, to examine neandertal demographic history. Our analyses indicate that recen...

    journal_title:Molecular biology and evolution

    pub_type: 信件

    doi:10.1093/molbev/mss074

    authors: Dalén L,Orlando L,Shapiro B,Brandström-Durling M,Quam R,Gilbert MT,Díez Fernández-Lomana JC,Willerslev E,Arsuaga JL,Götherström A

    更新日期:2012-08-01 00:00:00

  • Altered trans-regulatory control of gene expression in multiple anthocyanin genes contributes to adaptive flower color evolution in Mimulus aurantiacus.

    abstract::A fundamental goal in evolutionary biology is to identify the molecular changes responsible for adaptive evolution. In this study, we describe a genetic analysis to determine whether the molecular changes contributing to adaptive flower color divergence in Mimulus aurantiacus affect gene expression or enzymatic activi...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msn268

    authors: Streisfeld MA,Rausher MD

    更新日期:2009-02-01 00:00:00

  • ERAD-derived preprotein transport across the second outermost plastid membrane of diatoms.

    abstract::The diatom Phaeodactylum tricornutum harbors a plastid that is surrounded by four membranes and evolved by way of secondary endosymbiosis. Like land plants, most of its plastid proteins are encoded as preproteins on the nuclear genome of the host cell and are resultantly redirected into the organelle. Because two more...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msp079

    authors: Hempel F,Bullmann L,Lau J,Zauner S,Maier UG

    更新日期:2009-08-01 00:00:00

  • Patterns of divergence during evolution of alpha 1-proteinase inhibitors in mammals.

    abstract::alpha 1-Proteinase inhibitor (alpha 1-PI), a member of the serine proteinase inhibitor superfamily, has a primary role in controlling neutrophil elastase activity within the mammalian circulation. Several studies have indicated that the reactive center region of alpha 1-PI, the amino acid sequence of which is critical...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a025594

    authors: Goodwin RL,Baumann H,Berger FG

    更新日期:1996-02-01 00:00:00

  • Conserved gene order at the nuclear periphery in Drosophila.

    abstract::Whether higher-order chromatin organization is related to genome stability over evolutionary time remains elusive. We find that regions of conserved gene order across the genus Drosophila are larger if they harbor genes bound by B-type lamin (Lam) and Suppressor of Under-Replication (SUUR), two proteins located at the...

    journal_title:Molecular biology and evolution

    pub_type: 信件

    doi:10.1093/molbev/msr178

    authors: Ranz JM,Díaz-Castillo C,Petersen R

    更新日期:2012-01-01 00:00:00

  • Short repetitive sequences in green algal mitochondrial genomes: potential roles in mitochondrial genome evolution.

    abstract::Current data on green algal mitochondrial genomes suggest an unexpected dichotomy within the group with respect to genome structure, organization, and sequence affiliations. The present study suggests that there is a correlation between this dichotomy on one hand and the differences in the abundance, base composition,...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a025972

    authors: Nedelcu AM,Lee RW

    更新日期:1998-06-01 00:00:00

  • Molecular Biology and Evolution of Cancer: From Discovery to Action.

    abstract::Cancer progression is an evolutionary process. During this process, evolving cancer cell populations encounter restrictive ecological niches within the body, such as the primary tumor, circulatory system, and diverse metastatic sites. Efforts to prevent or delay cancer evolution-and progression-require a deep understa...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/molbev/msz242

    authors: Somarelli JA,Gardner H,Cannataro VL,Gunady EF,Boddy AM,Johnson NA,Fisk JN,Gaffney SG,Chuang JH,Li S,Ciccarelli FD,Panchenko AR,Megquier K,Kumar S,Dornburg A,DeGregori J,Townsend JP

    更新日期:2020-02-01 00:00:00

  • Proliferation of direct repeats near the Oenothera chloroplast DNA origin of replication.

    abstract::The spacer between the 16S and 23S rRNA genes of the chloroplast DNA has been implicated as an origin of replication in several species of plants. In the evening primrose, Oenothera, this site was found to vary greatly in size, with plastid genomes (plastomes) being readily distinguished. To determine whether plastome...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a025645

    authors: Sears BB,Stoike LL,Chiu WL

    更新日期:1996-07-01 00:00:00

  • Global patterns of human DNA sequence variation in a 10-kb region on chromosome 1.

    abstract::Human DNA variation is currently a subject of intense research because of its importance for studying human origins, evolution, and demographic history and for association studies of complex diseases. A approximately 10-kb region on chromosome 1, which contains only four small exons (each <155 bp), was sequenced for 6...

    journal_title:Molecular biology and evolution

    pub_type: 杂志文章

    doi:10.1093/oxfordjournals.molbev.a003795

    authors: Yu N,Zhao Z,Fu YX,Sambuughin N,Ramsay M,Jenkins T,Leskinen E,Patthy L,Jorde LB,Kuromori T,Li WH

    更新日期:2001-02-01 00:00:00