Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.

Abstract:

:Three measures of sequence dissimilarity have been compared on a computer-generated model system in which substitutions in random sequences were made at randomly selected sites and the replacement character was chosen at random from the set of characters different from the original occupant of the site. The three measures were the conventional mismatch count between aligned sequences (AMC = m) and two measures not requiring prior sequence alignment. The latter two measures were the squared Euclidean distance between vectors of counts of t-tuples (t = 1-6) of characters in the two sequences (multiplet distribution distances or MDD = d) and counts of characters not covered by word structures of statistically significant length common to the two sequences (common long words or CLW = SIB, SIS, or SAB). Average MDD distances were found to be two times average mismatch counts in the simulated sequences for all values of t from 1 to 6 and all degrees of substitution from one per sequence to so many as to produce, effectively, random sequences. This simple relation held independently of sequence length and of sequence composition. The relation was confirmed by exact results on small model systems and by formal asymptotic results in the limit of so few substitutions that no double hits occur and in the limit of two random sequences. The coefficient of variation for MDD distances was greater than that for mismatch counts for singlets but both measures approached the same low value for sextets. Needleman-Wunsch alignment produced incorrect mismatch counts at higher degrees of substitution. The model satisfied the conditions for the derivation of the Jukes-Cantor asymptotic adjustment, but its application produced increasingly bad results with increasing degrees of substitution in accord with earlier results on model and natural sequences. This fact was a consequence of the increase with increasing degrees of substitution of the sensitivity of the adjustment to error in the observations. Average CLW distances for a variety of common word structures were more or less parallel to MDD distances for appropriately long t-tuples. These results on model systems supported the validity of the two dissimilarity measures not requiring sequence alignment that was found in earlier work on natural sequences (Blaisdell 1989).

journal_name

J Mol Evol

authors

Blaisdell BE

doi

10.1007/BF02602925

subject

Has Abstract

pub_date

1989-12-01 00:00:00

pages

538-47

issue

6

eissn

0022-2844

issn

1432-1432

journal_volume

29

pub_type

杂志文章
  • Phylogeny and megasystematics of phagotrophic heterokonts (kingdom Chromista).

    abstract::Heterokonts are evolutionarily important as the most nutritionally diverse eukaryote supergroup and the most species-rich branch of the eukaryotic kingdom Chromista. Ancestrally photosynthetic/phagotrophic algae (mixotrophs), they include several ecologically important purely heterotrophic lineages, all grossly unders...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-004-0353-8

    authors: Cavalier-Smith T,Chao EE

    更新日期:2006-04-01 00:00:00

  • DNA polymerase C of the thermophilic bacterium Thermus aquaticus: classification and phylogenetic analysis of the family C DNA polymerases.

    abstract::Bacterial family C DNA polymerases (DNA pol IIIs), the major chromosomal replicative enzymes, have been provisionally classified based on primary sequences and domain structures into three classes: class I (Escherichia coli DNA pol C-type), class II (Bacillus subtilis DNA pol C-type), and class III (cyanobacterial DNA...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/pl00006520

    authors: Huang YP,Ito J

    更新日期:1999-06-01 00:00:00

  • Comparative study of translation termination sites and release factors (RF1 and RF2) in procaryotes.

    abstract::Translation termination is catalyzed by release factors that recognize stop codons. However, previous works have shown that in some bacteria, the termination process also involves bases around stop codons. Recently, Ito et al. analyzed release factors and identified the amino acids therein that recognize stop codons. ...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-002-2435-9

    authors: Ozawa Y,Saito R,Washio T,Tomita M

    更新日期:2003-06-01 00:00:00

  • A Continuum of Evolving De Novo Genes Drives Protein-Coding Novelty in Drosophila.

    abstract::Orphan genes, lacking detectable homologs in outgroup species, typically represent 10-30% of eukaryotic genomes. Efforts to find the source of these young genes indicate that de novo emergence from non-coding DNA may in part explain their prevalence. Here, we investigate the roots of orphan gene emergence in the Droso...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-020-09939-z

    authors: Heames B,Schmitz J,Bornberg-Bauer E

    更新日期:2020-05-01 00:00:00

  • Expression pattern diversity and functional conservation between retroposed PRAT genes from Drosophila melanogaster and Drosophila virilis.

    abstract::Gene duplication by retrotransposition duplicates only the coding and untranslated regions of a gene and, thus, biases retroduplicated genes toward having different expression patterns from their parental genes. As such, genes duplicated by retrotransposition are more likely to develop novel expression domains. To exp...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-008-9098-0

    authors: Penney J,Bossé J,Clark DV

    更新日期:2008-05-01 00:00:00

  • Gene conversion vs point mutation in generating variability at the antigen recognition site of major histocompatibility complex loci.

    abstract::In order to assess the roles of gene conversion followed by natural selection and balancing selection for point mutations in polymorphisms at major histocompatibility complex (MHC) loci, DNA sequences of several mammalian taxa were analyzed. Synonymous and nonsynonymous diversities were estimated separately for the an...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF00170662

    authors: Ohta T

    更新日期:1995-08-01 00:00:00

  • Hitchhiking and the population genetic structure of avian influenza virus.

    abstract::Previous studies have revealed a major difference in the phylogenetic structure, extent of genetic diversity, and selection pressure between the surface glycoproteins and internal gene segments of avian influenza viruses (AIV) sampled from wild birds. However, what evolutionary processes are responsible for these stri...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-009-9312-8

    authors: Chen R,Holmes EC

    更新日期:2010-01-01 00:00:00

  • Domain-Specific Proteogenomic Analysis of Collagens to Evaluate De Novo Sequencing Results and Database Information.

    abstract::Collagen is an important structural protein and the most abundant protein in mammals. In several research fields, structural analysis of collagens is performed. Fibrillar collagens almost entirely consist of continuous repeats of GXY, where G is glycine, X is often proline or alanine and Y is often hydroxyproline or a...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-018-9844-x

    authors: Kleinnijenhuis AJ,van Holthoon FL

    更新日期:2018-06-01 00:00:00

  • A skewed distribution of amino acids at recognition sites of the hypervariable region of immunoglobulins.

    abstract::Antibody binding site are formed by six hypervariable regions or complementarity determining regions (CDRs). The CDRs, three from the heavy chain and three from the light chain, are known as hypervariable segments and provide a surface complementary to that of the epitope. In recent work it was found that the amino ac...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF00175497

    authors: Vargas-Madrazo E,Lara-Ochoa F,Jiménez-Montaño M

    更新日期:1994-01-01 00:00:00

  • The "Origin-of-Life Reactor" and Reduction of CO2 by H2 in Inorganic Precipitates.

    abstract::It has been suggested that inorganic membranes were forerunners of organic membranes at the origin of life. Such membranes, interposed between alkaline fluid in submarine vents and the more acidic Hadean ocean, were thought to house inorganic molecular machines. H+ flowed down the pH gradient (ΔpH) from ocean to vent ...

    journal_title:Journal of molecular evolution

    pub_type: 信件,评审

    doi:10.1007/s00239-017-9805-9

    authors: Jackson JB

    更新日期:2017-08-01 00:00:00

  • The interaction of protein structure, selection, and recombination on the evolution of the type-1 fimbrial major subunit (fimA) from Escherichia coli.

    abstract::Fimbrial adhesins allow bacteria to interact with and attach to their environment. The bacteria possibly benefit from these interactions, but all external structures including adhesins also allow bacteria to be identified by other organisms. Thus adhesion molecules might be under multiple forms of selection including ...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s002390010148

    authors: Peek AS,Souza V,Eguiarte LE,Gaut BS

    更新日期:2001-02-01 00:00:00

  • Fungal origin by horizontal transfer of a plant mitochondrial group I intron in the chimeric CoxI gene of Peperomia.

    abstract::We present phylogenetic evidence that a group I intron in an angiosperm mitochondrial gene arose recently by horizontal transfer from a fungal donor species. A 1,716-bp fragment of the mitochondrial coxI gene from the angiosperm Peperomia polybotrya was amplified via the polymerase chain reaction and sequenced. Compar...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF00175814

    authors: Vaughn JC,Mason MT,Sper-Whitis GL,Kuhlman P,Palmer JD

    更新日期:1995-11-01 00:00:00

  • The structure and gene repertoire of an ancient red algal plastid genome.

    abstract::Photosynthetic eukaryotes can, according to features of their chloroplasts, be divided into two major groups: the red and the green lineage of plastid evolution. To extend the knowledge about the evolution of the red lineage we have sequenced and analyzed the chloroplast genome (cp-genome) of Cyanidium caldarium RK1, ...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s002390010101

    authors: Glöckner G,Rosenthal A,Valentin K

    更新日期:2000-10-01 00:00:00

  • Population genetics of Y-chromosome short tandem repeats in humans.

    abstract::Eight human short tandem repeat polymorphisms (STRs) also known as microsatellites-DYS19, DYS388, DYS390, DYS391, DYS392, DYS393, DYS389I, and DYS389II, mapping in the Y chromosome-were analyzed in two Iberian samples (Basques and Catalans). Allele frequency distributions showed significant differences only for DYS392...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/pl00006229

    authors: Pérez-Lezaun A,Calafell F,Seielstad M,Mateu E,Comas D,Bosch E,Bertranpetit J

    更新日期:1997-09-01 00:00:00

  • Forty million years of independent evolution: a mitochondrial gene and its corresponding nuclear pseudogene in primates.

    abstract::Sequences from nuclear mitochondrial pseudogenes (numts) that originated by transfer of genetic information from mitochondria to the nucleus offer a unique opportunity to compare different regimes of molecular evolution. Analyzing a 1621-nt-long numt of the rRNA specifying mitochondrial DNA residing on human chromosom...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-004-0293-3

    authors: Schmitz J,Piskurek O,Zischler H

    更新日期:2005-07-01 00:00:00

  • Iterative character weighting based on mutation frequency: a new method for constructing phyletic trees.

    abstract::In this paper we present an iterative character weighting method for the construction of phyletic trees. An initial tree is used to calculate the character weights, which are the number of mutations normalized so that the possible range is corrected for. The weights obtained are used to adjust the tree; this process i...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF02101127

    authors: van Ooyen A,Hogeweg P

    更新日期:1990-10-01 00:00:00

  • Functional constraints of 6-phosphogluconate dehydrogenase (6-PGD) based on sequence and structural information.

    abstract::The pentose phosphate cycle is considered as a major source of NADPH and pentose needed for nucleic acid biosynthesis. 6-Phosphogluconate dehydrogenase (6PGD), an enzyme participating in this cycle, catalyzes the oxidative decarboxylation of 6PGD to ribulose 5-phosphate with the subsequent release of CO2 and the reduc...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-004-2630-y

    authors: Goulielmos GN,Eliopoulos E,Loukas M,Tsakas S

    更新日期:2004-09-01 00:00:00

  • A statistical approach to identify ancient template DNA.

    abstract::One of the key problems in the study of ancient DNA is that of authenticating sequences obtained from PCR amplifications of highly degraded samples. Contamination of ancient samples and postmortem damage to endogenous DNA templates are the major obstacles facing researchers in this task. In particular, the authenticat...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-006-0259-8

    authors: Helgason A,Pálsson S,Lalueza-Fox C,Ghosh S,Sigurdardóttir S,Baker A,Hrafnkelsson B,Arnadóttir L,Thorsteinsdóttir U,Stefánsson K

    更新日期:2007-07-01 00:00:00

  • Comparative nucleotide diversity across North American and European populus species.

    abstract::Nucleotide polymorphisms in two North American balsam poplars (Populus trichocarpa Torr. & Gray and P. balsamifera L.; section Tacamahaca), and one Eurasian aspen (P. tremula L.; section Populus) were compared using nine loci involved in defense, stress response, photoperiodism, freezing tolerance, and housekeeping. N...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-012-9504-5

    authors: Ismail M,Soolanayakanahally RY,Ingvarsson PK,Guy RD,Jansson S,Silim SN,El-Kassaby YA

    更新日期:2012-06-01 00:00:00

  • Genome-wide analysis of the Fusarium oxysporum mimp family of MITEs and mobilization of both native and de novo created mimps.

    abstract::We have performed a genome-wide analysis of the mimp family of miniature inverted-repeat transposable elements, taking advantage of the recent release of the F. oxysporum genome sequence. Using different approaches, we detected 103 mimp elements, corresponding to 75 nonredundant copies, half of which are located on a ...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-008-9164-7

    authors: Bergemann M,Lespinet O,M'Barek SB,Daboussi MJ,Dufresne M

    更新日期:2008-12-01 00:00:00

  • Variation in protein structure and function: Primate hemoglobins.

    abstract::Variation in structure among primate hemoglobins is associated with variation in function. This supports the hypothesis that most substitutions observed among homologous proteins in different species have been fixed by natural selection because they contribute to the fitness of the genotype. It does not support the co...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF01653958

    authors: Sullivan B

    更新日期:1972-12-01 00:00:00

  • Elucidating the population histories and transmission dynamics of papillomaviruses using phylogenetic trees.

    abstract::Using gene genealogies constructed from gene sequence data, we show that both the mucosal and cutaneous papillomaviruses (PV)-supergroups A and B-appear to have been transmitted through susceptible populations faster than exponentially. The data and methods involved (1) examining the PV database for phylogenetic signa...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/pl00006136

    authors: Ong CK,Nee S,Rambaut A,Bernard HU,Harvey PH

    更新日期:1997-02-01 00:00:00

  • Structural constraints in expansion segments from a midge 26S rDNA.

    abstract::DNA sequences representing approximately 40% of the large-subunit rRNA gene from the lower dipteran Chironomus thummi were analyzed. Once aligned with their Drosophila counterparts, sequence and base content comparisons were carried out. Sequence identity was found to be high overall, except for six regions that displ...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF00173183

    authors: Gorab E,Garcia de Lacoba M,Botella LM

    更新日期:1995-12-01 00:00:00

  • Protein modules conserved since LUCA.

    abstract::Universal scale of the sequence conservation has been recently introduced based on omnipresence of the protein sequence motifs across species. A large spectrum of short sequences, up to eight residues has been found to reside in all or almost all prokaryotic organisms. By this discovery a principally novel quantitativ...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-005-0190-4

    authors: Sobolevsky Y,Trifonov EN

    更新日期:2006-11-01 00:00:00

  • Evolution of the primate androgen receptor: a structural basis for disease.

    abstract::Androgen effects mediated by the androgen receptor (AR) are essential for male reproductive development and virilization. Comparison of AR DNA coding sequence from five primate species, Homo sapiens (human), Pan troglodytes (chimpanzee), Papio hamadryas (baboon), Macaca fascicularis (macaque), and Eulemur fulvus colla...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/pl00006391

    authors: Choong CS,Kemppainen JA,Wilson EM

    更新日期:1998-09-01 00:00:00

  • Coordinated amino acid changes in the evolution of mammalian defensins.

    abstract::The mammalian defensin molecule is a short, highly cationic peptide cytotoxic to both microbial and mammalian cells which is cleaved from a precursor including a signal peptide and a highly anionic propiece. A phylogenetic analysis of 28 complete sequences from five mammalian species (mouse, rat, guinea pig, rabbit, a...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/pl00006191

    authors: Hughes AL,Yeager M

    更新日期:1997-06-01 00:00:00

  • Evaluating Neanderthal genetics and phylogeny.

    abstract::The retrieval of Neanderthal (Homo neanderthalsensis) mitochondrial DNA is thought to be among the most significant ancient DNA contributions to date, allowing conflicting hypotheses on modern human (Homo sapiens) evolution to be tested directly. Recently, however, both the authenticity of the Neanderthal sequences an...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-006-0017-y

    authors: Hebsgaard MB,Wiuf C,Gilbert MT,Glenner H,Willerslev E

    更新日期:2007-01-01 00:00:00

  • Molecular characterization of the Rh-like locus and gene transcripts from the rhesus monkey (Macaca mulatta).

    abstract::The human Rh blood group locus consists of two structurally related genes (D and CcEe) in Rh-positive haplotypes but a single gene (CcEe) in Rh-negative haplotypes. The genome of rhesus monkeys (Macaca mulatta), while not expressing any of the human Rh D, C, c, E, or e specificities, carries a Rh-like locus strongly r...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/BF00166163

    authors: Mouro I,Le Van Kim C,Cherif-Zahar B,Salvignol I,Blancher A,Cartron JP,Colin Y

    更新日期:1994-02-01 00:00:00

  • Variation in constraint versus positive selection as an explanation for evolutionary rate variation among anthocyanin genes.

    abstract::It has been argued that downstream enzymes in metabolic pathways are expected to be subject to reduced selective constraint, while upstream enzymes, particularly those at pathway branch points, are expected to exhibit more frequent adaptive substitution than downstream enzymes. We examined whether these expectations a...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s00239-008-9105-5

    authors: Rausher MD,Lu Y,Meyer K

    更新日期:2008-08-01 00:00:00

  • How mitochondria redefine the code.

    abstract::Annotated, complete DNA sequences are available for 213 mitochondrial genomes from 132 species. These provide an extensive sample of evolutionary adjustment of codon usage and meaning spanning the history of this organelle. Because most known coding changes are mitochondrial, such data bear on the general mechanism of...

    journal_title:Journal of molecular evolution

    pub_type: 杂志文章

    doi:10.1007/s002390010220

    authors: Knight RD,Landweber LF,Yarus M

    更新日期:2001-10-01 00:00:00