Abstract:
:Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software package CAFE, along with several other novel features. We demonstrate the accuracy of the method with extensive simulations and reanalyze several previously published data sets. Our results show that errors in genome annotation do lead to higher inferred rates of gene gain and loss but that CAFE 3 sufficiently accounts for these errors to provide accurate estimates of important evolutionary parameters.
journal_name
Mol Biol Evoljournal_title
Molecular biology and evolutionauthors
Han MV,Thomas GW,Lugo-Martinez J,Hahn MWdoi
10.1093/molbev/mst100subject
Has Abstractpub_date
2013-08-01 00:00:00pages
1987-97issue
8eissn
0737-4038issn
1537-1719pii
mst100journal_volume
30pub_type
杂志文章abstract::The noncoding-DNA content of organelle and nuclear genomes can vary immensely. Both adaptive and nonadaptive explanations for this variation have been proposed. This study addresses a nonadaptive explanation called the mutational-hazard hypothesis and applies it to the mitochondrial, plastid, and nuclear genomes of th...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msq110
更新日期:2010-10-01 00:00:00
abstract::Heterotachy is a general term to describe positions that evolve at different rates in different lineages. Heterotachy also can generally be viewed as multivariate rates-across-sites variation, which can be described as randomly drawing rates (or branch lengths) from a multivariate distribution for each branch at each ...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msq346
更新日期:2011-05-01 00:00:00
abstract::The degree to which an amino acid site is free to vary is strongly dependent on its structural and functional importance. An amino acid that plays an essential role is unlikely to change over evolutionary time. Hence, the evolutionary rate at an amino acid site is indicative of how conserved this site is and, in turn,...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msh194
更新日期:2004-09-01 00:00:00
abstract::There are three common alleles (A, B, and O) at the human ABO blood group locus. We compared nucleotide sequences of these alleles, and relatively large numbers of nucleotide differences were found among them. These differences correspond to the divergence time of at least a few million years, which is unusually large...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a025776
更新日期:1997-04-01 00:00:00
abstract::The mutation rate of an organism is influenced by the interaction of evolutionary forces such as natural selection and genetic drift. However, the mutation spectrum (i.e., the frequency distribution of different types of mutations) can be heavily influenced by DNA repair. Using mutation-accumulation lines of the extre...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msy134
更新日期:2018-10-01 00:00:00
abstract::The ability to survey polymorphism on a genomic scale has enabled genome-wide scans for the targets of natural selection. Theory that connects patterns of genetic variation to evidence of natural selection most often assumes a diallelic locus and no recurrent mutation. Although these assumptions are suitable to select...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/mss247
更新日期:2013-02-01 00:00:00
abstract::Protein interactions are central to most biological processes. We investigated the dynamics of emergence of the protein interaction network of Saccharomyces cerevisiae by mapping origins of proteins on an evolutionary tree. We demonstrate that evolutionary periods are characterized by distinct connectivity levels of t...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msh085
更新日期:2004-07-01 00:00:00
abstract::The question of multiple sequence alignment quality has received much attention from developers of alignment methods. Less forthcoming, however, are practical measures for addressing alignment quality issues in real life settings. Here, we present a simple methodology to help identify and quantify the uncertainties in...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msm060
更新日期:2007-06-01 00:00:00
abstract::Cellular Ca(2+) homeostasis is tightly regulated and is pivotal to life. Inositol 1,4,5-trisphosphate receptors (IP3Rs) and ryanodine receptors (RyRs) are the major ion channels that regulate Ca(2+) release from intracellular stores. Although these channels have been extensively investigated in multicellular organisms...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msv098
更新日期:2015-09-01 00:00:00
abstract::Protamines are small, highly basic DNA-binding proteins found in the sperm of animals. Interestingly, the proportion of arginine residues in one type of protamine, protamine P1, is about 50% in mammals. Upon closer examination, it was found that both the total number of amino acids and the positions of arginine residu...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a026307
更新日期:2000-02-01 00:00:00
abstract::Restriction site variations in nuclear ribosomal DNA (rDNA) spacers and mitochondrial DNA (mtDNA) were examined in several populations of mole rats with variable numbers of chromosomes, which represented the two superspecies Spalax leucodon (2n = 38, 54, or 62) and Spalax ehrenbergi (2n = 52, 54, 58, or 60). Sequence ...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a025574
更新日期:1996-01-01 00:00:00
abstract::The ancestral selection graph, conditioned on the allelic types in the sample, is used to obtain a limiting gene genealogical process under strong selection. In an equilibrium, two-allele system with strong selection, neutral gene genealogies are predicted for random samples and for samples containing at most one unfa...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msn209
更新日期:2008-12-01 00:00:00
abstract::To test whether gaps resulting from sequence alignment contain phylogenetic signal concordant with those of base substitutions, we analyzed the occurrence of indel mutations upon a well-resolved, substitution-based tree for three nuclear genes in bumble bees (Bombus, Apidae: Bombini). The regions analyzed were exon an...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msg007
更新日期:2003-01-01 00:00:00
abstract::In order to investigate divergence of immune regulation among Drosophila species, we have engaged in a study of innate immune function in F1 hybrids of Drosophila melanogaster and D. simulans. If pathways have diverged between the species such that incompatibilities have arisen between interacting components of the im...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msq146
更新日期:2010-11-01 00:00:00
abstract::About half of the mammalian genome is composed of retroposons. Long interspersed elements (LINEs) and short interspersed elements (SINEs) are the most abundant repetitive elements and account for about 21% and 13% of the human genome, respectively. SINEs have been detected in all major mammalian lineages, except for t...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msi071
更新日期:2005-04-01 00:00:00
abstract::In this study, we used an empirical example based on 100 mitochondrial genomes from higher teleost fishes to compare the accuracy of parsimony-based jackknife values with Bayesian support values. Phylogenetic analyses of 366 partitions, using differential taxon and character sampling from the entire data matrix of 100...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msh014
更新日期:2004-01-01 00:00:00
abstract::Environmental or genomic changes during evolution can relax negative selection pressure on specific loci, permitting high frequency polymorphisms at previously conserved sites. Here, we jointly analyze population genomic and comparative genomic data to search for functional processes showing relaxed negative selection...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/mst098
更新日期:2013-08-01 00:00:00
abstract::We have followed, in glucose-limited chemostats, the evolution of natural isolates of Escherichia coli possessing maximal growth rates of 0.48-1.43 doublings/h. Under these conditions a rapid-growth phenotype similar to that of standard laboratory wild-type strains was selected so that after 280 generations all of the...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a040731
更新日期:1992-05-01 00:00:00
abstract::We report the presence of four nuclear paralogs of a 380-bp segment of cytochrome b in callitrichine primates (marmosets and tamarins). The mitochondrial cytochrome b sequence and each nuclear paralog were obtained from several species, allowing multiple comparisons of rates and patterns of substitution both between m...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a026388
更新日期:2000-07-01 00:00:00
abstract::This review explores features of the origin and evolution of colicins in Escherichia coli. First, the evolutionary relationships of 16 colicin and colicin-related proteins are inferred from amino acid and DNA sequence comparisons. These comparisons are employed to detail the evolutionary mechanisms involved in the ori...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a040081
更新日期:1993-11-01 00:00:00
abstract::Employing comparative analysis of the cDNA-coding sequences of the unique preprorelaxin of the Afro-lorisiform Galago crassicaudatus and the Malagasy lemur Varecia variegata and the relaxin-like factor (RLF) of G. crassicaudatus, we demonstrated distinct differences in the dynamics of molecular remodeling of both horm...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a003815
更新日期:2001-03-01 00:00:00
abstract::Dogs exhibit more phenotypic variation than any other mammal and are affected by a wide variety of genetic diseases. However, the origin and genetic basis of this variation is still poorly understood. We examined the effect of domestication on the dog genome by comparison with its wild ancestor, the gray wolf. We comp...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msn177
更新日期:2008-11-01 00:00:00
abstract::With whole-genome sequences being completed at an increasing rate, it is important to develop and assess tools to analyze them. Following annotation of the protein content of a genome, one can compare sequences with previously characterized homologous genes to detect novel functions within specific proteins in the evo...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/oxfordjournals.molbev.a003745
更新日期:2001-11-01 00:00:00
abstract::A common problem in molecular phylogenetics is choosing a model of DNA substitution that does a good job of explaining the DNA sequence alignment without introducing superfluous parameters. A number of methods have been used to choose among a small set of candidate substitution models, such as the likelihood ratio tes...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msh123
更新日期:2004-06-01 00:00:00
abstract::The spontaneous deamination of cytosine produces uracil mispaired with guanine in DNA, which will produce a mutation, unless repaired. In all domains of life, uracil-DNA glycosylases (UDGs) are responsible for the elimination of uracil from DNA. Thus, UDGs contribute to the integrity of the genetic information and the...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msq318
更新日期:2011-03-01 00:00:00
abstract::Endogenous retroviruses are genomic elements formed by germline infiltration by originally exogenous viruses. These molecular fossils provide valuable information about the evolution of the retroviral family. Lentiviruses are an extensively studied genus of retroviruses infecting a broad range of mammals. Despite a we...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msw149
更新日期:2016-10-01 00:00:00
abstract::Spatial distribution and clustering of repetitive elements are extensively studied during the last years, as well as their colocalization with other genomic components. Here we investigate the large-scale features of Alu and LINE1 spatial arrangement in the human genome by studying the size distribution of interrepeat...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msm181
更新日期:2007-11-01 00:00:00
abstract::Odorant receptors (ORs) are essential for plant-insect interactions. However, despite the global impacts of Lepidoptera (moths and butterflies) as major herbivores and pollinators, little functional data is available about Lepidoptera ORs involved in plant volatile detection. Here, we initially characterized the plant...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msaa300
更新日期:2020-11-24 00:00:00
abstract::The most frequent measure of phylogenetic uncertainty for splits is bootstrap support. Although large bootstrap support intuitively suggests that a split in a tree is well supported, it has not been clear how large bootstrap support needs to be to conclude that there is significant evidence that a hypothesized split i...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msq048
更新日期:2010-07-01 00:00:00
abstract::Viral genome integration provides a complex route to biological innovation that has rarely but repeatedly occurred in one of the most diverse lineages of organisms on the planet, parasitoid wasps. We describe a novel endogenous virus in braconid wasps derived from pathogenic alphanudiviruses. Limited to a subset of th...
journal_title:Molecular biology and evolution
pub_type: 杂志文章
doi:10.1093/molbev/msy148
更新日期:2018-10-01 00:00:00