Phylogenetic patterns of emergence of new genes support a model of frequent de novo evolution.

Abstract:

BACKGROUND: New gene emergence is so far assumed to be mostly driven by duplication and divergence of existing genes. The possibility that entirely new genes could emerge out of the non-coding genomic background was long thought to be almost negligible. With the increasing availability of fully sequenced genomes across broad scales of phylogeny, it has become possible to systematically study the origin of new genes over time and thus revisit this question.

RESULTS: We have used phylostratigraphy to assess trends of gene evolution across successive phylogenetic phases, using mostly the well-annotated mouse genome as a reference. We find several significant general trends and confirm them for three other vertebrate genomes (humans, zebrafish and stickleback). Younger genes are shorter, with respect to both gene length and open reading frame length. They also contain fewer exons and have fewer recognizable domains. Average exon length, on the other hand, does not change much over time. Only the most recently evolved genes have longer exons, and they are often associated with active promoter regions, i.e. they are part of bidirectional promoters. We have also revisited the possibility that de novo evolution of genes could occur even within existing genes, by making use of an alternative reading frame (overprinting). We find several cases among the annotated Ensembl ORFs where the new reading frame has emerged at a higher phylostratigraphic level than the original one. We discuss some of these overprinted genes, which also include the Hoxa9 gene, where an alternative reading frame covering the homeobox has emerged within the lineage leading to rodents and primates (Euarchontoglires).

CONCLUSIONS: We suggest that the overall trends of gene emergence are more compatible with a de novo evolution model for orphan genes than a general duplication-divergence model. Hence de novo evolution of genes appears to have occurred continuously throughout evolutionary time and should therefore be considered as a general mechanism for the emergence of new gene functions.

journal_name

BMC Genomics

authors

Neme R,Tautz D

doi

10.1186/1471-2164-14-117

pub_date

2013-02-21 00:00:00

pages

117

issn

1471-2164

pii

1471-2164-14-117

journal_volume

14

pub_type

Journal article
  • Screening populations for copy number variation using genotyping-by-sequencing: a proof of concept using soybean fast neutron mutants.

    abstract:BACKGROUND:The effective use of mutant populations for reverse genetic screens relies on the population-wide characterization of the induced mutations. Genome- and population-wide characterization of the mutations found in fast neutron populations has been hindered, however, by the wide range of mutations generated and...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-019-5998-1

    authors: Lemay MA,Torkamaneh D,Rigaill G,Boyle B,Stec AO,Stupar RM,Belzile F

    update_date:2019-08-06 00:00:00

  • Cytosine methylation is a conserved epigenetic feature found throughout the phylum Platyhelminthes.

    abstract:BACKGROUND:The phylum Platyhelminthes (flatworms) contains an important group of bilaterian organisms responsible for many debilitating and chronic infectious diseases of human and animal populations inhabiting the planet today. In addition to their biomedical and veterinary relevance, some platyhelminths are also freq...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-14-462

    authors: Geyer KK,Chalmers IW,Mackintosh N,Hirst JE,Geoghegan R,Badets M,Brophy PM,Brehm K,Hoffmann KF

    update_date:2013-07-09 00:00:00

  • Sequence-indexed mutations in maize using the UniformMu transposon-tagging population.

    abstract:BACKGROUND:Gene knockouts are a critical resource for functional genomics. In Arabidopsis, comprehensive knockout collections were generated by amplifying and sequencing genomic DNA flanking insertion mutants. These Flanking Sequence Tags (FSTs) map each mutant to a specific locus within the genome. In maize, FSTs have...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-8-116

    authors: Settles AM,Holding DR,Tan BC,Latshaw SP,Liu J,Suzuki M,Li L,O'Brien BA,Fajardo DS,Wroclawska E,Tseung CW,Lai J,Hunter CT 3rd,Avigne WT,Baier J,Messing J,Hannah LC,Koch KE,Becraft PW,Larkins BA,McCarty DR

    update_date:2007-05-09 00:00:00

  • Gene expression analyses in Atlantic salmon challenged with infectious salmon anemia virus reveal differences between individuals with early, intermediate and late mortality.

    abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-9-179

    authors: Jørgensen SM,Afanasyev S,Krasnov A

    update_date:2008-04-18 00:00:00

  • Global transcriptional profiling reveals Streptococcus agalactiae genes controlled by the MtaR transcription factor.

    abstract:BACKGROUND:Streptococcus agalactiae (group B Streptococcus; GBS) is a significant bacterial pathogen of neonates and an emerging pathogen of adults. Though transcriptional regulators are abundantly encoded on the GBS genome, their role in GBS pathogenesis is poorly understood. The mtaR gene encodes a putative LysR-type...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-9-607

    authors: Bryan JD,Liles R,Cvek U,Trutschl M,Shelver D

    update_date:2008-12-16 00:00:00

  • Microbial "social networks".

    abstract:BACKGROUND:It is well understood that distinct communities of bacteria are present at different sites of the body, and that changes in the structure of these communities have strong implications for human health. Yet, challenges remain in understanding the complex interconnections between the bacterial taxa within thes...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-16-S11-S6

    authors: Fernandez M,Riveros JD,Campos M,Mathee K,Narasimhan G

    update_date:2015-01-01 00:00:00

  • A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize.

    abstract:BACKGROUND:Haplotypes combine the effects of several single nucleotide polymorphisms (SNPs) with high linkage disequilibrium, which benefit the genome-wide association analysis (GWAS). In the haplotype association analysis, both haplotype alleles and blocks are tested. Haplotype alleles can be inferred with the same st...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-020-6552-x

    authors: Chen H,Hao Z,Zhao Y,Yang R

    update_date:2020-02-11 00:00:00

  • MS2CNN: predicting MS/MS spectrum based on protein sequence using deep convolutional neural networks.

    abstract:BACKGROUND:Tandem mass spectrometry allows biologists to identify and quantify protein samples in the form of digested peptide sequences. When performing peptide identification, spectral library search is more sensitive than traditional database search but is limited to peptides that have been previously identified. An...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-019-6297-6

    authors: Lin YM,Chen CT,Chang JM

    update_date:2019-12-24 00:00:00

  • Complete genomic sequence of the Vibrio alginolyticus bacteriophage Vp670 and characterization of the lysis-related genes, cwlQ and holA.

    abstract:BACKGROUND:Biocontrol of bacterial pathogens by bacteriophages (phages) represents a promising strategy. Vibrio alginolyticus, a gram-negative bacterium, is a notorious pathogen responsible for the loss of economically important farmed marine animals. To date, few V. alginolyticus phages have been successfully isolated...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-018-5131-x

    authors: Luo P,Yun L,Li Y,Tian Y,Liu Q,Huang W,Hu C

    update_date:2018-10-11 00:00:00

  • A vast genomic deletion in the C56BL/6 genome affects different genes within the Ifi200 cluster on chromosome 1 and mediates obesity and insulin resistance.

    abstract:BACKGROUND:Obesity, the excessive accumulation of body fat, is a highly heritable and genetically heterogeneous disorder. The complex, polygenic basis for the disease consisting of a network of different gene variants is still not completely known. RESULTS:In the current study we generated a BAC library of the obese-p...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-017-3552-6

    authors: Vogel H,Jähnert M,Stadion M,Matzke D,Scherneck S,Schürmann A

    update_date:2017-02-15 00:00:00

  • Genome-wide expression profiling shows transcriptional reprogramming in Fusarium graminearum by Fusarium graminearum virus 1-DK21 infection.

    abstract:BACKGROUND:Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21) is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB) disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessa...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-13-173

    authors: Cho WK,Yu J,Lee KM,Son M,Min K,Lee YW,Kim KH

    update_date:2012-05-06 00:00:00

  • Finishing monkeypox genomes from short reads: assembly analysis and a neural network method.

    abstract:BACKGROUND:Poxviruses constitute one of the largest and most complex animal virus families known. The notorious smallpox disease has been eradicated and the virus contained, but its simian sister, monkeypox is an emerging, untreatable infectious disease, killing 1 to 10 % of its human victims. In the case of poxviruses...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-016-2826-8

    authors: Zhao K,Wohlhueter RM,Li Y

    update_date:2016-08-31 00:00:00

  • Synchronous profiling and analysis of mRNAs and ncRNAs in the dermal papilla cells from cashmere goats.

    abstract:BACKGROUND:Dermal papilla cells (DPCs), the "signaling center" of hair follicle (HF), delicately master continual growth of hair in mammals including cashmere, the fine fiber annually produced by secondary HF embedded in cashmere goat skins. Such unparalleled capacity bases on their exquisite character in instructing t...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-019-5861-4

    authors: Ma S,Wang Y,Zhou G,Ding Y,Yang Y,Wang X,Zhang E,Chen Y

    update_date:2019-06-20 00:00:00

  • Gene duplications in the E. coli genome: common themes among pathotypes.

    abstract:BACKGROUND:Gene duplication underlies a significant proportion of gene functional diversity and genome complexity in both eukaryotes and prokaryotes. Although several reports in the literature described the duplication of specific genes in E. coli, a detailed analysis of the extent of gene duplications in this microorg...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-019-5683-4

    authors: Bernabeu M,Sánchez-Herrero JF,Huedo P,Prieto A,Hüttener M,Rozas J,Juárez A

    update_date:2019-04-24 00:00:00

  • Predicting chemical bioavailability using microarray gene expression data and regression modeling: A tale of three explosive compounds.

    abstract:BACKGROUND:Chemical bioavailability is an important dose metric in environmental risk assessment. Although many approaches have been used to evaluate bioavailability, not a single approach is free from limitations. Previously, we developed a new genomics-based approach that integrated microarray technology and regressi...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-016-2541-5

    authors: Gong P,Nan X,Barker ND,Boyd RE,Chen Y,Wilkins DE,Johnson DR,Suedel BC,Perkins EJ

    update_date:2016-03-08 00:00:00

  • Medicago truncatula transporter database: a comprehensive database resource for M. truncatula transporters.

    abstract:BACKGROUND:Medicago truncatula has been chosen as a model species for genomic studies. It is closely related to an important legume, alfalfa. Transporters are a large group of membrane-spanning proteins. They deliver essential nutrients, eject waste products, and assist the cell in sensing environmental conditions by f...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-13-60

    authors: Miao Z,Li D,Zhang Z,Dong J,Su Z,Wang T

    update_date:2012-02-06 00:00:00

  • Identification of recent cases of hepatitis C virus infection using physical-chemical properties of hypervariable region 1 and a radial basis function neural network classifier.

    abstract:BACKGROUND:Identification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission. Epidemiological investigations and chemistry-based laboratory tests are 2 main approaches that are available for identifi...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-017-4269-2

    authors: Lara J,Teka M,Khudyakov Y

    update_date:2017-12-06 00:00:00

  • Identification of a radiosensitivity signature using integrative metaanalysis of published microarray data for NCI-60 cancer cells.

    abstract:BACKGROUND:In the postgenome era, a prediction of response to treatment could lead to better dose selection for patients in radiotherapy. To identify a radiosensitive gene signature and elucidate related signaling pathways, four different microarray experiments were reanalyzed before radiotherapy. RESULTS:Radiosensiti...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-13-348

    authors: Kim HS,Kim SC,Kim SJ,Park CH,Jeung HC,Kim YB,Ahn JB,Chung HC,Rha SY

    update_date:2012-07-30 00:00:00

  • De novo assembly of highly diverse viral populations.

    abstract:BACKGROUND:Extensive genetic diversity in viral populations within infected hosts and the divergence of variants from existing reference genomes impede the analysis of deep viral sequencing data. A de novo population consensus assembly is valuable both as a single linear representation of the population and as a backbo...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-13-475

    authors: Yang X,Charlebois P,Gnerre S,Coole MG,Lennon NJ,Levin JZ,Qu J,Ryan EM,Zody MC,Henn MR

    update_date:2012-09-13 00:00:00

  • A study of inter-lab and inter-platform agreement of DNA microarray data.

    abstract:As gene expression profile data from DNA microarrays accumulate rapidly, there is a natural need to compare data across labs and platforms. Comparisons of microarray data can be quite challenging due to data complexity and variability. Different labs may adopt different technology platforms. One may ask about the degr...

    journal_title:BMC genomics

    pub_type: Journal article, Multicenter study

    doi:10.1186/1471-2164-6-71

    authors: Wang H,He X,Band M,Wilson C,Liu L

    update_date:2005-05-11 00:00:00

  • An analysis of the transcriptome of Teladorsagia circumcincta: its biological and biotechnological implications.

    abstract:BACKGROUND:Teladorsagia circumcincta (order Strongylida) is an economically important parasitic nematode of small ruminants (including sheep and goats) in temperate climatic regions of the world. Improved insights into the molecular biology of this parasite could underpin alternative methods required to control this an...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-13-S7-S10

    authors: Menon R,Gasser RB,Mitreva M,Ranganathan S

    update_date:2012-01-01 00:00:00

  • Clustered regulatory elements at nucleosome-depleted regions punctuate a constant nucleosomal landscape in Schizosaccharomyces pombe.

    abstract:BACKGROUND:Nucleosomes facilitate the packaging of the eukaryotic genome and modulate the access of regulators to DNA. A detailed description of the nucleosomal organization under different transcriptional programmes is essential to understand their contribution to genomic regulation. RESULTS:To visualize the dynamics...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-14-813

    authors: Soriano I,Quintales L,Antequera F

    update_date:2013-11-21 00:00:00

  • New insights into the evolution and functional divergence of the CIPK gene family in Saccharum.

    abstract:BACKGROUND:Calcineurin B-like protein (CBL)-interacting protein kinases (CIPKs) are the primary components of calcium sensors, and play crucial roles in plant developmental processes, hormone signaling transduction, and in the response to exogenous stresses. RESULTS:In this study, 48 CIPK genes (SsCIPKs) were identifi...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-020-07264-9

    authors: Su W,Ren Y,Wang D,Huang L,Fu X,Ling H,Su Y,Huang N,Tang H,Xu L,Que Y

    update_date:2020-12-07 00:00:00

  • Xylem transcription profiles indicate potential metabolic responses for economically relevant characteristics of Eucalyptus species.

    abstract:BACKGROUND:Eucalyptus is one of the most important sources of industrial cellulose. Three species of this botanical group are intensively used in breeding programs: E. globulus, E. grandis and E. urophylla. E. globulus is adapted to subtropical/temperate areas and is considered a source of high-quality cellulose; E. gr...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-14-201

    authors: Salazar MM,Nascimento LC,Camargo EL,Gonçalves DC,Lepikson Neto J,Marques WL,Teixeira PJ,Mieczkowski P,Mondego JM,Carazzolle MF,Deckmann AC,Pereira GA

    update_date:2013-03-22 00:00:00

  • Complex sense-antisense architecture of TNFAIP1/POLDIP2 on 17q11.2 represents a novel transcriptional structural-functional gene module involved in breast cancer progression.

    abstract:BACKGROUND:A sense-antisense gene pair (SAGP) is a gene pair where two oppositely transcribed genes share a common nucleotide sequence region. In eukaryotic genomes, SAGPs can be organized in complex sense-antisense architectures (CSAGAs) in which at least one sense gene shares loci with two or more antisense partners....

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-11-S1-S9

    authors: Grinchuk OV,Motakis E,Kuznetsov VA

    update_date:2010-02-10 00:00:00

  • Functional genomic analysis of constitutive and inducible defense responses to Fusarium verticillioides infection in maize genotypes with contrasting ear rot resistance.

    abstract:BACKGROUND:Fusarium verticillioides causes ear rot in maize (Zea mays L.) and accumulation of mycotoxins, that affect human and animal health. Currently, chemical and agronomic measures to control Fusarium ear rot are not very effective and selection of more resistant genotypes is a desirable strategy to reduce contami...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-15-710

    authors: Lanubile A,Ferrarini A,Maschietto V,Delledonne M,Marocco A,Bellin D

    update_date:2014-08-25 00:00:00

  • Antennal transcriptome analysis of the chemosensory gene families in the tree killing bark beetles, Ips typographus and Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae).

    abstract:BACKGROUND:The European spruce bark beetle, Ips typographus, and the North American mountain pine beetle, Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae), are severe pests of coniferous forests. Both bark beetle species utilize aggregation pheromones to coordinate mass-attacks on host trees, while odora...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-14-198

    authors: Andersson MN,Grosse-Wilde E,Keeling CI,Bengtsson JM,Yuen MM,Li M,Hillbur Y,Bohlmann J,Hansson BS,Schlyter F

    update_date:2013-03-21 00:00:00

  • Antagonism between Staphylococcus epidermidis and Propionibacterium acnes and its genomic basis.

    abstract:BACKGROUND:Propionibacterium acnes and Staphylococcus epidermidis live in close proximity on human skin, and both bacterial species can be isolated from normal and acne vulgaris-affected skin sites. The antagonistic interactions between the two species are poorly understood, as well as the potential significance of bac...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/s12864-016-2489-5

    authors: Christensen GJ,Scholz CF,Enghild J,Rohde H,Kilian M,Thürmer A,Brzuszkiewicz E,Lomholt HB,Brüggemann H

    update_date:2016-02-29 00:00:00

  • Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.).

    abstract:BACKGROUND:Cucumber, Cucumis sativus L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are freque...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-11-569

    authors: Cavagnaro PF,Senalik DA,Yang L,Simon PW,Harkins TT,Kodira CD,Huang S,Weng Y

    update_date:2010-10-15 00:00:00

  • Speckle reducing bilateral filter for cattle follicle segmentation.

    abstract:BACKGROUND:Ultrasound imaging technology has wide applications in cattle reproduction and has been used to monitor individual follicles and determine the patterns of follicular development. However, the speckles in ultrasound images affect the post-processing, such as follicle segmentation and finally affect the measur...

    journal_title:BMC genomics

    pub_type: Journal article

    doi:10.1186/1471-2164-11-S2-S9

    authors: Tang J,Guo S,Sun Q,Deng Y,Zhou D

    update_date:2010-11-02 00:00:00