The unusual phylogenetic distribution of retrotransposons: a hypothesis.

Abstract:

:Retrotransposons have proliferated extensively in eukaryotic lineages; the genomes of many animals and plants comprise 50% or more retrotransposon sequences by weight. There are several persuasive arguments that the enzymatic lynchpin of retrotransposon replication, reverse transcriptase (RT), is an ancient enzyme. Moreover, the direct progenitors of retrotransposons are thought to be mobile self-splicing introns that actively propagate themselves via reverse transcription, the group II introns, also known as retrointrons. Retrointrons are represented in modern genomes in very modest numbers, and thus far, only in certain eubacterial and organellar genomes. Archaeal genomes are nearly devoid of RT in any form. In this study, I propose a model to explain this unusual distribution, and rationalize it with the proposed ancient origin of the RT gene. A cap and tail hypothesis is proposed. By this hypothesis, the specialized terminal structures of eukaryotic mRNA provide the ideal molecular environment for the lengthening, evolution, and subsequent massive expansion of highly mobile retrotransposons, leading directly to the retrotransposon-cluttered structure that typifies modern metazoan genomes and the eventual emergence of retroviruses.

journal_name

Genome Res

journal_title

Genome research

authors

Boeke JD

doi

10.1101/gr.1392003

subject

Has Abstract

pub_date

2003-09-01 00:00:00

pages

1975-83

issue

9

eissn

1088-9051

issn

1549-5469

pii

13/9/1975

journal_volume

13

pub_type

杂志文章,评审
  • Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error.

    abstract::It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.097543.109

    authors: Liu X,Fu YX,Maxwell TJ,Boerwinkle E

    更新日期:2010-01-01 00:00:00

  • Evaluation of predicted network modules in yeast metabolism using NMR-based metabolite profiling.

    abstract::Genome-scale metabolic models promise important insights into cell function. However, the definition of pathways and functional network modules within these models, and in the biochemical literature in general, is often based on intuitive reasoning. Although mathematical methods have been proposed to identify modules,...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5662207

    authors: Bundy JG,Papp B,Harmston R,Browne RA,Clayson EM,Burton N,Reece RJ,Oliver SG,Brindle KM

    更新日期:2007-04-01 00:00:00

  • A recombination hotspot leads to sequence variability within a novel gene (AK005651) and contributes to type 1 diabetes susceptibility.

    abstract::More than 25 loci have been linked to type 1 diabetes (T1D) in the nonobese diabetic (NOD) mouse, but identification of the underlying genes remains challenging. We describe here the positional cloning of a T1D susceptibility locus, Idd11, located on mouse chromosome 4. Sequence analysis of a series of congenic NOD mo...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.101881.109

    authors: Tan IK,Mackin L,Wang N,Papenfuss AT,Elso CM,Ashton MP,Quirk F,Phipson B,Bahlo M,Speed TP,Smyth GK,Morahan G,Brodnicki TC

    更新日期:2010-12-01 00:00:00

  • Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function.

    abstract::Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch req...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.089060.108

    authors: Grobei MA,Qeli E,Brunner E,Rehrauer H,Zhang R,Roschitzki B,Basler K,Ahrens CH,Grossniklaus U

    更新日期:2009-10-01 00:00:00

  • Recompleting the Caenorhabditis elegans genome.

    abstract::Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. el...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.244830.118

    authors: Yoshimura J,Ichikawa K,Shoura MJ,Artiles KL,Gabdank I,Wahba L,Smith CL,Edgley ML,Rougvie AE,Fire AZ,Morishita S,Schwarz EM

    更新日期:2019-06-01 00:00:00

  • Population genetic inference from genomic sequence variation.

    abstract::Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are curren...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:10.1101/gr.079509.108

    authors: Pool JE,Hellmann I,Jensen JD,Nielsen R

    更新日期:2010-03-01 00:00:00

  • Identification of complex genomic rearrangements in cancers using CouGaR.

    abstract::The genomic alterations associated with cancers are numerous and varied, involving both isolated and large-scale complex genomic rearrangements (CGRs). Although the underlying mechanisms are not well understood, CGRs have been implicated in tumorigenesis. Here, we introduce CouGaR, a novel method for characterizing th...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.211201.116

    authors: Dzamba M,Ramani AK,Buczkowicz P,Jiang Y,Yu M,Hawkins C,Brudno M

    更新日期:2017-01-01 00:00:00

  • Global survey of escape from X inactivation by RNA-sequencing in mouse.

    abstract::X inactivation equalizes the dosage of gene expression between the sexes, but some genes escape silencing and are thus expressed from both alleles in females. To survey X inactivation and escape in mouse, we performed RNA sequencing in Mus musculus x Mus spretus cells with complete skewing of X inactivation, relying o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.103200.109

    authors: Yang F,Babak T,Shendure J,Disteche CM

    更新日期:2010-05-01 00:00:00

  • Ancient duplicated conserved noncoding elements in vertebrates: a genomic and functional analysis.

    abstract::Fish-mammal genomic comparisons have proved powerful in identifying conserved noncoding elements likely to be cis-regulatory in nature, and the majority of those tested in vivo have been shown to act as tissue-specific enhancers associated with genes involved in transcriptional regulation of development. Although most...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4143406

    authors: McEwen GK,Woolfe A,Goode D,Vavouri T,Callaway H,Elgar G

    更新日期:2006-04-01 00:00:00

  • Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data.

    abstract::A rigorous analysis of the Merck-sponsored EST data with respect to known gene sequences increases the utility of the data set and helps refine methods for building a gene index. A highly curated human transcript data base was used as a reference data set of known genes. A detailed analysis of EST sequences derived fr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.9.829

    authors: Aaronson JS,Eckman B,Blevins RA,Borkowski JA,Myerson J,Imran S,Elliston KO

    更新日期:1996-09-01 00:00:00

  • Massive turnover of functional sequence in human and other mammalian genomes.

    abstract::Despite the availability of dozens of animal genome sequences, two key questions remain unanswered: First, what fraction of any species' genome confers biological function, and second, are apparent differences in organismal complexity reflected in an objective measure of genomic complexity? Here, we address both quest...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.108795.110

    authors: Meader S,Ponting CP,Lunter G

    更新日期:2010-10-01 00:00:00

  • Deep sequencing of tomato short RNAs identifies microRNAs targeting genes involved in fruit ripening.

    abstract::In plants there are several classes of 21-24-nt short RNAs that regulate gene expression. The most conserved class is the microRNAs (miRNAs), although some miRNAs are found only in specific species. We used high-throughput pyrosequencing to identify conserved and nonconserved miRNAs and other short RNAs in tomato frui...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.080127.108

    authors: Moxon S,Jing R,Szittya G,Schwach F,Rusholme Pilcher RL,Moulton V,Dalmay T

    更新日期:2008-10-01 00:00:00

  • The mRNA-bound proteome of the early fly embryo.

    abstract::Early embryogenesis is characterized by the maternal to zygotic transition (MZT), in which maternally deposited messenger RNAs are degraded while zygotic transcription begins. Before the MZT, post-transcriptional gene regulation by RNA-binding proteins (RBPs) is the dominant force in embryo patterning. We used two mRN...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.200386.115

    authors: Wessels HH,Imami K,Baltz AG,Kolinski M,Beldovskaya A,Selbach M,Small S,Ohler U,Landthaler M

    更新日期:2016-07-01 00:00:00

  • Large-scale mapping of gene regulatory logic reveals context-dependent repression by transcriptional activators.

    abstract::Transcription factors (TFs) are key mediators that propagate extracellular and intracellular signals through to changes in gene expression profiles. However, the rules by which promoters decode the amount of active TF into target gene expression are not well understood. To determine the mapping between promoter DNA se...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.212316.116

    authors: van Dijk D,Sharon E,Lotan-Pompan M,Weinberger A,Segal E,Carey LB

    更新日期:2017-01-01 00:00:00

  • Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions.

    abstract::Transcription factors canonically bind nucleosome-free DNA, making the positioning of nucleosomes within regulatory regions crucial to the regulation of gene expression. Using the assay of transposase accessible chromatin (ATAC-seq), we observe a highly structured pattern of DNA fragment lengths and positions around n...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.192294.115

    authors: Schep AN,Buenrostro JD,Denny SK,Schwartz K,Sherlock G,Greenleaf WJ

    更新日期:2015-11-01 00:00:00

  • Systematic interrogation of human promoters.

    abstract::Despite much research, our understanding of the architecture and cis-regulatory elements of human promoters is still lacking. Here, we devised a high-throughput assay to quantify the activity of approximately 15,000 fully designed sequences that we integrated and expressed from a fixed location within the human genome...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.236075.118

    authors: Weingarten-Gabbay S,Nir R,Lubliner S,Sharon E,Kalma Y,Weinberger A,Segal E

    更新日期:2019-02-01 00:00:00

  • Multimeric threading-based prediction of protein-protein interactions on a genomic scale: application to the Saccharomyces cerevisiae proteome.

    abstract::MULTIPROSPECTOR, a multimeric threading algorithm for the prediction of protein-protein interactions, is applied to the genome of Saccharomyces cerevisiae. Each possible pairwise interaction among more than 6000 encoded proteins is evaluated against a dimer database of 768 complex structures by using a confidence esti...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1145203

    authors: Lu L,Arakaki AK,Lu H,Skolnick J

    更新日期:2003-06-01 00:00:00

  • The TAGteam motif facilitates binding of 21 sequence-specific transcription factors in the Drosophila embryo.

    abstract::Highly overlapping patterns of genome-wide binding of many distinct transcription factors have been observed in worms, insects, and mammals, but the origins and consequences of this overlapping binding remain unclear. While analyzing chromatin immunoprecipitation data sets from 21 sequence-specific transcription facto...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.130682.111

    authors: Satija R,Bradley RK

    更新日期:2012-04-01 00:00:00

  • Antisense transcripts with FANTOM2 clone set and their implications for gene regulation.

    abstract::We have used the FANTOM2 mouse cDNA set (60,770 clones), public mRNA data, and mouse genome sequence data to identify 2481 pairs of sense-antisense transcripts and 899 further pairs of nonantisense bidirectional transcription based upon genomic mapping. The analysis greatly expands the number of known examples of sens...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.982903

    authors: Kiyosawa H,Yamanaka I,Osato N,Kondo S,Hayashizaki Y,RIKEN GER Group.,GSL Members.

    更新日期:2003-06-01 00:00:00

  • metaSPAdes: a new versatile metagenomic assembler.

    abstract::While metagenomics has emerged as a technology of choice for analyzing bacterial populations, the assembly of metagenomic data remains challenging, thus stifling biological discoveries. Moreover, recent studies revealed that complex bacterial populations may be composed from dozens of related strains, thus further amp...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.213959.116

    authors: Nurk S,Meleshko D,Korobeynikov A,Pevzner PA

    更新日期:2017-05-01 00:00:00

  • Detecting copy number variation with mated short reads.

    abstract::The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106344.110

    authors: Medvedev P,Fiume M,Dzamba M,Smith T,Brudno M

    更新日期:2010-11-01 00:00:00

  • Selfish mutations dysregulating RAS-MAPK signaling are pervasive in aged human testes.

    abstract::Mosaic mutations present in the germline have important implications for reproductive risk and disease transmission. We previously demonstrated a phenomenon occurring in the male germline, whereby specific mutations arising spontaneously in stem cells (spermatogonia) lead to clonal expansion, resulting in elevated mut...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.239186.118

    authors: Maher GJ,Ralph HK,Ding Z,Koelling N,Mlcochova H,Giannoulatou E,Dhami P,Paul DS,Stricker SH,Beck S,McVean G,Wilkie AOM,Goriely A

    更新日期:2018-12-01 00:00:00

  • Nature and structure of human genes that generate retropseudogenes.

    abstract::The human genome is estimated to contain 23,000 to 33,000 retropseudogenes. To study the properties of genes giving rise to these retroelements, we compared the structure and expression of genes with or without known retropseudogenes. Four main features have emerged from the analysis of 181 genes associated to retrops...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.10.5.672

    authors: Gonçalves I,Duret L,Mouchiroud D

    更新日期:2000-05-01 00:00:00

  • Accurate detection and genotyping of SNPs utilizing population sequencing data.

    abstract::Next-generation sequencing technologies have made it possible to sequence targeted regions of the human genome in hundreds of individuals. Deep sequencing represents a powerful approach for the discovery of the complete spectrum of DNA sequence variants in functionally important genomic intervals. Current methods for ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.100040.109

    authors: Bansal V,Harismendy O,Tewhey R,Murray SS,Schork NJ,Topol EJ,Frazer KA

    更新日期:2010-04-01 00:00:00

  • Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis.

    abstract::Through comparative studies of the model organism Arabidopsis thaliana and its close relative Brassica oleracea, we have identified conserved regions that represent potentially functional sequences overlooked by previous Arabidopsis genome annotation methods. A total of 454,274 whole genome shotgun sequences covering ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3176505

    authors: Ayele M,Haas BJ,Kumar N,Wu H,Xiao Y,Van Aken S,Utterback TR,Wortman JR,White OR,Town CD

    更新日期:2005-04-01 00:00:00

  • Modeling of epigenome dynamics identifies transcription factors that mediate Polycomb targeting.

    abstract::Although changes in chromatin are integral to transcriptional reprogramming during cellular differentiation, it is currently unclear how chromatin modifications are targeted to specific loci. To systematically identify transcription factors (TFs) that can direct chromatin changes during cell fate decisions, we model t...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.142661.112

    authors: Arnold P,Schöler A,Pachkov M,Balwierz PJ,Jørgensen H,Stadler MB,van Nimwegen E,Schübeler D

    更新日期:2013-01-01 00:00:00

  • Systematic recovery and analysis of full-ORF human cDNA clones.

    abstract::The Mammalian Gene Collection (MGC) consortium (http://mgc.nci.nih.gov) seeks to establish publicly available collections of full-ORF cDNAs for several organisms of significance to biomedical research, including human. To date over 15,200 human cDNA clones containing full-length open reading frames (ORFs) have been id...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2473704

    authors: Baross A,Butterfield YS,Coughlin SM,Zeng T,Griffith M,Griffith OL,Petrescu AS,Smailus DE,Khattra J,McDonald HL,McKay SJ,Moksa M,Holt RA,Marra MA

    更新日期:2004-10-01 00:00:00

  • Bacillus subtilis during feast and famine: visualization of the overall regulation of protein synthesis during glucose starvation by proteome analysis.

    abstract::Dual channel imaging and warping of two-dimensional (2D) protein gels were used to visualize global changes of the gene expression patterns in growing Bacillus subtilis cells during entry into the stationary phase as triggered by glucose exhaustion. The 2D gels only depict single moments during the cells' growth cycle...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.905003

    authors: Bernhardt J,Weibezahn J,Scharf C,Hecker M

    更新日期:2003-02-01 00:00:00

  • The clustering of functionally related genes contributes to CNV-mediated disease.

    abstract::Clusters of functionally related genes can be disrupted by a single copy number variant (CNV). We demonstrate that the simultaneous disruption of multiple functionally related genes is a frequent and significant characteristic of de novo CNVs in patients with developmental disorders (P = 1 × 10(-3)). Using three diffe...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.184325.114

    authors: Andrews T,Honti F,Pfundt R,de Leeuw N,Hehir-Kwa J,Vulto-van Silfhout A,de Vries B,Webber C

    更新日期:2015-06-01 00:00:00

  • Prokaryotic phylogenies inferred from protein structural domains.

    abstract::The determination of the phylogenetic relationships among microorganisms has long relied primarily on gene sequence information. Given that prokaryotic organisms often lack morphological characteristics amenable to phylogenetic analysis, prokaryotic phylogenies, in particular, are often based on sequence data. In this...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3033805

    authors: Deeds EJ,Hennessey H,Shakhnovich EI

    更新日期:2005-03-01 00:00:00