Abstract:
:Retrotransposons have proliferated extensively in eukaryotic lineages; the genomes of many animals and plants comprise 50% or more retrotransposon sequences by weight. There are several persuasive arguments that the enzymatic lynchpin of retrotransposon replication, reverse transcriptase (RT), is an ancient enzyme. Moreover, the direct progenitors of retrotransposons are thought to be mobile self-splicing introns that actively propagate themselves via reverse transcription, the group II introns, also known as retrointrons. Retrointrons are represented in modern genomes in very modest numbers, and thus far, only in certain eubacterial and organellar genomes. Archaeal genomes are nearly devoid of RT in any form. In this study, I propose a model to explain this unusual distribution, and rationalize it with the proposed ancient origin of the RT gene. A cap and tail hypothesis is proposed. By this hypothesis, the specialized terminal structures of eukaryotic mRNA provide the ideal molecular environment for the lengthening, evolution, and subsequent massive expansion of highly mobile retrotransposons, leading directly to the retrotransposon-cluttered structure that typifies modern metazoan genomes and the eventual emergence of retroviruses.
journal_name
Genome Resjournal_title
Genome researchauthors
Boeke JDdoi
10.1101/gr.1392003subject
Has Abstractpub_date
2003-09-01 00:00:00pages
1975-83issue
9eissn
1088-9051issn
1549-5469pii
13/9/1975journal_volume
13pub_type
杂志文章,评审相关文献
GENOME RESEARCH文献大全abstract::It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.097543.109
更新日期:2010-01-01 00:00:00
abstract::Genome-scale metabolic models promise important insights into cell function. However, the definition of pathways and functional network modules within these models, and in the biochemical literature in general, is often based on intuitive reasoning. Although mathematical methods have been proposed to identify modules,...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5662207
更新日期:2007-04-01 00:00:00
abstract::More than 25 loci have been linked to type 1 diabetes (T1D) in the nonobese diabetic (NOD) mouse, but identification of the underlying genes remains challenging. We describe here the positional cloning of a T1D susceptibility locus, Idd11, located on mouse chromosome 4. Sequence analysis of a series of congenic NOD mo...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.101881.109
更新日期:2010-12-01 00:00:00
abstract::Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch req...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.089060.108
更新日期:2009-10-01 00:00:00
abstract::Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. el...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.244830.118
更新日期:2019-06-01 00:00:00
abstract::Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are curren...
journal_title:Genome research
pub_type: 杂志文章,评审
doi:10.1101/gr.079509.108
更新日期:2010-03-01 00:00:00
abstract::The genomic alterations associated with cancers are numerous and varied, involving both isolated and large-scale complex genomic rearrangements (CGRs). Although the underlying mechanisms are not well understood, CGRs have been implicated in tumorigenesis. Here, we introduce CouGaR, a novel method for characterizing th...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.211201.116
更新日期:2017-01-01 00:00:00
abstract::X inactivation equalizes the dosage of gene expression between the sexes, but some genes escape silencing and are thus expressed from both alleles in females. To survey X inactivation and escape in mouse, we performed RNA sequencing in Mus musculus x Mus spretus cells with complete skewing of X inactivation, relying o...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.103200.109
更新日期:2010-05-01 00:00:00
abstract::Fish-mammal genomic comparisons have proved powerful in identifying conserved noncoding elements likely to be cis-regulatory in nature, and the majority of those tested in vivo have been shown to act as tissue-specific enhancers associated with genes involved in transcriptional regulation of development. Although most...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.4143406
更新日期:2006-04-01 00:00:00
abstract::A rigorous analysis of the Merck-sponsored EST data with respect to known gene sequences increases the utility of the data set and helps refine methods for building a gene index. A highly curated human transcript data base was used as a reference data set of known genes. A detailed analysis of EST sequences derived fr...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6.9.829
更新日期:1996-09-01 00:00:00
abstract::Despite the availability of dozens of animal genome sequences, two key questions remain unanswered: First, what fraction of any species' genome confers biological function, and second, are apparent differences in organismal complexity reflected in an objective measure of genomic complexity? Here, we address both quest...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.108795.110
更新日期:2010-10-01 00:00:00
abstract::In plants there are several classes of 21-24-nt short RNAs that regulate gene expression. The most conserved class is the microRNAs (miRNAs), although some miRNAs are found only in specific species. We used high-throughput pyrosequencing to identify conserved and nonconserved miRNAs and other short RNAs in tomato frui...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.080127.108
更新日期:2008-10-01 00:00:00
abstract::Early embryogenesis is characterized by the maternal to zygotic transition (MZT), in which maternally deposited messenger RNAs are degraded while zygotic transcription begins. Before the MZT, post-transcriptional gene regulation by RNA-binding proteins (RBPs) is the dominant force in embryo patterning. We used two mRN...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.200386.115
更新日期:2016-07-01 00:00:00
abstract::Transcription factors (TFs) are key mediators that propagate extracellular and intracellular signals through to changes in gene expression profiles. However, the rules by which promoters decode the amount of active TF into target gene expression are not well understood. To determine the mapping between promoter DNA se...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.212316.116
更新日期:2017-01-01 00:00:00
abstract::Transcription factors canonically bind nucleosome-free DNA, making the positioning of nucleosomes within regulatory regions crucial to the regulation of gene expression. Using the assay of transposase accessible chromatin (ATAC-seq), we observe a highly structured pattern of DNA fragment lengths and positions around n...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.192294.115
更新日期:2015-11-01 00:00:00
abstract::Despite much research, our understanding of the architecture and cis-regulatory elements of human promoters is still lacking. Here, we devised a high-throughput assay to quantify the activity of approximately 15,000 fully designed sequences that we integrated and expressed from a fixed location within the human genome...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.236075.118
更新日期:2019-02-01 00:00:00
abstract::MULTIPROSPECTOR, a multimeric threading algorithm for the prediction of protein-protein interactions, is applied to the genome of Saccharomyces cerevisiae. Each possible pairwise interaction among more than 6000 encoded proteins is evaluated against a dimer database of 768 complex structures by using a confidence esti...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.1145203
更新日期:2003-06-01 00:00:00
abstract::Highly overlapping patterns of genome-wide binding of many distinct transcription factors have been observed in worms, insects, and mammals, but the origins and consequences of this overlapping binding remain unclear. While analyzing chromatin immunoprecipitation data sets from 21 sequence-specific transcription facto...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.130682.111
更新日期:2012-04-01 00:00:00
abstract::We have used the FANTOM2 mouse cDNA set (60,770 clones), public mRNA data, and mouse genome sequence data to identify 2481 pairs of sense-antisense transcripts and 899 further pairs of nonantisense bidirectional transcription based upon genomic mapping. The analysis greatly expands the number of known examples of sens...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.982903
更新日期:2003-06-01 00:00:00
abstract::While metagenomics has emerged as a technology of choice for analyzing bacterial populations, the assembly of metagenomic data remains challenging, thus stifling biological discoveries. Moreover, recent studies revealed that complex bacterial populations may be composed from dozens of related strains, thus further amp...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.213959.116
更新日期:2017-05-01 00:00:00
abstract::The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.106344.110
更新日期:2010-11-01 00:00:00
abstract::Mosaic mutations present in the germline have important implications for reproductive risk and disease transmission. We previously demonstrated a phenomenon occurring in the male germline, whereby specific mutations arising spontaneously in stem cells (spermatogonia) lead to clonal expansion, resulting in elevated mut...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.239186.118
更新日期:2018-12-01 00:00:00
abstract::The human genome is estimated to contain 23,000 to 33,000 retropseudogenes. To study the properties of genes giving rise to these retroelements, we compared the structure and expression of genes with or without known retropseudogenes. Four main features have emerged from the analysis of 181 genes associated to retrops...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.10.5.672
更新日期:2000-05-01 00:00:00
abstract::Next-generation sequencing technologies have made it possible to sequence targeted regions of the human genome in hundreds of individuals. Deep sequencing represents a powerful approach for the discovery of the complete spectrum of DNA sequence variants in functionally important genomic intervals. Current methods for ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.100040.109
更新日期:2010-04-01 00:00:00
abstract::Through comparative studies of the model organism Arabidopsis thaliana and its close relative Brassica oleracea, we have identified conserved regions that represent potentially functional sequences overlooked by previous Arabidopsis genome annotation methods. A total of 454,274 whole genome shotgun sequences covering ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.3176505
更新日期:2005-04-01 00:00:00
abstract::Although changes in chromatin are integral to transcriptional reprogramming during cellular differentiation, it is currently unclear how chromatin modifications are targeted to specific loci. To systematically identify transcription factors (TFs) that can direct chromatin changes during cell fate decisions, we model t...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.142661.112
更新日期:2013-01-01 00:00:00
abstract::The Mammalian Gene Collection (MGC) consortium (http://mgc.nci.nih.gov) seeks to establish publicly available collections of full-ORF cDNAs for several organisms of significance to biomedical research, including human. To date over 15,200 human cDNA clones containing full-length open reading frames (ORFs) have been id...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.2473704
更新日期:2004-10-01 00:00:00
abstract::Dual channel imaging and warping of two-dimensional (2D) protein gels were used to visualize global changes of the gene expression patterns in growing Bacillus subtilis cells during entry into the stationary phase as triggered by glucose exhaustion. The 2D gels only depict single moments during the cells' growth cycle...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.905003
更新日期:2003-02-01 00:00:00
abstract::Clusters of functionally related genes can be disrupted by a single copy number variant (CNV). We demonstrate that the simultaneous disruption of multiple functionally related genes is a frequent and significant characteristic of de novo CNVs in patients with developmental disorders (P = 1 × 10(-3)). Using three diffe...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.184325.114
更新日期:2015-06-01 00:00:00
abstract::The determination of the phylogenetic relationships among microorganisms has long relied primarily on gene sequence information. Given that prokaryotic organisms often lack morphological characteristics amenable to phylogenetic analysis, prokaryotic phylogenies, in particular, are often based on sequence data. In this...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.3033805
更新日期:2005-03-01 00:00:00