Abstract:
:By analyzing 1,780,295 5'-end sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 clusters that were separated from each other by more than 500 bp and thus are very likely to constitute mutually distinct alternative promoters. To our surprise, at least 7674 (52%) human RefSeq genes were subject to regulation by putative alternative promoters (PAPs). On average, there were 3.1 PAPs per gene, with the composition of one CpG-island-containing promoter per 2.6 CpG-less promoters. In 17% of the PAP-containing loci, tissue-specific use of the PAPs was observed. The richest tissue sources of the tissue-specific PAPs were testis and brain. It was also intriguing that the PAP-containing promoters were enriched in the genes encoding signal transduction-related proteins and were rarer in the genes encoding extracellular proteins, possibly reflecting the varied functional requirement for and the restricted expression of those categories of genes, respectively. The patterns of the first exons were highly diverse as well. On average, there were 7.7 different splicing types of first exons per locus partly produced by the PAPs, suggesting that a wide variety of transcripts can be achieved by this mechanism. Our findings suggest that use of alternate promoters and consequent alternative use of first exons should play a pivotal role in generating the complexity required for the highly elaborated molecular systems in humans.
journal_name
Genome Resjournal_title
Genome researchauthors
Kimura K,Wakamatsu A,Suzuki Y,Ota T,Nishikawa T,Yamashita R,Yamamoto J,Sekine M,Tsuritani K,Wakaguri H,Ishii S,Sugiyama T,Saito K,Isono Y,Irie R,Kushida N,Yoneyama T,Otsuka R,Kanda K,Yokoi T,Kondo H,Wagatsuma Mdoi
10.1101/gr.4039406subject
Has Abstractpub_date
2006-01-01 00:00:00pages
55-65issue
1eissn
1088-9051issn
1549-5469pii
gr.4039406journal_volume
16pub_type
杂志文章相关文献
GENOME RESEARCH文献大全abstract::A large database of copy number profiles from cancer genomes can facilitate the identification of recurrent chromosomal alterations that often contain key cancer-related genes. It can also be used to explore low-prevalence genomic events such as chromothripsis. In this study, we report an analysis of 8227 human cancer...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.140301.112
更新日期:2013-02-01 00:00:00
abstract::Diversity in the antigen-binding receptors of the immune system has long been a primary interest of biologists. Recently it has been suggested that polymorphism in regulatory (noncoding) gene segments is of substantial importance as well. Here, we survey the level of variation in MHC class II gene promoters in man and...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.8.2.124
更新日期:1998-02-01 00:00:00
abstract::In diploid mammalian genomes, parental alleles can exhibit different methylation patterns (allele-specific DNA methylation, ASM), which have been documented in a small number of cases except for the imprinted regions and X chromosomes in females. We carried out a chromosome-wide survey of ASM across 16 human pluripote...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.104695.109
更新日期:2010-07-01 00:00:00
abstract::Recent evidence from proteomics and deep massively parallel sequencing studies have revealed that eukaryotic genomes contain substantial numbers of as-yet-uncharacterized open reading frames (ORFs). We define these uncharacterized ORFs as novel ORFs (nORFs). nORFs in humans are mostly under 100 codons and are found in...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.263202.120
更新日期:2021-01-19 00:00:00
abstract::Single-cell sequencing (SCS) is a powerful new tool for investigating evolution and diversity in cancer and understanding the role of rare cells in tumor progression. These methods have begun to unravel key questions in cancer biology that have been difficult to address with bulk tumor measurements. Over the past five...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.191098.115
更新日期:2015-10-01 00:00:00
abstract::Recent advances toward the characterization of Alzheimer's disease (AD) have permitted the identification of a dozen of genetic risk factors, although many more remain undiscovered. In parallel, works in the field of network biology have shown a strong link between protein connectivity and disease. In this manuscript,...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.114280.110
更新日期:2011-03-01 00:00:00
abstract::We have developed a new tool to visualize expression data on metabolic pathways and to evaluate which metabolic pathways are most affected by transcriptional changes in whole-genome expression experiments. Using the Fisher Exact Test, the method scores biochemical pathways according to the probability that as many or ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.226602
更新日期:2002-07-01 00:00:00
abstract::Eukaryotic DNA replication initiates from multiple discrete sites in the genome, termed origins of replication (origins). Prior to S phase, multiple origins are poised to initiate replication by recruitment of the pre-replicative complex (pre-RC). For proper replication to occur, origin activation must be tightly regu...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.209940.116
更新日期:2017-02-01 00:00:00
abstract::A DNA mutation detection protocol able to identify and characterize a previously unknown change in a given sequence in a rapid, efficient, sensitive, and inexpensive manner is required to take advantage of the resources now available to researchers through the genome sequencing projects. We have developed a method bas...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.gr-1578r
更新日期:2002-09-01 00:00:00
abstract::We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal change...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.828403
更新日期:2003-01-01 00:00:00
abstract::Sequencing of the human Y chromosome has uncovered the peculiarities of the genomic organization of a heterogametic sex chromosome of old evolutionary age, and has led to many insights into the evolutionary changes that occurred during its long history. We have studied the genomic organization of the medaka fish Y chr...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5016106
更新日期:2006-07-01 00:00:00
abstract::PipMaker (http://bio.cse.psu.edu) is a World-Wide Web site for comparing two long DNA sequences to identify conserved segments and for producing informative, high-resolution displays of the resulting alignments. One display is a percent identity plot (pip), which shows both the position in one sequence and the degree ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.10.4.577
更新日期:2000-04-01 00:00:00
abstract::A large number of cis-regulatory motifs involved in transcriptional control have been identified, but the regulatory context and biological processes in which many of them function are unknown. Here, we computationally identify the sets of human core promoters targeted by motifs, and systematically characterize their ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6828808
更新日期:2008-03-01 00:00:00
abstract::Disregulation of imprinted genes can be associated with tumorigenesis and altered cell differentiation capacity and so could provide adverse outcomes for stem cell applications. Although the maintenance of mouse and primate embryonic stem cells in a pluripotent state has been reported to disrupt the monoallelic expres...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6609207
更新日期:2007-12-01 00:00:00
abstract::Oncoviral infection is responsible for 12%-15% of cancer in humans. Convergent evidence from epidemiology, pathology, and oncology suggests that new viral etiologies for cancers remain to be discovered. Oncoviral profiles can be obtained from cancer genome sequencing data; however, widespread viral sequence contaminat...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.242529.118
更新日期:2019-05-01 00:00:00
abstract::Aberrations of protein-coding genes are a focus of cancer genomics; however, the impact of oncogenes on expression of the ~50% of transcripts without protein-coding potential, including long noncoding RNAs (lncRNAs), has been largely uncharacterized. Activating mutations in the BRAF oncogene are present in >70% of mel...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.140061.112
更新日期:2012-06-01 00:00:00
abstract::Nutrient availability profoundly influences gene expression. Many animal genes encode multiple transcript isoforms, yet the effect of nutrient availability on transcript isoform expression has not been studied in genome-wide fashion. When Caenorhabditis elegans larvae hatch without food, they arrest development in the...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.133587.111
更新日期:2012-10-01 00:00:00
abstract::In mammals, genome-wide chromatin maps and immunofluorescence studies show that broad domains of repressive histone modifications are present on pericentromeric and telomeric repeats and on the inactive X chromosome. However, only a few autosomal loci such as silent Hox gene clusters have been shown to lie in broad do...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.080861.108
更新日期:2009-02-01 00:00:00
abstract::Retrotransposons have proliferated extensively in eukaryotic lineages; the genomes of many animals and plants comprise 50% or more retrotransposon sequences by weight. There are several persuasive arguments that the enzymatic lynchpin of retrotransposon replication, reverse transcriptase (RT), is an ancient enzyme. Mo...
journal_title:Genome research
pub_type: 杂志文章,评审
doi:10.1101/gr.1392003
更新日期:2003-09-01 00:00:00
abstract::Short insertions and deletions (indels) are the second most abundant form of human genetic variation, but our understanding of their origins and functional effects lags behind that of other types of variants. Using population-scale sequencing, we have identified a high-quality set of 1.6 million indels from 179 indivi...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.148718.112
更新日期:2013-05-01 00:00:00
abstract::The incorporation and creation of modified nucleobases in DNA have profound effects on genome function. We describe methods for mapping positions and local content of modified DNA nucleobases in genomic DNA. We combined in vitro nucleobase excision with massively parallel DNA sequencing (Excision-seq) to determine the...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.174052.114
更新日期:2014-09-01 00:00:00
abstract::Core promoters mediate transcription initiation by the integration of diverse regulatory signals encoded in the proximal promoter and enhancers. It has been suggested that genes under simple regulation may have low-complexity permissive promoters. For these genes, the core promoter may serve as the principal regulator...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.113381.110
更新日期:2011-05-01 00:00:00
abstract::Genomics data introduce a substantial computational burden as well as data privacy and ownership issues. Data sets generated by high-throughput sequencing platforms require immense amounts of computational resources to align to reference genomes and to call and annotate genomic variants. This problem is even more pron...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.207464.116
更新日期:2018-09-01 00:00:00
abstract::In mammals, genetic recombination during meiosis is limited to a set of 1- to 2-kb regions termed hotspots. Their locations are predominantly determined by the zinc finger protein PRDM9, which binds to DNA in hotspots and subsequently uses its SET domain to locally trimethylate histone H3 at lysine 4 (H3K4me3). This s...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.170167.113
更新日期:2014-05-01 00:00:00
abstract::We have assembled a first-generation anchor map of the mouse genome using a panel of 94 whole-genome-radiation hybrids (WG-RHs) and 271 sequence-tagged sites (STSs). This is the first genome-wide RH anchor map of a model organism. All of the STSs have been previously localized on the genetic map and are located 8.8 Mb...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.7.12.1153
更新日期:1997-12-01 00:00:00
abstract::DNA transposons, or class 2 transposable elements, have successfully propagated in a wide variety of genomes. However, it is widely believed that DNA transposon activity has ceased in mammalian genomes for at least the last 40 million years. We recently reported evidence for the relatively recent activity of hAT and H...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.071886.107
更新日期:2008-05-01 00:00:00
abstract::Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after k...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.106054.110
更新日期:2010-10-01 00:00:00
abstract::The regulation of gene expression is mediated at the transcriptional level by enhancer regions that are bound by sequence-specific transcription factors (TFs). Recent studies have shown that the in vivo binding sites of single TFs differ between developmental or cellular contexts. How this context-specific binding is ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.132811.111
更新日期:2012-10-01 00:00:00
abstract::Somatic L1 retrotransposition events have been shown to occur in epithelial cancers. Here, we attempted to determine how early somatic L1 insertions occurred during the development of gastrointestinal (GI) cancers. Using L1-targeted resequencing (L1-seq), we studied different stages of four colorectal cancers arising ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.196238.115
更新日期:2015-10-01 00:00:00
abstract::Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous pr...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.4759706
更新日期:2006-06-01 00:00:00