Abstract:
:Despite much research, our understanding of the architecture and cis-regulatory elements of human promoters is still lacking. Here, we devised a high-throughput assay to quantify the activity of approximately 15,000 fully designed sequences that we integrated and expressed from a fixed location within the human genome. We used this method to investigate thousands of native promoters and preinitiation complex (PIC) binding regions followed by in-depth characterization of the sequence motifs underlying promoter activity, including core promoter elements and TF binding sites. We find that core promoters drive transcription mostly unidirectionally and that sequences originating from promoters exhibit stronger activity than those originating from enhancers. By testing multiple synthetic configurations of core promoter elements, we dissect the motifs that positively and negatively regulate transcription as well as the effect of their combinations and distances, including a 10-bp periodicity in the optimal distance between the TATA and the initiator. By comprehensively screening 133 TF binding sites, we find that in contrast to core promoters, TF binding sites maintain similar activity levels in both orientations, supporting a model by which divergent transcription is driven by two distinct unidirectional core promoters sharing bidirectional TF binding sites. Finally, we find a striking agreement between the effect of binding site multiplicity of individual TFs in our assay and their tendency to appear in homotypic clusters throughout the genome. Overall, our study systematically assays the elements that drive expression in core and proximal promoter regions and sheds light on organization principles of regulatory regions in the human genome.
journal_name
Genome Resjournal_title
Genome researchauthors
Weingarten-Gabbay S,Nir R,Lubliner S,Sharon E,Kalma Y,Weinberger A,Segal Edoi
10.1101/gr.236075.118subject
Has Abstractpub_date
2019-02-01 00:00:00pages
171-183issue
2eissn
1088-9051issn
1549-5469pii
gr.236075.118journal_volume
29pub_type
杂志文章相关文献
GENOME RESEARCH文献大全abstract::Whole-genome sequencing using massively parallel sequencing technologies enables accurate detection of somatic rearrangements in cancer. Pinpointing large numbers of rearrangement breakpoints to base-pair resolution allows analysis of rearrangement microhomology and genomic location for every sample. Here we analyze 9...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.141382.112
更新日期:2013-02-01 00:00:00
abstract::Recent evidence from proteomics and deep massively parallel sequencing studies have revealed that eukaryotic genomes contain substantial numbers of as-yet-uncharacterized open reading frames (ORFs). We define these uncharacterized ORFs as novel ORFs (nORFs). nORFs in humans are mostly under 100 codons and are found in...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.263202.120
更新日期:2021-01-19 00:00:00
abstract::Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have charac...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.151472.112
更新日期:2013-06-01 00:00:00
abstract::Intra-tumoral genetic heterogeneity has been characterized across cancers by genome sequencing of bulk tumors, including chronic lymphocytic leukemia (CLL). In order to more accurately identify subclones, define phylogenetic relationships, and probe genotype-phenotype relationships, we developed methods for targeted m...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.217331.116
更新日期:2017-08-01 00:00:00
abstract::The current Caenorhabditis elegans genomic annotation has many genes organized in operons. Using directionally stitched promoterGFP methodology, we have conducted the largest survey to date on the regulatory regions of annotated C. elegans operons and identified 65, over 25% of those studied, with internal promoters. ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6824707
更新日期:2007-10-01 00:00:00
abstract::The association of subclasses of Alu repetitive elements with various classes of trinucleotide and tetranucleotide microsatellites was characterized as a first step toward advancing our understanding of the evolution of microsatellite repeats. In addition, information regarding the association of specific classes of m...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.7.7.716
更新日期:1997-07-01 00:00:00
abstract::Phylogenetic footprinting is a method for the discovery of regulatory elements in a set of orthologous regulatory regions from multiple species. It does so by identifying the best conserved motifs in those orthologous regions. We describe a computer algorithm designed specifically for this purpose, making use of the p...
journal_title:Genome research
pub_type: 信件
doi:10.1101/gr.6902
更新日期:2002-05-01 00:00:00
abstract::To accelerate the molecular analysis of behavior in the honey bee (Apis mellifera), we created expressed sequence tag (EST) and cDNA microarray resources for the bee brain. Over 20,000 cDNA clones were partially sequenced from a normalized (and subsequently subtracted) library generated from adult A. mellifera brains....
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5302
更新日期:2002-04-01 00:00:00
abstract::Transcriptional networks have been shown to evolve very rapidly, prompting questions as to how such changes arise and are tolerated. Recent comparisons of transcriptional networks across species have implicated variations in the cis-acting DNA sequences near genes as the main cause of divergence. What is less clear is...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.111765.110
更新日期:2010-12-01 00:00:00
abstract::Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of inter...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.136739.111
更新日期:2013-01-01 00:00:00
abstract::Facio-scapulo-humeral dystrophy (FSHD), a muscular hereditary disease with a prevalence of 1 in 20,000, is caused by a partial deletion of a subtelomeric repeat array on chromosome 4q. Earlier, we demonstrated the existence in the vicinity of the D4Z4 repeat of a nuclear matrix attachment site, FR-MAR, efficient in no...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6620908
更新日期:2008-01-01 00:00:00
abstract::Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse co...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.078303.108
更新日期:2008-10-01 00:00:00
abstract::DNA transposons, or class 2 transposable elements, have successfully propagated in a wide variety of genomes. However, it is widely believed that DNA transposon activity has ceased in mammalian genomes for at least the last 40 million years. We recently reported evidence for the relatively recent activity of hAT and H...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.071886.107
更新日期:2008-05-01 00:00:00
abstract::We have developed a mutation-scanning approach suitable for whole population screening for unknown mutations. The method, meltMADGE, combines thermal ramp electrophoresis with MADGE to achieve suitable cost efficiency and throughput. The sensitivity was tested in blind trials using 54 amplicons representing the BRCA1 ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.3313405
更新日期:2005-07-01 00:00:00
abstract::Integrating the genotype with epigenetic marks holds the promise of better understanding the biology that underlies the complex interactions of inherited and environmental components that define the developmental origins of a range of disorders. The quality of the in utero environment significantly influences health o...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.171439.113
更新日期:2014-07-01 00:00:00
abstract::High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2(20) concentration ra...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.121095.111
更新日期:2011-09-01 00:00:00
abstract::The initial comparison of the human and chimpanzee genome sequences revealed 16 genomic regions with an unusually high density of rapidly evolving genes. One such region is the whey acidic protein (WAP) four-disulfide core domain locus (or WFDC locus), which contains 14 WFDC genes organized in two subloci on human chr...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6004607
更新日期:2007-03-01 00:00:00
abstract::Forty-three yeast artificial chromosomes (YACs) from the X chromosome have been overlapped across the 4-Mb Xq21.3 region, which is homologous to a segment in Yp11.1. The region is formatted to 60-kb resolution with 57 STSs and is merged at its edges with contigs specific for X. This allows a direct comparison of marke...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.7.4.307
更新日期:1997-04-01 00:00:00
abstract::Designing effective and accurate tools for identifying the functional and structural elements in a genome remains at the frontier of genome annotation owing to incompleteness and inaccuracy of the data, limitations in the computational models, and shifting paradigms in genomics, such as alternative splicing. We presen...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.2889405
更新日期:2005-01-01 00:00:00
abstract::Chronic neuropathic pain is affected by specifics of the precipitating neural pathology, psychosocial factors, and by genetic predisposition. Little is known about the identity of predisposing genes. Using an integrative approach, we discovered that CACNG2 significantly affects susceptibility to chronic pain following...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.104976.110
更新日期:2010-09-01 00:00:00
abstract::We assessed the content, structure, and distribution of segmental duplications (> or =90% sequence identity, > or =5 kb length) within the published version of the Rattus norvegicus genome assembly (v.3.1). The overall fraction of duplicated sequence within the rat assembly (2.92%) is greater than that of the mouse (1...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.1907504
更新日期:2004-04-01 00:00:00
abstract::Double minutes (dmin) and homogeneously staining regions (hsr) are the cytogenetic hallmarks of genomic amplification in cancer. Different mechanisms have been proposed to explain their genesis. Recently, our group showed that the MYC-containing dmin in leukemia cases arise by excision and amplification (episome model...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.106252.110
更新日期:2010-09-01 00:00:00
abstract::Despite the availability of dozens of animal genome sequences, two key questions remain unanswered: First, what fraction of any species' genome confers biological function, and second, are apparent differences in organismal complexity reflected in an objective measure of genomic complexity? Here, we address both quest...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.108795.110
更新日期:2010-10-01 00:00:00
abstract::Short tandem repeats (STRs) have a wide range of applications, including medical genetics, forensics, and genetic genealogy. High-throughput sequencing (HTS) has the potential to profile hundreds of thousands of STR loci. However, mainstream bioinformatics pipelines are inadequate for the task. These pipelines treat S...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.135780.111
更新日期:2012-06-01 00:00:00
abstract::The centromere is the structural unit responsible for the faithful segregation of chromosomes. Although regulation of centromeric function by epigenetic factors has been well-studied, the contributions of the underlying DNA sequences have been much less well defined, and existing methodologies for studying centromere ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.219709.116
更新日期:2017-12-01 00:00:00
abstract::Essential genes refer to those whose null mutation leads to lethality or sterility. Theoretical reasoning and empirical data both suggest that the fatal effect of inactivating an essential gene can be attributed to either the loss of indispensable core cellular function (Type I), or the gain of fatal side effects afte...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.205955.116
更新日期:2016-10-01 00:00:00
abstract::It is believed that most modern mammalian lineages arose from a series of rapid speciation events near the Cretaceous-Tertiary boundary. It is shown that such a phylogeny makes the common ancestral genome sequence an ideal target for reconstruction. Simulations suggest that with methods currently available, we can exp...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.2800104
更新日期:2004-12-01 00:00:00
abstract::How pathogens evolve their virulence to humans in nature is a scientific issue of great medical and biological importance. Shiga toxin (Stx)-producing Escherichia coli (STEC) and enteropathogenic E. coli (EPEC) are the major foodborne pathogens that can cause hemolytic uremic syndrome and infantile diarrhea, respectiv...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.249268.119
更新日期:2019-09-01 00:00:00
abstract::A contiguous high-resolution map of 44 loci from a 35-Mb portion of the distal region of the long arm of human chromosome 5, q21-q35, was produced using radiation hybrid (RH) mapping in conjunction with a natural deletion mapping panel. The map includes 30 genes, four sequence-tagged site (STS) loci, and 10 DNA marker...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6.7.628
更新日期:1996-07-01 00:00:00
abstract::In addition to mediating sister chromatid cohesion during the cell cycle, the cohesin complex associates with CTCF and with active gene regulatory elements to form long-range interactions between its binding sites. Genome-wide chromosome conformation capture had shown that cohesin's main role in interphase genome orga...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.184986.114
更新日期:2015-04-01 00:00:00