Systematic interrogation of human promoters.

Abstract:

:Despite much research, our understanding of the architecture and cis-regulatory elements of human promoters is still lacking. Here, we devised a high-throughput assay to quantify the activity of approximately 15,000 fully designed sequences that we integrated and expressed from a fixed location within the human genome. We used this method to investigate thousands of native promoters and preinitiation complex (PIC) binding regions followed by in-depth characterization of the sequence motifs underlying promoter activity, including core promoter elements and TF binding sites. We find that core promoters drive transcription mostly unidirectionally and that sequences originating from promoters exhibit stronger activity than those originating from enhancers. By testing multiple synthetic configurations of core promoter elements, we dissect the motifs that positively and negatively regulate transcription as well as the effect of their combinations and distances, including a 10-bp periodicity in the optimal distance between the TATA and the initiator. By comprehensively screening 133 TF binding sites, we find that in contrast to core promoters, TF binding sites maintain similar activity levels in both orientations, supporting a model by which divergent transcription is driven by two distinct unidirectional core promoters sharing bidirectional TF binding sites. Finally, we find a striking agreement between the effect of binding site multiplicity of individual TFs in our assay and their tendency to appear in homotypic clusters throughout the genome. Overall, our study systematically assays the elements that drive expression in core and proximal promoter regions and sheds light on organization principles of regulatory regions in the human genome.

journal_name

Genome Res

journal_title

Genome research

authors

Weingarten-Gabbay S,Nir R,Lubliner S,Sharon E,Kalma Y,Weinberger A,Segal E

doi

10.1101/gr.236075.118

subject

Has Abstract

pub_date

2019-02-01 00:00:00

pages

171-183

issue

2

eissn

1088-9051

issn

1549-5469

pii

gr.236075.118

journal_volume

29

pub_type

杂志文章
  • Somatic rearrangements across cancer reveal classes of samples with distinct patterns of DNA breakage and rearrangement-induced hypermutability.

    abstract::Whole-genome sequencing using massively parallel sequencing technologies enables accurate detection of somatic rearrangements in cancer. Pinpointing large numbers of rearrangement breakpoints to base-pair resolution allows analysis of rearrangement microhomology and genomic location for every sample. Here we analyze 9...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.141382.112

    authors: Drier Y,Lawrence MS,Carter SL,Stewart C,Gabriel SB,Lander ES,Meyerson M,Beroukhim R,Getz G

    更新日期:2013-02-01 00:00:00

  • A platform for curated products from novel open reading frames prompts reinterpretation of disease variants.

    abstract::Recent evidence from proteomics and deep massively parallel sequencing studies have revealed that eukaryotic genomes contain substantial numbers of as-yet-uncharacterized open reading frames (ORFs). We define these uncharacterized ORFs as novel ORFs (nORFs). nORFs in humans are mostly under 100 codons and are found in...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.263202.120

    authors: Neville MDC,Kohze R,Erady C,Meena N,Hayden M,Cooper DN,Mort M,Prabakaran S

    更新日期:2021-01-19 00:00:00

  • Global analysis of Drosophila Cys₂-His₂ zinc finger proteins reveals a multitude of novel recognition motifs and binding determinants.

    abstract::Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have charac...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.151472.112

    authors: Enuameh MS,Asriyan Y,Richards A,Christensen RG,Hall VL,Kazemian M,Zhu C,Pham H,Cheng Q,Blatti C,Brasefield JA,Basciotta MD,Ou J,McNulty JC,Zhu LJ,Celniker SE,Sinha S,Stormo GD,Brodsky MH,Wolfe SA

    更新日期:2013-06-01 00:00:00

  • Integrated single-cell genetic and transcriptional analysis suggests novel drivers of chronic lymphocytic leukemia.

    abstract::Intra-tumoral genetic heterogeneity has been characterized across cancers by genome sequencing of bulk tumors, including chronic lymphocytic leukemia (CLL). In order to more accurately identify subclones, define phylogenetic relationships, and probe genotype-phenotype relationships, we developed methods for targeted m...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.217331.116

    authors: Wang L,Fan J,Francis JM,Georghiou G,Hergert S,Li S,Gambe R,Zhou CW,Yang C,Xiao S,Cin PD,Bowden M,Kotliar D,Shukla SA,Brown JR,Neuberg D,Alessi DR,Zhang CZ,Kharchenko PV,Livak KJ,Wu CJ

    更新日期:2017-08-01 00:00:00

  • Identification and analysis of internal promoters in Caenorhabditis elegans operons.

    abstract::The current Caenorhabditis elegans genomic annotation has many genes organized in operons. Using directionally stitched promoterGFP methodology, we have conducted the largest survey to date on the regulatory regions of annotated C. elegans operons and identified 65, over 25% of those studied, with internal promoters. ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6824707

    authors: Huang P,Pleasance ED,Maydan JS,Hunt-Newbury R,O'Neil NJ,Mah A,Baillie DL,Marra MA,Moerman DG,Jones SJ

    更新日期:2007-10-01 00:00:00

  • Characterization of Alu repeats that are associated with trinucleotide and tetranucleotide repeat microsatellites.

    abstract::The association of subclasses of Alu repetitive elements with various classes of trinucleotide and tetranucleotide microsatellites was characterized as a first step toward advancing our understanding of the evolution of microsatellite repeats. In addition, information regarding the association of specific classes of m...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.7.716

    authors: Yandava CN,Gastier JM,Pulido JC,Brody T,Sheffield V,Murray J,Buetow K,Duyk GM

    更新日期:1997-07-01 00:00:00

  • Discovery of regulatory elements by a computational method for phylogenetic footprinting.

    abstract::Phylogenetic footprinting is a method for the discovery of regulatory elements in a set of orthologous regulatory regions from multiple species. It does so by identifying the best conserved motifs in those orthologous regions. We describe a computer algorithm designed specifically for this purpose, making use of the p...

    journal_title:Genome research

    pub_type: 信件

    doi:10.1101/gr.6902

    authors: Blanchette M,Tompa M

    更新日期:2002-05-01 00:00:00

  • Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee.

    abstract::To accelerate the molecular analysis of behavior in the honey bee (Apis mellifera), we created expressed sequence tag (EST) and cDNA microarray resources for the bee brain. Over 20,000 cDNA clones were partially sequenced from a normalized (and subsequently subtracted) library generated from adult A. mellifera brains....

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5302

    authors: Whitfield CW,Band MR,Bonaldo MF,Kumar CG,Liu L,Pardinas JR,Robertson HM,Soares MB,Robinson GE

    更新日期:2002-04-01 00:00:00

  • Coevolution within a transcriptional network by compensatory trans and cis mutations.

    abstract::Transcriptional networks have been shown to evolve very rapidly, prompting questions as to how such changes arise and are tolerated. Recent comparisons of transcriptional networks across species have implicated variations in the cis-acting DNA sequences near genes as the main cause of divergence. What is less clear is...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.111765.110

    authors: Kuo D,Licon K,Bandyopadhyay S,Chuang R,Luo C,Catalana J,Ravasi T,Tan K,Ideker T

    更新日期:2010-12-01 00:00:00

  • Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases.

    abstract::Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of inter...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.136739.111

    authors: Schadt EE,Banerjee O,Fang G,Feng Z,Wong WH,Zhang X,Kislyuk A,Clark TA,Luong K,Keren-Paz A,Chess A,Kumar V,Chen-Plotkin A,Sondheimer N,Korlach J,Kasarskis A

    更新日期:2013-01-01 00:00:00

  • A nuclear matrix attachment site in the 4q35 locus has an enhancer-blocking activity in vivo: implications for the facio-scapulo-humeral dystrophy.

    abstract::Facio-scapulo-humeral dystrophy (FSHD), a muscular hereditary disease with a prevalence of 1 in 20,000, is caused by a partial deletion of a subtelomeric repeat array on chromosome 4q. Earlier, we demonstrated the existence in the vicinity of the D4Z4 repeat of a nuclear matrix attachment site, FR-MAR, efficient in no...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6620908

    authors: Petrov A,Allinne J,Pirozhkova I,Laoudj D,Lipinski M,Vassetzky YS

    更新日期:2008-01-01 00:00:00

  • Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties.

    abstract::Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse co...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.078303.108

    authors: Bacolla A,Larson JE,Collins JR,Li J,Milosavljevic A,Stenson PD,Cooper DN,Wells RD

    更新日期:2008-10-01 00:00:00

  • Multiple waves of recent DNA transposon activity in the bat, Myotis lucifugus.

    abstract::DNA transposons, or class 2 transposable elements, have successfully propagated in a wide variety of genomes. However, it is widely believed that DNA transposon activity has ceased in mammalian genomes for at least the last 40 million years. We recently reported evidence for the relatively recent activity of hAT and H...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.071886.107

    authors: Ray DA,Feschotte C,Pagan HJ,Smith JD,Pritham EJ,Arensburger P,Atkinson PW,Craig NL

    更新日期:2008-05-01 00:00:00

  • Mutation scanning by meltMADGE: validations using BRCA1 and LDLR, and demonstration of the potential to identify severe, moderate, silent, rare, and paucimorphic mutations in the general population.

    abstract::We have developed a mutation-scanning approach suitable for whole population screening for unknown mutations. The method, meltMADGE, combines thermal ramp electrophoresis with MADGE to achieve suitable cost efficiency and throughput. The sensitivity was tested in blind trials using 54 amplicons representing the BRCA1 ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3313405

    authors: Alharbi KK,Aldahmesh MA,Spanakis E,Haddad L,Whittall RA,Chen XH,Rassoulian H,Smith MJ,Sillibourne J,Ball NJ,Graham NJ,Briggs PJ,Simpson IA,Phillips DI,Lawlor DA,Ye S,Humphries SE,Cooper C,Smith GD,Ebrahim S,Eccles

    更新日期:2005-07-01 00:00:00

  • The effect of genotype and in utero environment on interindividual variation in neonate DNA methylomes.

    abstract::Integrating the genotype with epigenetic marks holds the promise of better understanding the biology that underlies the complex interactions of inherited and environmental components that define the developmental origins of a range of disorders. The quality of the in utero environment significantly influences health o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.171439.113

    authors: Teh AL,Pan H,Chen L,Ong ML,Dogra S,Wong J,MacIsaac JL,Mah SM,McEwen LM,Saw SM,Godfrey KM,Chong YS,Kwek K,Kwoh CK,Soh SE,Chong MF,Barton S,Karnani N,Cheong CY,Buschdorf JP,Stünkel W,Kobor MS,Meaney MJ,Gluckma

    更新日期:2014-07-01 00:00:00

  • Synthetic spike-in standards for RNA-seq experiments.

    abstract::High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2(20) concentration ra...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.121095.111

    authors: Jiang L,Schlesinger F,Davis CA,Zhang Y,Li R,Salit M,Gingeras TR,Oliver B

    更新日期:2011-09-01 00:00:00

  • Comparative sequence analyses reveal rapid and divergent evolutionary changes of the WFDC locus in the primate lineage.

    abstract::The initial comparison of the human and chimpanzee genome sequences revealed 16 genomic regions with an unusually high density of rapidly evolving genes. One such region is the whey acidic protein (WAP) four-disulfide core domain locus (or WFDC locus), which contains 14 WFDC genes organized in two subloci on human chr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6004607

    authors: Hurle B,Swanson W,NISC Comparative Sequencing Program.,Green ED

    更新日期:2007-03-01 00:00:00

  • Evolutionary features of the 4-Mb Xq21.3 XY homology region revealed by a map at 60-kb resolution.

    abstract::Forty-three yeast artificial chromosomes (YACs) from the X chromosome have been overlapped across the 4-Mb Xq21.3 region, which is homologous to a segment in Yp11.1. The region is formatted to 60-kb resolution with 57 STSs and is merged at its edges with contigs specific for X. This allows a direct comparison of marke...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.4.307

    authors: Mumm S,Molini B,Terrell J,Srivastava A,Schlessinger D

    更新日期:1997-04-01 00:00:00

  • Gene and alternative splicing annotation with AIR.

    abstract::Designing effective and accurate tools for identifying the functional and structural elements in a genome remains at the frontier of genome annotation owing to incompleteness and inaccuracy of the data, limitations in the computational models, and shifting paradigms in genomics, such as alternative splicing. We presen...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2889405

    authors: Florea L,Di Francesco V,Miller J,Turner R,Yao A,Harris M,Walenz B,Mobarry C,Merkulov GV,Charlab R,Dew I,Deng Z,Istrail S,Li P,Sutton G

    更新日期:2005-01-01 00:00:00

  • Susceptibility to chronic pain following nerve injury is genetically affected by CACNG2.

    abstract::Chronic neuropathic pain is affected by specifics of the precipitating neural pathology, psychosocial factors, and by genetic predisposition. Little is known about the identity of predisposing genes. Using an integrative approach, we discovered that CACNG2 significantly affects susceptibility to chronic pain following...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.104976.110

    authors: Nissenbaum J,Devor M,Seltzer Z,Gebauer M,Michaelis M,Tal M,Dorfman R,Abitbul-Yarkoni M,Lu Y,Elahipanah T,delCanho S,Minert A,Fried K,Persson AK,Shpigler H,Shabo E,Yakir B,Pisanté A,Darvasi A

    更新日期:2010-09-01 00:00:00

  • Recent segmental duplications in the working draft assembly of the brown Norway rat.

    abstract::We assessed the content, structure, and distribution of segmental duplications (> or =90% sequence identity, > or =5 kb length) within the published version of the Rattus norvegicus genome assembly (v.3.1). The overall fraction of duplicated sequence within the rat assembly (2.92%) is greater than that of the mouse (1...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1907504

    authors: Tuzun E,Bailey JA,Eichler EE

    更新日期:2004-04-01 00:00:00

  • Gene amplification as double minutes or homogeneously staining regions in solid tumors: origin and structure.

    abstract::Double minutes (dmin) and homogeneously staining regions (hsr) are the cytogenetic hallmarks of genomic amplification in cancer. Different mechanisms have been proposed to explain their genesis. Recently, our group showed that the MYC-containing dmin in leukemia cases arise by excision and amplification (episome model...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106252.110

    authors: Storlazzi CT,Lonoce A,Guastadisegni MC,Trombetta D,D'Addabbo P,Daniele G,L'Abbate A,Macchia G,Surace C,Kok K,Ullmann R,Purgato S,Palumbo O,Carella M,Ambros PF,Rocchi M

    更新日期:2010-09-01 00:00:00

  • Massive turnover of functional sequence in human and other mammalian genomes.

    abstract::Despite the availability of dozens of animal genome sequences, two key questions remain unanswered: First, what fraction of any species' genome confers biological function, and second, are apparent differences in organismal complexity reflected in an objective measure of genomic complexity? Here, we address both quest...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.108795.110

    authors: Meader S,Ponting CP,Lunter G

    更新日期:2010-10-01 00:00:00

  • lobSTR: A short tandem repeat profiler for personal genomes.

    abstract::Short tandem repeats (STRs) have a wide range of applications, including medical genetics, forensics, and genetic genealogy. High-throughput sequencing (HTS) has the potential to profile hundreds of thousands of STR loci. However, mainstream bioinformatics pipelines are inadequate for the task. These pipelines treat S...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.135780.111

    authors: Gymrek M,Golan D,Rosset S,Erlich Y

    更新日期:2012-06-01 00:00:00

  • Rapid molecular assays to study human centromere genomics.

    abstract::The centromere is the structural unit responsible for the faithful segregation of chromosomes. Although regulation of centromeric function by epigenetic factors has been well-studied, the contributions of the underlying DNA sequences have been much less well defined, and existing methodologies for studying centromere ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.219709.116

    authors: Contreras-Galindo R,Fischer S,Saha AK,Lundy JD,Cervantes PW,Mourad M,Wang C,Qian B,Dai M,Meng F,Chinnaiyan A,Omenn GS,Kaplan MH,Markovitz DM

    更新日期:2017-12-01 00:00:00

  • The nonessentiality of essential genes in yeast provides therapeutic insights into a human disease.

    abstract::Essential genes refer to those whose null mutation leads to lethality or sterility. Theoretical reasoning and empirical data both suggest that the fatal effect of inactivating an essential gene can be attributed to either the loss of indispensable core cellular function (Type I), or the gain of fatal side effects afte...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.205955.116

    authors: Chen P,Wang D,Chen H,Zhou Z,He X

    更新日期:2016-10-01 00:00:00

  • Reconstructing large regions of an ancestral mammalian genome in silico.

    abstract::It is believed that most modern mammalian lineages arose from a series of rapid speciation events near the Cretaceous-Tertiary boundary. It is shown that such a phylogeny makes the common ancestral genome sequence an ideal target for reconstruction. Simulations suggest that with methods currently available, we can exp...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2800104

    authors: Blanchette M,Green ED,Miller W,Haussler D

    更新日期:2004-12-01 00:00:00

  • Large-scale genome analysis of bovine commensal Escherichia coli reveals that bovine-adapted E. coli lineages are serving as evolutionary sources of the emergence of human intestinal pathogenic strains.

    abstract::How pathogens evolve their virulence to humans in nature is a scientific issue of great medical and biological importance. Shiga toxin (Stx)-producing Escherichia coli (STEC) and enteropathogenic E. coli (EPEC) are the major foodborne pathogens that can cause hemolytic uremic syndrome and infantile diarrhea, respectiv...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.249268.119

    authors: Arimizu Y,Kirino Y,Sato MP,Uno K,Sato T,Gotoh Y,Auvray F,Brugere H,Oswald E,Mainil JG,Anklam KS,Döpfer D,Yoshino S,Ooka T,Tanizawa Y,Nakamura Y,Iguchi A,Morita-Ishihara T,Ohnishi M,Akashi K,Hayashi T,Ogura Y

    更新日期:2019-09-01 00:00:00

  • A contiguous high-resolution radiation hybrid map of 44 loci from the distal portion of the long arm of human chromosome 5.

    abstract::A contiguous high-resolution map of 44 loci from a 35-Mb portion of the distal region of the long arm of human chromosome 5, q21-q35, was produced using radiation hybrid (RH) mapping in conjunction with a natural deletion mapping panel. The map includes 30 genes, four sequence-tagged site (STS) loci, and 10 DNA marker...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.7.628

    authors: Warrington JA,Wasmuth JJ

    更新日期:1996-07-01 00:00:00

  • Spatial enhancer clustering and regulation of enhancer-proximal genes by cohesin.

    abstract::In addition to mediating sister chromatid cohesion during the cell cycle, the cohesin complex associates with CTCF and with active gene regulatory elements to form long-range interactions between its binding sites. Genome-wide chromosome conformation capture had shown that cohesin's main role in interphase genome orga...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.184986.114

    authors: Ing-Simmons E,Seitan VC,Faure AJ,Flicek P,Carroll T,Dekker J,Fisher AG,Lenhard B,Merkenschlager M

    更新日期:2015-04-01 00:00:00