Principled multi-omic analysis reveals gene regulatory mechanisms of phenotype variation.

Abstract:

Recent studies have analyzed large-scale data sets of gene expression to identify genes associated with interindividual variation in phenotypes ranging from cancer subtypes to drug sensitivity, promising new avenues of research in personalized medicine. However, gene expression data alone is limited in its ability to reveal cis-regulatory mechanisms underlying phenotypic differences. In this study, we develop a new probabilistic model, called pGENMi, that integrates multi-omic data to investigate the transcriptional regulatory mechanisms underlying interindividual variation of a specific phenotype, namely cell line response to cytotoxic treatment. In particular, pGENMi simultaneously analyzes genotype, DNA methylation, gene expression, and transcription factor (TF)-DNA binding data, along with phenotypic measurements, to identify TFs regulating the phenotype. It does so by combining statistical information about expression quantitative trait loci (eQTLs) and expression-correlated methylation marks (eQTMs) located within TF binding sites, as well as observed correlations between gene expression and phenotype variation. Application of pGENMi to data from a panel of lymphoblastoid cell lines treated with 24 drugs, in conjunction with ENCODE TF ChIP data, yielded a number of known as well as novel (TF, Drug) associations. Experimental validations by TF knockdown confirmed 41% of the predicted and tested associations, compared to a 12% confirmation rate of tested nonassociations (controls). An extensive literature survey also corroborated 62% of the predicted associations above a stringent threshold. Moreover, associations predicted only when combining eQTL and eQTM data showed higher precision compared to an eQTL-only or eQTM-only analysis using pGENMi, further demonstrating the value of multi-omic integrative analysis.

journal_name

Genome Res

journal_title

Genome research

authors

Hanson C,Cairns J,Wang L,Sinha S

doi

10.1101/gr.227066.117

subject

Has Abstract

pub_date

2018-08-01 00:00:00

pages

1207-1216

issue

8

eissn

1549-5469

issn

1088-9051

pii

gr.227066.117

journal_volume

28

pub_type

Journal Article
  • Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale.

    abstract::Many thousands of proteins encoded by the genome of Plasmodium falciparum, the causal organism of the deadliest form of human malaria, are of unknown function. It is of utmost importance that these proteins be characterized if we are to develop combative strategies against malaria based on the biology of the parasite....

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.4573206

    authors: Date SV,Stoeckert CJ Jr

    update_date:2006-04-01 00:00:00

  • Transposon expression in the Drosophila brain is driven by neighboring genes and diversifies the neural transcriptome.

    abstract::Somatic transposon expression in neural tissue is commonly considered as a measure of mobilization and has therefore been linked to neuropathology and organismal individuality. We combined genome sequencing data with single-cell mRNA sequencing of the same inbred fly strain to map transposon expression in the Drosophi...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.259200.119

    authors: Treiber CD,Waddell S

    update_date:2020-11-01 00:00:00

  • Arboretum: reconstruction and analysis of the evolutionary history of condition-specific transcriptional modules.

    abstract::Comparative functional genomics studies the evolution of biological processes by analyzing functional data, such as gene expression profiles, across species. A major challenge is to compare profiles collected in a complex phylogeny. Here, we present Arboretum, a novel scalable computational algorithm that integrates e...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.146233.112

    authors: Roy S,Wapinski I,Pfiffner J,French C,Socha A,Konieczka J,Habib N,Kellis M,Thompson D,Regev A

    update_date:2013-06-01 00:00:00

  • Copy number variation at the breakpoint region of isochromosome 17q.

    abstract::Isochromosome 17q, or i(17q), is one of the most frequent nonrandom changes occurring in human neoplasia. Most of the i(17q) breakpoints cluster within an approximately 240-kb interval located in the Smith-Magenis syndrome common deletion region in 17p11.2. The breakpoint cluster region is characterized by a complex ar...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.080697.108

    authors: Carvalho CM,Lupski JR

    update_date:2008-11-01 00:00:00

  • Widespread plasticity in CTCF occupancy linked to DNA methylation.

    abstract::CTCF is a ubiquitously expressed regulator of fundamental genomic processes including transcription, intra- and interchromosomal interactions, and chromatin structure. Because of its critical role in genome function, CTCF binding patterns have long been assumed to be largely invariant across different cellular environ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.136101.111

    authors: Wang H,Maurano MT,Qu H,Varley KE,Gertz J,Pauli F,Lee K,Canfield T,Weaver M,Sandstrom R,Thurman RE,Kaul R,Myers RM,Stamatoyannopoulos JA

    update_date:2012-09-01 00:00:00

  • Genome-wide patterns of natural variation reveal strong selective sweeps and ongoing genomic conflict in Drosophila mauritiana.

    abstract::Although it is well understood that selection shapes the polymorphism pattern in Drosophila, signatures of classic selective sweeps are scarce. Here, we focus on Drosophila mauritiana, an island endemic, which is closely related to Drosophila melanogaster. Based on a new, annotated genome sequence, we characterized th...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.139873.112

    authors: Nolte V,Pandey RV,Kofler R,Schlötterer C

    update_date:2013-01-01 00:00:00

  • Preference of DNA methyltransferases for CpG islands in mouse embryonic stem cells.

    abstract::Many CpG islands have tissue-dependent and differentially methylated regions (T-DMRs) in normal cells and tissues. To elucidate how DNA methyltransferases (Dnmts) participate in methylation of the genomic components, we investigated the genome-wide DNA methylation pattern of the T-DMRs with Dnmt1-, Dnmt3a-, and/or Dnm...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.2431504

    authors: Hattori N,Abe T,Hattori N,Suzuki M,Matsuyama T,Yoshida S,Li E,Shiota K

    update_date:2004-09-01 00:00:00

  • Reconstructing large regions of an ancestral mammalian genome in silico.

    abstract::It is believed that most modern mammalian lineages arose from a series of rapid speciation events near the Cretaceous-Tertiary boundary. It is shown that such a phylogeny makes the common ancestral genome sequence an ideal target for reconstruction. Simulations suggest that with methods currently available, we can exp...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.2800104

    authors: Blanchette M,Green ED,Miller W,Haussler D

    update_date:2004-12-01 00:00:00

  • A network of transcriptionally coordinated functional modules in Saccharomyces cerevisiae.

    abstract::Recent computational and experimental work suggests that functional modules underlie much of cellular physiology and are a useful unit of cellular organization from the perspective of systems biology. Because interactions among modules can give rise to higher-level properties that are essential to cellular function, a...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3847105

    authors: Petti AA,Church GM

    update_date:2005-09-01 00:00:00

  • Copy number variation at the 7q11.23 segmental duplications is a susceptibility factor for the Williams-Beuren syndrome deletion.

    abstract::Large copy number variants (CNVs) have been recently found as structural polymorphisms of the human genome of still unknown biological significance. CNVs are significantly enriched in regions with segmental duplications or low-copy repeats (LCRs). Williams-Beuren syndrome (WBS) is a neurodevelopmental disorder caused ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.073197.107

    authors: Cuscó I,Corominas R,Bayés M,Flores R,Rivera-Brugués N,Campuzano V,Pérez-Jurado LA

    update_date:2008-05-01 00:00:00

  • Chromosomal instability mediated by non-B DNA: cruciform conformation and not DNA sequence is responsible for recurrent translocation in humans.

    abstract::Chromosomal aberrations have been thought to be random events. However, recent findings introduce a new paradigm in which certain DNA segments have the potential to adopt unusual conformations that lead to genomic instability and nonrandom chromosomal rearrangement. One of the best-studied examples is the palindromic ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.079244.108

    authors: Inagaki H,Ohye T,Kogo H,Kato T,Bolor H,Taniguchi M,Shaikh TH,Emanuel BS,Kurahashi H

    update_date:2009-02-01 00:00:00

  • Discovery of regulatory elements by a computational method for phylogenetic footprinting.

    abstract::Phylogenetic footprinting is a method for the discovery of regulatory elements in a set of orthologous regulatory regions from multiple species. It does so by identifying the best conserved motifs in those orthologous regions. We describe a computer algorithm designed specifically for this purpose, making use of the p...

    journal_title:Genome research

    pub_type: Letter

    doi:10.1101/gr.6902

    authors: Blanchette M,Tompa M

    update_date:2002-05-01 00:00:00

  • Long noncoding RNAs in C. elegans.

    abstract::Thousands of long noncoding RNAs (lncRNAs) have been found in vertebrate animals, a few of which have known biological roles. To better understand the genomics and features of lncRNAs in invertebrates, we used available RNA-seq, poly(A)-site, and ribosome-mapping data to identify lncRNAs of Caenorhabditis elegans. We ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.140475.112

    authors: Nam JW,Bartel DP

    update_date:2012-12-01 00:00:00

  • Identification and analysis of internal promoters in Caenorhabditis elegans operons.

    abstract::The current Caenorhabditis elegans genomic annotation has many genes organized in operons. Using directionally stitched promoterGFP methodology, we have conducted the largest survey to date on the regulatory regions of annotated C. elegans operons and identified 65, over 25% of those studied, with internal promoters. ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.6824707

    authors: Huang P,Pleasance ED,Maydan JS,Hunt-Newbury R,O'Neil NJ,Mah A,Baillie DL,Marra MA,Moerman DG,Jones SJ

    update_date:2007-10-01 00:00:00

  • Synthetic spike-in standards for RNA-seq experiments.

    abstract::High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2^20 concentration ra...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.121095.111

    authors: Jiang L,Schlesinger F,Davis CA,Zhang Y,Li R,Salit M,Gingeras TR,Oliver B

    update_date:2011-09-01 00:00:00

  • A matter of life or death: how microsatellites emerge in and vanish from the human genome.

    abstract::Microsatellites--tandem repeats of short DNA motifs--are abundant in the human genome and have high mutation rates. While microsatellite instability is implicated in numerous genetic diseases, the molecular processes involved in their emergence and disappearance are still not well understood. Microsatellites are hypot...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.122937.111

    authors: Kelkar YD,Eckert KA,Chiaromonte F,Makova KD

    update_date:2011-12-01 00:00:00

  • Allele-specific methylation is prevalent and is contributed by CpG-SNPs in the human genome.

    abstract::In diploid mammalian genomes, parental alleles can exhibit different methylation patterns (allele-specific DNA methylation, ASM), which have been documented in a small number of cases except for the imprinted regions and X chromosomes in females. We carried out a chromosome-wide survey of ASM across 16 human pluripote...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.104695.109

    authors: Shoemaker R,Deng J,Wang W,Zhang K

    update_date:2010-07-01 00:00:00

  • Comparative gene mapping: a fine-scale survey of chromosome rearrangements between ruminants and humans.

    abstract::A total of 202 genes were cytogenetically mapped to goat chromosomes, multiplying by five the total number of regional gene localizations in domestic ruminants (255). This map encompasses 249 and 173 common anchor loci regularly spaced along human and murine chromosomes, respectively, which makes it possible to perfor...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.8.9.901

    authors: Schibler L,Vaiman D,Oustry A,Giraud-Delville C,Cribiu EP

    update_date:1998-09-01 00:00:00

  • Analysis of 5' junctions of human LINE-1 and Alu retrotransposons suggests an alternative model for 5'-end attachment requiring microhomology-mediated end-joining.

    abstract::Insertion of the human non-LTR retrotransposon LINE-1 (L1) into chromosomal DNA is thought to be initiated by a mechanism called target-primed reverse transcription (TPRT). This mechanism readily accounts for the attachment of the 3'-end of an L1 copy to the genomic target, but the subsequent integration steps leading...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3421505

    authors: Zingler N,Willhoeft U,Brose HP,Schoder V,Jahns T,Hanschmann KM,Morrish TA,Löwer J,Schumann GG

    update_date:2005-06-01 00:00:00

  • Susceptibility to chronic pain following nerve injury is genetically affected by CACNG2.

    abstract::Chronic neuropathic pain is affected by specifics of the precipitating neural pathology, psychosocial factors, and by genetic predisposition. Little is known about the identity of predisposing genes. Using an integrative approach, we discovered that CACNG2 significantly affects susceptibility to chronic pain following...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.104976.110

    authors: Nissenbaum J,Devor M,Seltzer Z,Gebauer M,Michaelis M,Tal M,Dorfman R,Abitbul-Yarkoni M,Lu Y,Elahipanah T,delCanho S,Minert A,Fried K,Persson AK,Shpigler H,Shabo E,Yakir B,Pisanté A,Darvasi A

    update_date:2010-09-01 00:00:00

  • PipMaker--a web server for aligning two genomic DNA sequences.

    abstract::PipMaker (http://bio.cse.psu.edu) is a World-Wide Web site for comparing two long DNA sequences to identify conserved segments and for producing informative, high-resolution displays of the resulting alignments. One display is a percent identity plot (pip), which shows both the position in one sequence and the degree ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.10.4.577

    authors: Schwartz S,Zhang Z,Frazer KA,Smit A,Riemer C,Bouck J,Gibbs R,Hardison R,Miller W

    update_date:2000-04-01 00:00:00

  • A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles.

    abstract::An important aspect of understanding a biological pathway is to delineate the transcriptional regulatory mechanisms of the genes involved. Two important tasks are often encountered when studying transcription regulation, i.e., (1) the identification of common transcriptional regulators of a set of coexpressed genes; (...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.4303406

    authors: Chang LW,Nagarajan R,Magee JA,Milbrandt J,Stormo GD

    update_date:2006-03-01 00:00:00

  • Random mutagenesis of proximal mouse chromosome 5 uncovers predominantly embryonic lethal mutations.

    abstract::A region-specific ENU mutagenesis screen was conducted to elucidate the functional content of proximal mouse Chr 5. We used the visibly marked, recessive, lethal inversion Rump White (Rw) as a balancer in a three-generation breeding scheme to identify recessive mutations within the approximately 50 megabases spanned b...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3826505

    authors: Wilson L,Ching YH,Farias M,Hartford SA,Howell G,Shao H,Bucan M,Schimenti JC

    update_date:2005-08-01 00:00:00

  • Gene loss and movement in the maize genome.

    abstract::Maize (Zea mays L. ssp. mays), one of the most important agricultural crops in the world, originated by hybridization of two closely related progenitors. To investigate the fate of its genes after tetraploidization, we analyzed the sequence of five duplicated regions from different chromosomal locations. We also compa...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.2701104

    authors: Lai J,Ma J,Swigonová Z,Ramakrishna W,Linton E,Llaca V,Tanyolac B,Park YJ,Jeong OY,Bennetzen JL,Messing J

    update_date:2004-10-01 00:00:00

  • A tale of two templates: automatically resolving double traces has many applications, including efficient PCR-based elucidation of alternative splices.

    abstract::Trace Recalling is a novel method for deconvoluting double traces that result from simultaneously sequencing two DNA templates. Trace Recalling identifies up to two bases at each position of such a trace. The resulting ambiguity sequence is aligned to the genome, identifying one template sequence. A second template se...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.5661407

    authors: Tenney AE,Wu JQ,Langton L,Klueh P,Quatrano R,Brent MR

    update_date:2007-02-01 00:00:00

  • Evaluation of predicted network modules in yeast metabolism using NMR-based metabolite profiling.

    abstract::Genome-scale metabolic models promise important insights into cell function. However, the definition of pathways and functional network modules within these models, and in the biochemical literature in general, is often based on intuitive reasoning. Although mathematical methods have been proposed to identify modules,...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.5662207

    authors: Bundy JG,Papp B,Harmston R,Browne RA,Clayson EM,Burton N,Reece RJ,Oliver SG,Brindle KM

    update_date:2007-04-01 00:00:00

  • Time series community genomics analysis reveals rapid shifts in bacterial species, strains, and phage during infant gut colonization.

    abstract::The gastrointestinal microbiome undergoes shifts in species and strain abundances, yet dynamics involving closely related microorganisms remain largely unknown because most methods cannot resolve them. We developed new metagenomic methods and utilized them to track species and strain level variations in microbial comm...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.142315.112

    authors: Sharon I,Morowitz MJ,Thomas BC,Costello EK,Relman DA,Banfield JF

    update_date:2013-01-01 00:00:00

  • Distal CpG islands can serve as alternative promoters to transcribe genes with silenced proximal promoters.

    abstract::DNA methylation at the promoter of a gene is presumed to render it silent, yet a sizable fraction of genes with methylated proximal promoters exhibit elevated expression. Here, we show, through extensive analysis of the methylome and transcriptome in 34 tissues, that in many such cases, transcription is initiated by a...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.212050.116

    authors: Sarda S,Das A,Vinson C,Hannenhalli S

    update_date:2017-04-01 00:00:00

  • High-throughput plasmid purification for capillary sequencing.

    abstract::The need for expeditious and inexpensive methods for high-throughput DNA sequencing has been highlighted by the accelerated pace of genome DNA sequencing over the past year. At the Joint Genome Institute, the throughput in terms of high-quality bases per day has increased over 20-fold during the past 18 mo, reaching a...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.167801

    authors: Elkin CJ,Richardson PM,Fourcade HM,Hammon NM,Pollard MJ,Predki PF,Glavina T,Hawkins TL

    update_date:2001-07-01 00:00:00

  • Characterization and dynamics of pericentromere-associated domains in mice.

    abstract::Despite recent progress in genome topology knowledge, the role of repeats, which make up the majority of mammalian genomes, remains elusive. Satellite repeats are highly abundant sequences that cluster around centromeres, attract pericentromeric heterochromatin, and aggregate into nuclear chromocenters. These nuclear ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.186643.114

    authors: Wijchers PJ,Geeven G,Eyres M,Bergsma AJ,Janssen M,Verstegen M,Zhu Y,Schell Y,Vermeulen C,de Wit E,de Laat W

    update_date:2015-07-01 00:00:00