Prioritizing candidate disease genes by network-based boosting of genome-wide association data.

Abstract:

:Network "guilt by association" (GBA) is a proven approach for identifying novel disease genes based on the observation that similar mutational phenotypes arise from functionally related genes. In principle, this approach could account even for nonadditive genetic interactions, which underlie the synergistic combinations of mutations often linked to complex diseases. Here, we analyze a large-scale, human gene functional interaction network (dubbed HumanNet). We show that candidate disease genes can be effectively identified by GBA in cross-validated tests using label propagation algorithms related to Google's PageRank. However, GBA has been shown to work poorly in genome-wide association studies (GWAS), where many genes are somewhat implicated, but few are known with very high certainty. Here, we resolve this by explicitly modeling the uncertainty of the associations and incorporating the uncertainty for the seed set into the GBA framework. We observe a significant boost in the power to detect validated candidate genes for Crohn's disease and type 2 diabetes by comparing our predictions to results from follow-up meta-analyses, with incorporation of the network serving to highlight the JAK-STAT pathway and associated adaptors GRB2/SHC1 in Crohn's disease and BACH2 in type 2 diabetes. Consideration of the network during GWAS thus conveys some of the benefits of enrolling more participants in the GWAS study. More generally, we demonstrate that a functional network of human genes provides a valuable statistical framework for prioritizing candidate disease genes, both for candidate gene-based and GWAS-based studies.

journal_name

Genome Res

journal_title

Genome research

authors

Lee I,Blom UM,Wang PI,Shim JE,Marcotte EM

doi

10.1101/gr.118992.110

subject

Has Abstract

pub_date

2011-07-01 00:00:00

pages

1109-21

issue

7

eissn

1088-9051

issn

1549-5469

pii

gr.118992.110

journal_volume

21

pub_type

杂志文章
  • End Sequence Analysis Toolkit (ESAT) expands the extractable information from single-cell RNA-seq data.

    abstract::RNA-seq protocols that focus on transcript termini are well suited for applications in which template quantity is limiting. Here we show that, when applied to end-sequencing data, analytical methods designed for global RNA-seq produce computational artifacts. To remedy this, we created the End Sequence Analysis Toolki...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.207902.116

    authors: Derr A,Yang C,Zilionis R,Sergushichev A,Blodgett DM,Redick S,Bortell R,Luban J,Harlan DM,Kadener S,Greiner DL,Klein A,Artyomov MN,Garber M

    更新日期:2016-10-01 00:00:00

  • Evolutionary conservation of Y Chromosome ampliconic gene families despite extensive structural variation.

    abstract::Despite claims that the mammalian Y Chromosome is on a path to extinction, comparative sequence analysis of primate Y Chromosomes has shown the decay of the ancestral single-copy genes has all but ceased in this eutherian lineage. The suite of single-copy Y-linked genes is highly conserved among the majority of euther...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.237586.118

    authors: Brashear WA,Raudsepp T,Murphy WJ

    更新日期:2018-12-01 00:00:00

  • Genome function and nuclear architecture: from gene expression to nanoscience.

    abstract::Biophysical, chemical, and nanoscience approaches to the study of nuclear structure and activity have been developing recently and hold considerable promise. A selection of fundamental problems in genome organization and function are reviewed and discussed in the context of these new perspectives and approaches. Advan...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:10.1101/gr.946403

    authors: O'Brien TP,Bult CJ,Cremer C,Grunze M,Knowles BB,Langowski J,McNally J,Pederson T,Politz JC,Pombo A,Schmahl G,Spatz JP,van Driel R

    更新日期:2003-06-01 00:00:00

  • PRDM9 binding organizes hotspot nucleosomes and limits Holliday junction migration.

    abstract::In mammals, genetic recombination during meiosis is limited to a set of 1- to 2-kb regions termed hotspots. Their locations are predominantly determined by the zinc finger protein PRDM9, which binds to DNA in hotspots and subsequently uses its SET domain to locally trimethylate histone H3 at lysine 4 (H3K4me3). This s...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.170167.113

    authors: Baker CL,Walker M,Kajita S,Petkov PM,Paigen K

    更新日期:2014-05-01 00:00:00

  • Characterization of Alu repeats that are associated with trinucleotide and tetranucleotide repeat microsatellites.

    abstract::The association of subclasses of Alu repetitive elements with various classes of trinucleotide and tetranucleotide microsatellites was characterized as a first step toward advancing our understanding of the evolution of microsatellite repeats. In addition, information regarding the association of specific classes of m...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.7.716

    authors: Yandava CN,Gastier JM,Pulido JC,Brody T,Sheffield V,Murray J,Buetow K,Duyk GM

    更新日期:1997-07-01 00:00:00

  • A role for palindromic structures in the cis-region of maize Sirevirus LTRs in transposable element evolution and host epigenetic response.

    abstract::Transposable elements (TEs) proliferate within the genome of their host, which responds by silencing them epigenetically. Much is known about the mechanisms of silencing in plants, particularly the role of siRNAs in guiding DNA methylation. In contrast, little is known about siRNA targeting patterns along the length o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.193763.115

    authors: Bousios A,Diez CM,Takuno S,Bystry V,Darzentas N,Gaut BS

    更新日期:2016-02-01 00:00:00

  • Systematic functional characterization of cis-regulatory motifs in human core promoters.

    abstract::A large number of cis-regulatory motifs involved in transcriptional control have been identified, but the regulatory context and biological processes in which many of them function are unknown. Here, we computationally identify the sets of human core promoters targeted by motifs, and systematically characterize their ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6828808

    authors: Sinha S,Adler AS,Field Y,Chang HY,Segal E

    更新日期:2008-03-01 00:00:00

  • Mouse population-guided resequencing reveals that variants in CD44 contribute to acetaminophen-induced liver injury in humans.

    abstract::Interindividual variability in response to chemicals and drugs is a common regulatory concern. It is assumed that xenobiotic-induced adverse reactions have a strong genetic basis, but many mechanism-based investigations have not been successful in identifying susceptible individuals. While recent advances in pharmacog...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.090241.108

    authors: Harrill AH,Watkins PB,Su S,Ross PK,Harbourt DE,Stylianou IM,Boorman GA,Russo MW,Sackler RS,Harris SC,Smith PC,Tennant R,Bogue M,Paigen K,Harris C,Contractor T,Wiltshire T,Rusyn I,Threadgill DW

    更新日期:2009-09-01 00:00:00

  • X chromosome cDNA microarray screening identifies a functional PLP2 promoter polymorphism enriched in patients with X-linked mental retardation.

    abstract::X-linked Mental Retardation (XLMR) occurs in 1 in 600 males and is highly genetically heterogeneous. We used a novel human X chromosome cDNA microarray (XCA) to survey the expression profile of X-linked genes in lymphoblasts of XLMR males. Genes with altered expression verified by Northern blot and/or quantitative PCR...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5336307

    authors: Zhang L,Jie C,Obie C,Abidi F,Schwartz CE,Stevenson RE,Valle D,Wang T

    更新日期:2007-05-01 00:00:00

  • Nature and structure of human genes that generate retropseudogenes.

    abstract::The human genome is estimated to contain 23,000 to 33,000 retropseudogenes. To study the properties of genes giving rise to these retroelements, we compared the structure and expression of genes with or without known retropseudogenes. Four main features have emerged from the analysis of 181 genes associated to retrops...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.10.5.672

    authors: Gonçalves I,Duret L,Mouchiroud D

    更新日期:2000-05-01 00:00:00

  • A scalable high-throughput chemical synthesizer.

    abstract::A machine that employs a novel reagent delivery technique for biomolecular synthesis has been developed. This machine separates the addressing of individual synthesis sites from the actual process of reagent delivery by using masks placed over the sites. Because of this separation, this machine is both cost-effective ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.359002

    authors: Livesay EA,Liu YH,Luebke KJ,Irick J,Belosludtsev Y,Rayner S,Balog R,Johnston SA

    更新日期:2002-12-01 00:00:00

  • Genetically indistinguishable SNPs and their influence on inferring the location of disease-associated variants.

    abstract::As part of a recent high-density linkage disequilibrium (LD) study of chromosome 20, we obtained genotypes for approximately 30,000 SNPs at a density of 1 SNP/2 kb on four different population samples (47 CEPH founders; 91 UK unrelateds [unrelated white individuals of western European ancestry]; 97 African Americans; ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4217605

    authors: Lawrence R,Evans DM,Morris AP,Ke X,Hunt S,Paolucci M,Ragoussis J,Deloukas P,Bentley D,Cardon LR

    更新日期:2005-11-01 00:00:00

  • Parente2: a fast and accurate method for detecting identity by descent.

    abstract::Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, ranging from demographi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.173641.114

    authors: Rodriguez JM,Bercovici S,Huang L,Frostig R,Batzoglou S

    更新日期:2015-02-01 00:00:00

  • Alterations in TCF7L2 expression define its role as a key regulator of glucose metabolism.

    abstract::Genome-wide association studies (GWAS) have consistently implicated noncoding variation within the TCF7L2 locus with type 2 diabetes (T2D) risk. While this locus represents the strongest genetic determinant for T2D risk in humans, it remains unclear how these noncoding variants affect disease etiology. To test the hyp...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.123745.111

    authors: Savic D,Ye H,Aneas I,Park SY,Bell GI,Nobrega MA

    更新日期:2011-09-01 00:00:00

  • Comparative architectures of mammalian and chicken genomes reveal highly variable rates of genomic rearrangements across different lineages.

    abstract::Molecular evolution studies are usually based on the analysis of individual genes and thus reflect only small-range variations in genomic sequences. A complementary approach is to study the evolutionary history of rearrangements in entire genomes based on the analysis of gene orders. The progress in whole genome seque...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3002305

    authors: Bourque G,Zdobnov EM,Bork P,Pevzner PA,Tesler G

    更新日期:2005-01-01 00:00:00

  • Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale.

    abstract::Many thousands of proteins encoded by the genome of Plasmodium falciparum, the causal organism of the deadliest form of human malaria, are of unknown function. It is of utmost importance that these proteins be characterized if we are to develop combative strategies against malaria based on the biology of the parasite....

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4573206

    authors: Date SV,Stoeckert CJ Jr

    更新日期:2006-04-01 00:00:00

  • Genome-wide identification of conserved regulatory function in diverged sequences.

    abstract::Plasticity of gene regulatory encryption can permit DNA sequence divergence without loss of function. Functional information is preserved through conservation of the composition of transcription factor binding sites (TFBS) in a regulatory element. We have developed a method that can accurately identify pairs of functi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.119016.110

    authors: Taher L,McGaughey DM,Maragh S,Aneas I,Bessling SL,Miller W,Nobrega MA,McCallion AS,Ovcharenko I

    更新日期:2011-07-01 00:00:00

  • Comprehensive genome sequence analysis of a breast cancer amplicon.

    abstract::Gene amplification occurs in most solid tumors and is associated with poor prognosis. Amplification of 20q13.2 is common to several tumor types including breast cancer. The 1 Mb of sequence spanning the 20q13.2 breast cancer amplicon is one of the most exhaustively studied segments of the human genome. These studies h...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.gr1743r

    authors: Collins C,Volik S,Kowbel D,Ginzinger D,Ylstra B,Cloutier T,Hawkins T,Predki P,Martin C,Wernick M,Kuo WL,Alberts A,Gray JW

    更新日期:2001-06-01 00:00:00

  • GeneID in Drosophila.

    abstract::GeneID is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, and start and stop codons are predicted and scored along the sequence using position weight matrices (PWMs). In the second step, exons are built from the sites. Exons are scored ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.10.4.511

    authors: Parra G,Blanco E,Guigó R

    更新日期:2000-04-01 00:00:00

  • The extensive and condition-dependent nature of epistasis among whole-genome duplicates in yeast.

    abstract::Since complete redundancy between extant duplicates (paralogs) is evolutionarily unfavorable, some degree of functional congruency is eventually lost. However, in budding yeast, experimental evidence collected for duplicated metabolic enzymes and in global physical interaction surveys had suggested widespread function...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.076174.108

    authors: Musso G,Costanzo M,Huangfu M,Smith AM,Paw J,San Luis BJ,Boone C,Giaever G,Nislow C,Emili A,Zhang Z

    更新日期:2008-07-01 00:00:00

  • Strategies for mutational analysis of the large multiexon ATM gene using high-density oligonucleotide arrays.

    abstract::Mutational analysis of large genes with complex genomic structures plays an important role in medical genetics. Technical limitations associated with current mutation screening protocols have placed increased emphasis on the development of new technologies to simplify these procedures. High-density arrays of >90,000-o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.12.1245

    authors: Hacia JG,Sun B,Hunt N,Edgemon K,Mosbrook D,Robbins C,Fodor SP,Tagle DA,Collins FS

    更新日期:1998-12-01 00:00:00

  • The complete genome and proteome of Mycoplasma mobile.

    abstract::Although often considered "minimal" organisms, mycoplasmas show a wide range of diversity with respect to host environment, phenotypic traits, and pathogenicity. Here we report the complete genomic sequence and proteogenomic map for the piscine mycoplasma Mycoplasma mobile, noted for its robust gliding motility. For t...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2674004

    authors: Jaffe JD,Stange-Thomann N,Smith C,DeCaprio D,Fisher S,Butler J,Calvo S,Elkins T,FitzGerald MG,Hafez N,Kodira CD,Major J,Wang S,Wilkinson J,Nicol R,Nusbaum C,Birren B,Berg HC,Church GM

    更新日期:2004-08-01 00:00:00

  • Mutation scanning by meltMADGE: validations using BRCA1 and LDLR, and demonstration of the potential to identify severe, moderate, silent, rare, and paucimorphic mutations in the general population.

    abstract::We have developed a mutation-scanning approach suitable for whole population screening for unknown mutations. The method, meltMADGE, combines thermal ramp electrophoresis with MADGE to achieve suitable cost efficiency and throughput. The sensitivity was tested in blind trials using 54 amplicons representing the BRCA1 ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3313405

    authors: Alharbi KK,Aldahmesh MA,Spanakis E,Haddad L,Whittall RA,Chen XH,Rassoulian H,Smith MJ,Sillibourne J,Ball NJ,Graham NJ,Briggs PJ,Simpson IA,Phillips DI,Lawlor DA,Ye S,Humphries SE,Cooper C,Smith GD,Ebrahim S,Eccles

    更新日期:2005-07-01 00:00:00

  • Copy-number-aware differential analysis of quantitative DNA sequencing data.

    abstract::Developments in microarray and high-throughput sequencing (HTS) technologies have resulted in a rapid expansion of research into epigenomic changes that occur in normal development and in the progression of disease, such as cancer. Not surprisingly, copy number variation (CNV) has a direct effect on HTS read densities...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.139055.112

    authors: Robinson MD,Strbenac D,Stirzaker C,Statham AL,Song J,Speed TP,Clark SJ

    更新日期:2012-12-01 00:00:00

  • Evolution and comparative genomics of odorant- and pheromone-associated genes in rodents.

    abstract::Chemical cues influence a range of behavioral responses in rodents. The involvement of protein odorants and odorant receptors in mediating reproductive behavior, foraging, and predator avoidance suggests that their genes may have been subject to adaptive evolution. We have estimated the consequences of selection on ro...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1940604

    authors: Emes RD,Beatson SA,Ponting CP,Goodstadt L

    更新日期:2004-04-01 00:00:00

  • CG dinucleotides enhance promoter activity independent of DNA methylation.

    abstract::Most mammalian RNA polymerase II initiation events occur at CpG islands, which are rich in CpGs and devoid of DNA methylation. Despite their relevance for gene regulation, it is unknown to what extent the CpG dinucleotide itself actually contributes to promoter activity. To address this question, we determined the tra...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.241653.118

    authors: Hartl D,Krebs AR,Grand RS,Baubec T,Isbel L,Wirbelauer C,Burger L,Schübeler D

    更新日期:2019-04-01 00:00:00

  • Genomic analysis identifies association of Fusobacterium with colorectal carcinoma.

    abstract::The tumor microenvironment of colorectal carcinoma is a complex community of genomically altered cancer cells, nonneoplastic cells, and a diverse collection of microorganisms. Each of these components may contribute to carcinogenesis; however, the role of the microbiota is the least well understood. We have characteri...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.126573.111

    authors: Kostic AD,Gevers D,Pedamallu CS,Michaud M,Duke F,Earl AM,Ojesina AI,Jung J,Bass AJ,Tabernero J,Baselga J,Liu C,Shivdasani RA,Ogino S,Birren BW,Huttenhower C,Garrett WS,Meyerson M

    更新日期:2012-02-01 00:00:00

  • A comprehensive survey of 3' animal miRNA modification events and a possible role for 3' adenylation in modulating miRNA targeting effectiveness.

    abstract::Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after k...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106054.110

    authors: Burroughs AM,Ando Y,de Hoon MJ,Tomaru Y,Nishibu T,Ukekawa R,Funakoshi T,Kurokawa T,Suzuki H,Hayashizaki Y,Daub CO

    更新日期:2010-10-01 00:00:00

  • The human homolog T of the mouse T(Brachyury) gene; gene structure, cDNA sequence, and assignment to chromosome 6q27.

    abstract::We have cloned the human gene encoding the transcription factor T. T protein is vital for the formation of posterior mesoderm and axial development in all vertebrates. Brachyury mutant mice, which lack T protein, die in utero with abnormal notochord, posterior somites, and allantois. We have identified human T genomic...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.3.226

    authors: Edwards YH,Putt W,Lekoape KM,Stott D,Fox M,Hopkinson DA,Sowden J

    更新日期:1996-03-01 00:00:00

  • A linkage map of the rat genome derived from three F2 crosses.

    abstract::We report the construction of a dense linkage map of the rat genome integrating 767 simple sequence length polymorphism markers, combined over three crosses with high rates of polymorphism. F2 populations from WKY x S (n = 159), BN x S (n = 91), and BN x GK (n = 139) were selected and genotyped for combinations of mic...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.5.434

    authors: Bihoreau MT,Gauguier D,Kato N,Hyne G,Lindpaintner K,Rapp JP,James MR,Lathrop GM

    更新日期:1997-05-01 00:00:00