Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies.

Abstract:

The multiplicity problem has become increasingly important in genetic studies as the capacity for high-throughput genotyping has increased. Control of the False Discovery Rate (FDR) (Benjamini and Hochberg. [1995] J. R. Stat. Soc. Ser. B 57:289-300) has been adopted to address the problems of false positive control and low power inherent in high-volume genome-wide linkage and association studies. In many genetic studies, there is a natural stratification of the m hypotheses to be tested. Given the FDR framework and the presence of such stratification, we investigate the performance of a stratified false discovery control approach (i.e. control or estimate FDR separately for each stratum) and compare it to the aggregated method (i.e. consider all hypotheses in a single stratum). Under the fixed rejection region framework (i.e. reject all hypotheses with unadjusted p-values less than a pre-specified level and then estimate FDR), we demonstrate that the aggregated FDR is a weighted average of the stratum-specific FDRs. Under the fixed FDR framework (i.e. reject as many hypotheses as possible while controlling FDR at a pre-specified level), we specify a condition necessary for the expected total number of true positives under the stratified FDR method to be equal to or greater than that obtained from the aggregated FDR method. Application to a recent Genome-Wide Association (GWA) study by Maraganore et al. ([2005] Am. J. Hum. Genet. 77:685-693) illustrates the potential advantages of control or estimation of FDR by stratum. Our analyses also show that controlling FDR at a low rate, e.g. 5% or 10%, may not be feasible for some GWA studies.
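
The weighted-average statement for the fixed rejection region framework can be illustrated with a minimal Python sketch. It is not the authors' code: it works with the realized false discovery proportion V/R in simulated data rather than the expected FDR, and it assumes the weights are each stratum's share of the total rejections. The function names, the simulation model (alternative p-values drawn as u**k), and all parameter values are hypothetical choices made only for this illustration.

```python
import random


def simulate_stratum(m, prop_alt, alt_scale, rng):
    """Return m (p_value, is_null) pairs for one stratum (toy model).

    Null p-values are uniform on [0, 1); alternative p-values are u**alt_scale,
    which concentrates them near 0 when alt_scale > 1 (an assumed model).
    """
    tests = []
    for _ in range(m):
        is_null = rng.random() >= prop_alt
        u = rng.random()
        p = u if is_null else u ** alt_scale
        tests.append((p, is_null))
    return tests


def fdp_at_threshold(tests, alpha):
    """Reject every test with p < alpha; return (R, V, V/R), with V/R = 0 if R == 0."""
    rejected = [is_null for p, is_null in tests if p < alpha]
    r = len(rejected)   # number of rejections
    v = sum(rejected)   # number of rejected true nulls (false discoveries)
    return r, v, (v / r if r else 0.0)


def weighted_average_fdp(strata, alpha):
    """Average the stratum-specific V_k/R_k with weights R_k / R."""
    per_stratum = [fdp_at_threshold(s, alpha) for s in strata]
    total_r = sum(r for r, _, _ in per_stratum)
    if total_r == 0:
        return 0.0
    return sum((r / total_r) * fdp for r, _, fdp in per_stratum)


if __name__ == "__main__":
    rng = random.Random(0)
    # Two hypothetical strata: a small one enriched for true signals,
    # and a large one that is almost entirely null.
    strata = [
        simulate_stratum(1_000, prop_alt=0.10, alt_scale=6, rng=rng),
        simulate_stratum(20_000, prop_alt=0.001, alt_scale=6, rng=rng),
    ]
    alpha = 0.001
    pooled = [t for s in strata for t in s]
    _, _, aggregated = fdp_at_threshold(pooled, alpha)
    for k, s in enumerate(strata, start=1):
        r, v, fdp = fdp_at_threshold(s, alpha)
        print(f"stratum {k}: R={r}, V={v}, FDP={fdp:.3f}")
    print(f"aggregated FDP:       {aggregated:.3f}")
    print(f"weighted average FDP: {weighted_average_fdp(strata, alpha):.3f}")
```

In this toy setting the aggregated proportion and the rejection-weighted average coincide by construction, while the stratum-specific values can differ widely. That difference is the information an aggregated analysis hides.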

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Sun L,Craiu RV,Paterson AD,Bull SB

doi

10.1002/gepi.20164

subject

Has Abstract

pub_date

2006-09-01 00:00:00

pages

519-30

issue

6

eissn

1098-2272

issn

0741-0395

journal_volume

30

pub_type

Journal Article
  • Parental transmission and D18S37 allele sharing in bipolar affective disorder.

    abstract::We combined the five chromosome 18 bipolar affective disorder data sets provided by GAW10, totaling 185 families with 3,394 individuals, and performed analysis of differential parental transmission and chromosome 18 marker allele sharing in families with transmission through fathers vs those through mothers. Results i...

    journal_title:Genetic epidemiology

    pub_type: Clinical Trial, Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:6<665::AID-GEPI19>

    authors: Lin JP,Bale SJ

    update_date:1997-01-01 00:00:00

  • eQuIPS: eQTL Analysis Using Informed Partitioning of SNPs - A Fully Bayesian Approach.

    abstract::We develop a Bayesian multi-SNP Markov chain Monte Carlo approach that allows published functional significance scores to objectively inform single nucleotide polymorphism (SNP) prior effect sizes in expression quantitative trait locus (eQTL) studies. We developed the Normal Gamma prior to allow the inclusion of funct...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21961

    authors: Boggis EM,Milo M,Walters K

    update_date:2016-05-01 00:00:00

  • National database of familial cancer in Sweden.

    abstract::A family cancer database was constructed from the nationwide Swedish registries and includes approximately 6 million persons and >30,000 cancers in offspring diagnosed at ages 15-51 years and their parents. A particular advantage of the database is that the contribution of both parental lineages on cancer risk can be ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1998)15:3<225::AID-GEPI2>3

    authors: Hemminki K,Vaittinen P

    update_date:1998-01-01 00:00:00

  • Association and linkage analysis of ICD-10 diagnosis for alcoholism.

    abstract::We analyzed the GAW11 data on alcoholism provided by the Collaborative Study on the Genetics of Alcoholism (COGA) using an extension of a new test of linkage and association for quantitative traits developed by George et al. [1999]. This method determines linkage between marker loci and quantitative traits, when allel...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370170758

    authors: Tiwari HK,Zhu X,Elston RC,Shu Y,George V

    update_date:1999-01-01 00:00:00

  • A multipoint method for meta-analysis of genetic association studies.

    abstract::Meta-analyses of genetic association studies are usually performed using a single polymorphism at a time, even though in many cases the individual studies report results from partially overlapping sets of polymorphisms. We present here a multipoint (or multilocus) method for multivariate meta-analysis of published pop...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20531

    authors: Bagos PG,Liakopoulos TD

    update_date:2010-11-01 00:00:00

  • POLARIS: Polygenic LD-adjusted risk score approach for set-based analysis of GWAS data.

    abstract::Polygenic risk scores (PRSs) are a method to summarize the additive trait variance captured by a set of SNPs, and can increase the power of set-based analyses by leveraging public genome-wide association study (GWAS) datasets. PRS aims to assess the genetic liability to some phenotype on the basis of polygenic risk fo...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22117

    authors: Baker E,Schmidt KM,Sims R,O'Donovan MC,Williams J,Holmans P,Escott-Price V,Consortium WTG

    update_date:2018-06-01 00:00:00

  • Use of variable marker density, principal components, and neural networks in the dissection of disease etiology.

    abstract::Several approaches were taken to identify the loci contributing to the quantitative and qualitative phenotypes in the Genetic Analysis Workshop 12 simulated data set. To identify possible quantitative trait loci (QTL), the quantitative traits were analyzed using SOLAR. The four replicates identified as the "best repli...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s732

    authors: Pankratz N,Kirkwood SC,Flury L,Koller DL,Foroud T

    update_date:2001-01-01 00:00:00

  • The insulin gene and susceptibility to IDDM.

    abstract::The association between insulin-dependent diabetes mellitus (IDDM) and an allele of a restriction fragment length polymorphism (RFLP) 5' to the coding region of the insulin gene has raised the possibility that variation in the vicinity of the insulin gene confers susceptibility to IDDM. To test this hypothesis, the di...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370060113

    authors: Cox NJ,Spielman RS

    update_date:1989-01-01 00:00:00

  • Improving power in genome-wide association studies: weights tip the scale.

    abstract::The potential of genome-wide association analysis can only be realized when they have power to detect signals despite the detrimental effect of multiple testing on power. We develop a weighted multiple testing procedure that facilitates the input of prior information in the form of groupings of tests. For each group a...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20237

    authors: Roeder K,Devlin B,Wasserman L

    update_date:2007-11-01 00:00:00

  • Analysis of twin data ascertained through probands: the double-entry approach.

    abstract::Twin pairs are sometimes included in studies because at least one of them is a proband, and conventionally the analysis of the data is based on the conditional distribution of the co twin given the proband. In the case of more than one proband in each pair, an often used "ad hoc" method of analysis is to allow each tw...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10253

    authors: Hindsberger C,Bryld LE

    update_date:2003-11-01 00:00:00

  • Optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group.

    abstract::Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousan...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20482

    authors: Zhuang JJ,Zondervan K,Nyberg F,Harbron C,Jawaid A,Cardon LR,Barratt BJ,Morris AP

    update_date:2010-05-01 00:00:00

  • Genome-wide association studies for discrete traits.

    abstract::Genome-wide association studies of discrete traits generally use simple methods of analysis based on chi(2) tests for contingency tables or logistic regression, at least for an initial scan of the entire genome. Nevertheless, more power might be obtained by using various methods that analyze multiple markers in combin...

    journal_title:Genetic epidemiology

    pub_type:

    doi:10.1002/gepi.20465

    authors: Thomas DC

    update_date:2009-01-01 00:00:00

  • Two adaptive weighting methods to test for rare variant associations in family-based designs.

    abstract::Although next-generation DNA sequencing technologies have made rare variant association studies feasible and affordable, the development of powerful statistical methods for rare variant association studies is still under way. Most of the existing methods for rare variant association studies compare the number of rare ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21646

    authors: Fang S,Sha Q,Zhang S

    update_date:2012-07-01 00:00:00

  • Model-based linkage analysis with imprinting for quantitative traits: ignoring imprinting effects can severely jeopardize detection of linkage.

    abstract::Genes with imprinting (parent-of-origin) effects express differently when inheriting from the mother or from the father. Some genes for development and behavior in mammals are known to be imprinted. We developed parametric linkage analysis that accounts for imprinting effects for continuous traits, implementing it in ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20321

    authors: Sung YJ,Rao DC

    update_date:2008-07-01 00:00:00

  • Genotyping errors, pedigree errors, and missing data.

    abstract::Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics of Alcoholism (COG...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20120

    authors: Hinrichs AL,Suarez BK

    update_date:2005-01-01 00:00:00

  • Replication of genetic associations as pseudoreplication due to shared genealogy.

    abstract::The genotypes of individuals in replicate genetic association studies have some level of correlation due to shared descent in the complete pedigree of all living humans. As a result of this genealogical sharing, replicate studies that search for genotype-phenotype associations using linkage disequilibrium between mark...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20400

    authors: Rosenberg NA,Vanliere JM

    update_date:2009-09-01 00:00:00

  • Genome-wide approaches for identifying interacting susceptibility regions for asthma.

    abstract::A genome-wide correlation analysis and cluster analysis were utilized to determine chromosomal regions that had similar nonparametric linkage scores across families in order to locate interacting susceptibility loci for asthma. Conditional analysis was performed to detect any increase in lod score over baseline. Eight...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s266

    authors: Colilla S,Tsalenko A,Pluznikov A,Cox NJ

    update_date:2001-01-01 00:00:00

  • Multiethnic polygenic risk scores improve risk prediction in diverse populations.

    abstract::Methods for genetic risk prediction have been widely investigated in recent years. However, most available training data involves European samples, and it is currently unclear how to accurately predict disease risk in other populations. Previous studies have used either training data from European samples in large sam...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22083

    authors: Márquez-Luna C,Loh PR,South Asian Type 2 Diabetes (SAT2D) Consortium.,SIGMA Type 2 Diabetes Consortium.,Price AL

    update_date:2017-12-01 00:00:00

  • On the association analysis of genome-sequencing data: A spatial clustering approach for partitioning the entire genome into nonoverlapping windows.

    abstract::For the association analysis of whole-genome sequencing (WGS) studies, we propose an efficient and fast spatial-clustering algorithm. Compared to existing analysis approaches for WGS data, that define the tested regions either by sliding or consecutive windows of fixed sizes along variants, a meaningful grouping of ne...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22040

    authors: Loehlein Fier H,Prokopenko D,Hecker J,Cho MH,Silverman EK,Weiss ST,Tanzi RE,Lange C

    update_date:2017-05-01 00:00:00

  • Information on ancestry from genetic markers.

    abstract::It is possible to estimate the proportionate contributions of ancestral populations to admixed individuals or populations using genetic markers, but different loci and alleles vary considerably in the amount of information that they provide. Conventionally, the allele frequency difference between parental populations ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10319

    authors: Pfaff CL,Barnholtz-Sloan J,Wagner JK,Long JC

    update_date:2004-05-01 00:00:00

  • Quantitative allelic test--a fast test for very large association studies.

    abstract::Advances in high throughput technology have enabled the generation of unprecedented amounts of genomic data (e.g., next-generation sequence data, transcriptomics, metabolomics, and proteomics), which promises to unravel the genetic architecture of complex traits. These discoveries may lead to novel therapeutic targets...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21768

    authors: Lee SM,Karrison TG,Cox NJ,Im HK

    update_date:2013-12-01 00:00:00

  • Effect of physical activity on lipid levels in a population-based sample of men with and without the Arg192 variant of the human paraoxonase gene.

    abstract::The prevalence of cardiovascular risk factors in Gerona, Spain, is high for the low myocardial infarction incidence and mortality rates in the province. Physical activity is a protective factor against coronary heart disease. We investigated whether the genetic variants Q and R of the paraoxonase Gln-Arg 192 polymorph...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(200003)18:3<276::AID-GEPI6

    authors: Sentí M,Aubó C,Elosua R,Sala J,Tomás M,Marrugat J

    update_date:2000-03-01 00:00:00

  • Using single nucleotide polymorphisms to investigate association between a candidate gene and disease.

    abstract::A range of study designs, using unrelated or family controls, were used to investigate the pattern of association with disease of single nucleotide polymorphisms (SNPs) within candidate gene 1 (simulated data). Strong evidence of disease association at the functional locus was detected using all study designs, and in ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s415

    authors: Saunders CL,Crockford GP,Bishop DT,Barrett JH

    update_date:2001-01-01 00:00:00

  • Kernel Approach for Modeling Interaction Effects in Genetic Association Studies of Complex Quantitative Traits.

    abstract::The etiology of complex traits likely involves the effects of genetic and environmental factors, along with complicated interaction effects between them. Consequently, there has been interest in applying genetic association tests of complex traits that account for potential modification of the genetic effect in the pr...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21901

    authors: Broadaway KA,Duncan R,Conneely KN,Almli LM,Bradley B,Ressler KJ,Epstein MP

    update_date:2015-07-01 00:00:00

  • Evaluation of genetic and environmental effects using GEE and APM methods.

    abstract::Two analytic methods were used in the Problem 2 data set. First, generalized estimating equations (GEE) modelling was developed to adjust for familial correlation in regressions evaluating candidate genes and an environmental factor. Second, the affected-pedigree-member (APM) method was used to identify chromosomal re...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370120633

    authors: Bull SB,Chapman NH,Greenwood CM,Darlington GA

    update_date:1995-01-01 00:00:00

  • Robust inference for variance components models in families ascertained through probands: I. Conditioning on proband's phenotype.

    abstract::A robust approach for estimating standard errors of variance components by using quantitative phenotypes from families ascertained through a proband with an extreme phenotypic value is presented. Estimators that use the multivariate normal distribution as a "working likelihood" are obtained by computing conditional ln...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370040305

    authors: Beaty TH,Liang KY

    update_date:1987-01-01 00:00:00

  • BRCA1 polymorphisms and breast cancer epidemiology in the Western New York exposures and breast cancer (WEB) study.

    abstract::Results of studies for the association of BRCA1 genotypes and haplotypes with sporadic breast cancer have been inconsistent. Therefore, a candidate single nucleotide polymorphism (SNP) approach was used in a breast cancer case-control study to explore genotypes and haplotypes that have the potential to affect protein ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21730

    authors: Ricks-Santi LJ,Nie J,Marian C,Ochs-Balcom HM,Trevisan M,Edge SB,Kanaan Y,Freudenheim JL,Shields PG

    update_date:2013-07-01 00:00:00

  • To type or not to type: the use of unaffected siblings in nonparametric linkage analysis.

    abstract::Unaffected individuals are often disregarded in nonparametric linkage analysis. Because of the presumed high complexity of genetic interactions and the resulting low penetrance of any single genetic effect, the statistical contribution of unaffected sib pairs is thought to be considerably lower than that of the affect...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s522

    authors: Majewski J

    update_date:2001-01-01 00:00:00

  • Identifying SNPs predictive of phenotype using random forests.

    abstract::There has been a great interest and a few successes in the identification of complex disease susceptibility genes in recent years. Association studies, where a large number of single-nucleotide polymorphisms (SNPs) are typed in a sample of cases and controls to determine which genes are associated with a specific dise...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20041

    authors: Bureau A,Dupuis J,Falls K,Lunetta KL,Hayward B,Keith TP,Van Eerdewegh P

    update_date:2005-02-01 00:00:00

  • Phenotypic effects of apolipoprotein structural variation on lipid profiles: II. Apolipoprotein A-IV and quantitative lipid measures in the healthy women study.

    abstract::Apolipoprotein A-IV (APO A-IV) is a major protein component of mesenteric lymph chylomicrons and very-low-density lipoproteins. It is found in plasma predominantly unassociated with major lipoprotein fractions and in high density lipoproteins. APO A-IV exhibits structural heterogeneity owing to two codominant alleles,...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370060404

    authors: Eichner JE,Kuller LH,Ferrell RE,Kamboh MI

    update_date:1989-01-01 00:00:00