Improving power for rare-variant tests by integrating external controls.

Abstract:

Due to the drop in sequencing cost, the number of sequenced genomes is increasing rapidly. To improve the power of rare-variant tests, these sequenced samples could be used as external control samples in addition to control samples from the study itself. However, when using external controls, possible batch effects due to the use of different sequencing platforms or genotype calling pipelines can dramatically increase type I error rates. To address this, we propose novel summary-statistics-based single-variant and gene- or region-based rare-variant tests that allow the integration of external controls while controlling the type I error rate. Our approach is based on the insight that batch effects on a given variant can be assessed by comparing odds ratio estimates obtained using internal controls only with those obtained using the combined internal and external controls. From simulation experiments and the analysis of data from age-related macular degeneration and type 2 diabetes studies, we demonstrate that our method can substantially improve power while controlling the type I error rate.
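
Illustrative sketch (not the authors' test statistic): the comparison described in the abstract can be pictured by computing a variant's log odds ratio once with internal controls only and once with internal and external controls pooled. The function names, the 0.5 continuity correction, and the allele counts used below are assumptions made for illustration; the abstract does not specify any of them.

```python
import math


def log_odds_ratio(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Log odds ratio from allele counts.

    A 0.5 continuity correction is added to every cell (an assumption of
    this sketch) so that rare variants with zero counts stay finite.
    """
    a, b, c, d = (x + 0.5 for x in (case_alt, case_ref, ctrl_alt, ctrl_ref))
    return math.log((a * d) / (b * c))


def odds_ratio_shift(case, internal, external):
    """Compare odds ratio estimates with and without external controls.

    Each argument is an (alt_allele_count, ref_allele_count) pair.
    Returns (log OR with internal controls only,
             log OR with pooled internal + external controls,
             their difference).
    A difference far from zero hints that the external controls behave
    differently at this variant, for example because of a batch effect.
    """
    combined = (internal[0] + external[0], internal[1] + external[1])
    lor_internal = log_odds_ratio(*case, *internal)
    lor_combined = log_odds_ratio(*case, *combined)
    return lor_internal, lor_combined, lor_internal - lor_combined


if __name__ == "__main__":
    # Hypothetical allele counts, chosen only to illustrate the idea.
    case = (30, 1970)
    internal = (10, 1990)
    concordant_external = (50, 9950)    # same allele frequency as internal
    discordant_external = (150, 9850)   # inflated allele frequency

    print(odds_ratio_shift(case, internal, concordant_external))
    print(odds_ratio_shift(case, internal, discordant_external))
```

Because the cases appear in both estimates, the difference depends only on how the pooled controls differ from the internal controls. The two estimates share data and are therefore correlated, so this sketch reports the shift without attaching a p-value to it.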

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Lee S,Kim S,Fuchsberger C

doi

10.1002/gepi.22057

subject

Has Abstract

pub_date

2017-11-01 00:00:00

pages

610-619

issue

7

eissn

1098-2272

issn

0741-0395

journal_volume

41

pub_type

Journal Article
  • SNP selection in genome-wide and candidate gene studies via penalized logistic regression.

    abstract::Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true perti...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20543

    authors: Ayers KL,Cordell HJ

    update_date:2010-12-01 00:00:00

  • Availability of schizophrenic patients and their families for genetic linkage studies: findings from the Maryland epidemiology sample.

    abstract::It has been suggested that collections of affected sib pairs, or their nuclear families, may be an efficient method for screening for genetic linkages in schizophrenia. We present the data collected in five years from 15 hospitals in the state of Maryland in an effort to determine if such a collection scheme will be f...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370060604

    authors: Pulver AE,Bale SJ

    update_date:1989-01-01 00:00:00

  • Genetic analysis of IDDM: summary of GAW5 IDDM results.

    abstract::This paper summarizes the analyses by participants in the insulin-dependent diabetes mellitus (IDDM) component of Genetic Analysis Workshop 5 (GAW5). The data were obtained from 94 families with two or more IDDM sibs. Topics treated in the Workshop analysis included the following: methods for detecting associations an...

    journal_title:Genetic epidemiology

    pub_type: Journal Article, Review

    doi:10.1002/gepi.1370060111

    authors: Spielman RS,Baur MP,Clerget-Darpoux F

    update_date:1989-01-01 00:00:00

  • Effect of linkage disequilibrium between markers in linkage and association analyses.

    abstract::Contributions to Group 17 of the Genetic Analysis Workshop 15 considered dense markers in linkage disequilibrium (LD) in the context of either linkage or association analysis. Three contributions reported on methods for modeling LD or selecting a subset of markers in linkage equilibrium to perform linkage analysis. Wh...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20291

    authors: Dupuis J,Albers K,Allen-Brady K,Cho K,Elston RC,Kappen HJ,Tang H,Thomas A,Thomson G,Tsung E,Yang Q,Zhang W,Zhao K,Zheng G,Ziegler JT

    update_date:2007-01-01 00:00:00

  • Biochemical intermediates in alpha 1-antitrypsin deficiency: residual family resemblance for total alpha 1-antitrypsin, oxidized alpha 1-antitrypsin, and immunoglobulin E after adjustment for the effect of the Pi locus.

    abstract::alpha 1-antitrypsin (alpha 1 AT) deficiency is variably associated with the development of pulmonary emphysema. To gain insight into the process which begins the Z point mutation at the Protease Inhibitor (Pi) locus and results in the variable development of emphysema, three quantitative phenotypes, including total al...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370070204

    authors: Silverman EK,Province MA,Campbell EJ,Pierce JA,Rao DC

    update_date:1990-01-01 00:00:00

  • Contribution of thermolabile methylenetetrahydrofolate reductase variant to total plasma homocysteine levels in healthy men and women. Inter99 (2).

    abstract::Elevation in plasma total homocysteine (tHcy) is believed to be causally related to cardiovascular disease. Like age and sex, the thermolabile variant of methylenetetrahydrofolate reductase (MTHFR(C677T)) is an important nonmodifiable determinant of tHcy, which may be considered when describing normal ranges of tHcy i...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10239

    authors: Husemoen LL,Thomsen TF,Fenger M,Jørgensen HL,Jørgensen T

    update_date:2003-05-01 00:00:00

  • Lifestyle and blood pressure levels in male twins in Utah.

    abstract::Healthy male monozygotic (MZ) and dizygotic (DZ) twin pairs (MZ pairs = 77; DZ pairs = 88) were studied to assess the effect of dietary intake, physical activity, physical fitness, body mass index (BMI), sum of the triceps and subscapular skinfold measurements, alcohol and caffeine consumption, and smoking patterns on...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370050409

    authors: Slattery ML,Bishop DT,French TK,Hunt SC,Meikle AW,Williams RR

    update_date:1988-01-01 00:00:00

  • Generalization of the extended transmission disequilibrium test to two unlinked disease loci.

    abstract::The extended transmission disequilibrium test (ETDT) of Sham and Curtis [1995] is a powerful test of the null hypothesis of no linkage between a multi-allelic marker locus and a disease susceptibility locus of unknown location in the presence of association between alleles at the two loci. We propose a generalization ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.13701707108

    authors: Morris A,Whittaker J

    update_date:1999-01-01 00:00:00

  • PANDA: Prioritization of autism-genes using network-based deep-learning approach.

    abstract::Understanding the genetic background of complex diseases and disorders plays an essential role in the promising precision medicine. The evaluation of candidate genes, however, requires time-consuming and expensive experiments given a large number of possibilities. Thus, computational methods have seen increasing appli...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22282

    authors: Zhang Y,Chen Y,Hu T

    update_date:2020-06-01 00:00:00

  • Meta-Analysis of Rare Variant Association Tests in Multiethnic Populations.

    abstract::Several methods have been proposed to increase power in rare variant association testing by aggregating information from individual rare variants (MAF < 0.005). However, how to best combine rare variants across multiple ethnicities and the relative performance of designs using different ethnic sampling fractions remai...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21939

    authors: Mensah-Ablorh A,Lindstrom S,Haiman CA,Henderson BE,Marchand LL,Lee S,Stram DO,Eliassen AH,Price A,Kraft P

    update_date:2016-01-01 00:00:00

  • Genetic heterogeneity in Alzheimer's disease: a grade of membership analysis.

    abstract::Grade of membership analysis (GoM) may have particular relevance for genetic epidemiology. The method can flexibly relate genetic markers, clinical features, and environmental exposures to possible subtypes of disease termed pure types even when population allele frequencies and penetrance functions are not known. Hen...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370100628

    authors: Corder EH,Woodbury MA

    update_date:1993-01-01 00:00:00

  • Modeling the HLA component in rheumatoid arthritis: sensitivity to DRB1 allele frequencies.

    abstract::Rheumatoid arthritis is an inflammatory disease for which positive associations have been described with some HLA-DRB1 alleles. The associated alleles share a similar amino acid sequence in the third hypervariable region, the shared epitope, but differ at position 71 and 86. It has been suggested that HLA susceptibili...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/1098-2272(200012)19:4<422::AID-GEPI12>3.0.

    authors: Tézenas du Montcel S,Reviron D,Genin E,Roudier J,Mercier P,Clerget-Darpoux F

    update_date:2000-12-01 00:00:00

  • Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk.

    abstract::Linkage analysis of complex traits has had limited success in identifying trait-influencing loci. Recently, coding variants have been implicated as the basis for some biomedical associations. We tested whether coding variants are the basis for linkage peaks of complex traits in 42 African-American (n = 596) and 90 His...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21801

    authors: Hellwege JN,Palmer ND,Raffield LM,Ng MC,Hawkins GA,Long J,Lorenzo C,Norris JM,Ida Chen YD,Speliotes EK,Rotter JI,Langefeld CD,Wagenknecht LE,Bowden DW

    update_date:2014-05-01 00:00:00

  • Identifying SNPs predictive of phenotype using random forests.

    abstract::There has been a great interest and a few successes in the identification of complex disease susceptibility genes in recent years. Association studies, where a large number of single-nucleotide polymorphisms (SNPs) are typed in a sample of cases and controls to determine which genes are associated with a specific dise...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20041

    authors: Bureau A,Dupuis J,Falls K,Lunetta KL,Hayward B,Keith TP,Van Eerdewegh P

    update_date:2005-02-01 00:00:00

  • Optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group.

    abstract::Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousan...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20482

    authors: Zhuang JJ,Zondervan K,Nyberg F,Harbron C,Jawaid A,Cardon LR,Barratt BJ,Morris AP

    update_date:2010-05-01 00:00:00

  • Lessons learned from Genetic Analysis Workshop 17: transitioning from genome-wide association studies to whole-genome statistical genetic analysis.

    abstract::Genetic Analysis Workshop 17 (GAW17) focused on the transition from genome-wide association study designs and methods to the study designs and statistical genetic methods that will be required for the analysis of next-generation sequence data including both common and rare sequence variants. In the 166 contributions t...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20659

    authors: Wilson AF,Ziegler A

    update_date:2011-01-01 00:00:00

  • Estimating gene penetrance from family data.

    abstract::Family data are useful for estimating disease risk in carriers of specific genotypes of a given gene (penetrance). Penetrance is frequently estimated assuming that relatives' phenotypes are independent, given their genotypes for the gene of interest. This assumption is unrealistic when multiple shared risk factors con...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20493

    authors: Gong G,Hannon N,Whittemore AS

    update_date:2010-05-01 00:00:00

  • Model selection and Bayesian methods in statistical genetics: summary of group 11 contributions to Genetic Analysis Workshop 15.

    abstract::The research presented in group 11 of the Genetic Analysis Workshop 15 (GAW15) falls into two major themes: Model selection approaches for gene mapping (both Bayesian and Frequentist); and other Bayesian methods. These methods either allow relaxation of some of the common assumptions, such as mode of inheritance, for ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article, Review

    doi:10.1002/gepi.20285

    authors: Swartz MD,Thomas DC,Daw EW,Albers K,Charlesworth JC,Dyer TC,Fridley BL,Govil M,Kraft P,Kwon S,Logue MW,Oh C,Pique-Regi R,Saba L,Schumacher FR,Uh HW

    update_date:2007-01-01 00:00:00

  • Apolipoprotein E-epsilon 4 allele and familial risk in Alzheimer's disease.

    abstract::Recent studies have found an association between presence of apolipoprotein E (APOE) epsilon 4 allele and Alzheimer's disease (AD). The present study compared the cumulative risk of primary progressive dementia (PPD) in relatives of AD probands carrying at least one copy of the epsilon 4 allele with the relatives of A...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1996)13:3<285::AID-GEPI5>3

    authors: Li G,Silverman JM,Altstiel LD,Haroutunian V,Perl DP,Purohit D,Birstein S,Lantz M,Mohs RC,Davis KL

    update_date:1996-01-01 00:00:00

  • Major locus inheritance of apolipoprotein B in Utah pedigrees.

    abstract::A major locus that determines levels of apolipoprotein B (apoB) was revealed by likelihood analysis on 331 members of 36 pedigrees. The major locus explained 43.2% of the observed variance, with the remainder attributed to random environmental factors. Estimated mean apoB levels (mg/dl) were 110.5 +/- 2.5, 141.9 +/- 4...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370040202

    authors: Hasstedt SJ,Wu L,Williams RR

    update_date:1987-01-01 00:00:00

  • Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies.

    abstract::The multiplicity problem has become increasingly important in genetic studies as the capacity for high-throughput genotyping has increased. The control of False Discovery Rate (FDR) (Benjamini and Hochberg. [1995] J. R. Stat. Soc. Ser. B 57:289-300) has been adopted to address the problems of false positive control an...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20164

    authors: Sun L,Craiu RV,Paterson AD,Bull SB

    update_date:2006-09-01 00:00:00

  • Influence of marker heterozygosity and genetic heterogeneity on fine mapping.

    abstract::The purpose of the current study was to utilize the Genetic Analysis Workshop 12 simulated data to evaluate fine-mapping strategies for quantitative traits. We approached the analysis as if it was a follow-up to a genome scan that had identified two regions of interest and used the provided 1-cM density microsatellite...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s467

    authors: Heard-Costa NL,Demissie S,DeStefano AL,Knowlton BA,Maher NE,Myers RH,Volcjak JS,Wilk JB,Cupples LA

    update_date:2001-01-01 00:00:00

  • Increased risk for familial ovarian cancer among Jewish women: a population-based case-control study.

    abstract::Jewish women have been reported to have a higher risk for familial breast cancer than non-Jewish women and to be more likely to carry mutations in breast cancer genes such as BRCA1. Because BRCA1 mutations also increase women's risk for ovarian cancer, we asked whether Jewish women are at higher risk for familial ovar...

    journal_title:Genetic epidemiology

    pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial

    doi:10.1002/(SICI)1098-2272(1998)15:1<51::AID-GEPI4>3.

    authors: Steinberg KK,Pernarelli JM,Marcus M,Khoury MJ,Schildkraut JM,Marchbanks PA

    update_date:1998-01-01 00:00:00

  • Genotyping errors, pedigree errors, and missing data.

    abstract::Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics of Alcoholism (COG...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20120

    authors: Hinrichs AL,Suarez BK

    update_date:2005-01-01 00:00:00

  • Information on ancestry from genetic markers.

    abstract::It is possible to estimate the proportionate contributions of ancestral populations to admixed individuals or populations using genetic markers, but different loci and alleles vary considerably in the amount of information that they provide. Conventionally, the allele frequency difference between parental populations ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10319

    authors: Pfaff CL,Barnholtz-Sloan J,Wagner JK,Long JC

    update_date:2004-05-01 00:00:00

  • Power and sample size calculations for SNP association studies with censored time-to-event outcomes.

    abstract::For many clinical studies in cancer, germline DNA is prospectively collected for the purpose of discovering or validating single-nucleotide polymorphisms (SNPs) associated with clinical outcomes. The primary clinical endpoint for many of these studies are time-to-event outcomes such as time of death or disease progres...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21645

    authors: Owzar K,Li Z,Cox N,Jung SH

    update_date:2012-09-01 00:00:00

  • Major gene with sex-specific effects influences fat mass in Mexican Americans.

    abstract::Increased adiposity has repeatedly been identified as a major risk factor for a variety of chronic diseases. However, the question still remains whether the amount of adipose tissue itself is genetically mediated. To address this question, a segregation analysis, using maximum likelihood techniques as implemented in t...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370120505

    authors: Comuzzie AG,Blangero J,Mahaney MC,Mitchell BD,Hixson JE,Samollow PB,Stern MP,MacCluer JW

    update_date:1995-01-01 00:00:00

  • Tests for gene-environment interaction from case-control data: a novel study of type I error, power and designs.

    abstract::To evaluate the risk of a disease associated with the joint effects of genetic susceptibility and environmental exposures, epidemiologic researchers often test for non-multiplicative gene-environment effects from case-control studies. In this article, we present a comparative study of four alternative tests for intera...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20337

    authors: Mukherjee B,Ahn J,Gruber SB,Rennert G,Moreno V,Chatterjee N

    update_date:2008-11-01 00:00:00

  • Comparison of variance components, ANOVA and regression of offspring on midparent (ROMP) methods for SNP markers.

    abstract::An extension of the traditional regression of offspring on midparent (ROMP) method was used to estimate the heritability of the trait, test for marker association, and estimate the heritability attributable to a marker locus. The fifty replicates of the Genetic Analysis Workshop (GAW) 12 simulated general population d...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s794

    authors: Pugh EW,Papanicolaou GJ,Justice CM,Roy-Gagnon MH,Sorant AJ,Kingman A,Wilson AF

    update_date:2001-01-01 00:00:00

  • GAW10: simulated family data for a common oligogenic disease with quantitative risk factors.

    abstract::GAW10 Problem 2 involves a simulated common disease defined by imposing a threshold, T, on a quantitative trait, Q1. Every individual with a value of Q1 > or = T (where T = 40) is defined as affected. Also thought to be associated with the disease as intervening variables are four other quantitative traits (Q2, Q3, Q4...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:6<737::AID-GEPI29>

    authors: MacCluer JW,Blangero J,Dyer TD,Speer MC

    update_date:1997-01-01 00:00:00