SimPEL: Simulation-based power estimation for sequencing studies of low-prevalence conditions.

Abstract:

Power estimations are important for optimizing genotype-phenotype association study designs. However, existing frameworks are designed for common disorders, and thus ill-suited for the inherent challenges of studies for low-prevalence conditions such as rare diseases and infrequent adverse drug reactions. These challenges include small sample sizes and the need to leverage genetic annotation resources in association analyses for the purpose of ranking potential causal genes. We present SimPEL, a simulation-based program providing power estimations for the design of low-prevalence condition studies. SimPEL integrates the usage of gene annotation resources for association analyses. Customizable parameters, including the penetrance of the putative causal allele and the employed pathogenic scoring system, allow SimPEL to realistically model a large range of study designs. To demonstrate the effects of various parameters on power, we estimated the power of several simulated designs using SimPEL and captured power trends in agreement with observations from current literature on low-frequency condition studies. SimPEL, as a tool, provides researchers studying low-frequency conditions with an intuitive and highly flexible avenue for statistical power estimation. The platform-independent "batteries included" executable and default input files are available at https://github.com/precisionomics/SimPEL.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Mak L,Li M,Cao C,Gordon P,Tarailo-Graovac M,Bousman C,Wang P,Long Q

doi

10.1002/gepi.22129

subject

Has Abstract

pub_date

2018-07-01 00:00:00

pages

480-487

issue

5

eissn

1098-2272

issn

0741-0395

journal_volume

42

pub_type

Journal Article
  • Rank-based robust tests for quantitative-trait genetic association studies.

    abstract::Standard linear regression is commonly used for genetic association studies of quantitative traits. This approach may not be appropriate if the trait, on its original or transformed scales, does not follow a normal distribution. A rank-based nonparametric approach that does not rely on any distributional assumptions c...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21723

    authors: Li Q,Li Z,Zheng G,Gao G,Yu K

    update_date:2013-05-01 00:00:00

  • Classifying disease chromosomes arising from multiple founders, with application to fine-scale haplotype mapping.

    abstract::The availability of high-density haplotype data has motivated several fine-scale linkage disequilibrium mapping methods for locating disease-causing mutations. These methods identify loci around which haplotypes of case chromosomes exhibit greater similarity than do those of control chromosomes. A difficulty arising i...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20016

    authors: Yu K,Martin RB,Whittemore AS

    update_date:2004-11-01 00:00:00

  • Testing untyped alleles (TUNA)-applications to genome-wide association studies.

    abstract::The large number of tests performed in analyzing data from genome-wide association studies has a large impact on the power of detecting risk variants, and analytic strategies specifying the optimal set of hypotheses to be tested are necessary. We propose a genome-wide strategy that is based on one degree of freedom te...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20182

    authors: Nicolae DL

    update_date:2006-12-01 00:00:00

  • Two common polymorphisms in the APO A-IV coding gene: their evolution and linkage disequilibrium.

    abstract::Human apolipoprotein A-IV (APO A-IV) exhibits a common protein polymorphism detectable by isoelectric focusing (IEF) due to a single base substitution at codon 360 which replaces the frequently occurring glutamine residue (allele 1) with histidine (allele 2). Recently, sequence analysis of the APO A-IV coding region h...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370090503

    authors: Kamboh MI,Hamman RF,Ferrell RE

    update_date:1992-01-01 00:00:00

  • Segregation analysis of juvenile myoclonic epilepsy.

    abstract::We examined the inheritance of juvenile myoclonic epilepsy (JME). We looked at both the trait of "epilepsy" and the trait of "epilepsy-plus-EEG abnormalities," since EEG abnormalities are frequently found in the clinically unaffected sibs of JME patients. We tested several modes of inheritance including the fully pene...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370050204

    authors: Greenberg DA,Delgado-Escueta AV,Maldonado HM,Widelitz H

    update_date:1988-01-01 00:00:00

  • Segregation analysis of autosomal dominant polycystic kidney disease.

    abstract::The results of classical segregation analysis on 159 families with polycystic kidney disease (PKD) are presented. It had been previously estimated that about 95% of autosomal dominant PKD (ADPKD) families have PKD1, the gene localized to chromosome 16p. The main purpose of the study was to determine if PKD shows any s...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370100305

    authors: Dobin A,Kimberling WJ,Pettinger W,Bailey-Wilson JE,Shugart YY,Gabow P

    update_date:1993-01-01 00:00:00

  • Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk.

    abstract::Linkage analysis of complex traits has had limited success in identifying trait-influencing loci. Recently, coding variants have been implicated as the basis for some biomedical associations. We tested whether coding variants are the basis for linkage peaks of complex traits in 42 African-American (n = 596) and 90 His...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21801

    authors: Hellwege JN,Palmer ND,Raffield LM,Ng MC,Hawkins GA,Long J,Lorenzo C,Norris JM,Ida Chen YD,Speliotes EK,Rotter JI,Langefeld CD,Wagenknecht LE,Bowden DW

    update_date:2014-05-01 00:00:00

  • The role of environmental heterogeneity in meta-analysis of gene-environment interactions with quantitative traits.

    abstract::With challenges in data harmonization and environmental heterogeneity across various data sources, meta-analysis of gene-environment interaction studies can often involve subtle statistical issues. In this paper, we study the effect of environmental covariate heterogeneity (within and between cohorts) on two approache...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21810

    authors: Li S,Mukherjee B,Taylor JM,Rice KM,Wen X,Rice JD,Stringham HM,Boehnke M

    update_date:2014-07-01 00:00:00

  • Adjustment for competing risk in kin-cohort estimation.

    abstract::Kin-cohort design can be used to study the effect of a genetic mutation on the risk of multiple events, using the same study. In this design, the outcome data consist of the event history of the relatives of a sample of genotyped subjects. Existing methods for kin-cohort estimation allow estimation of the risk of one ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10269

    authors: Chatterjee N,Hartge P,Wacholder S

    update_date:2003-12-01 00:00:00

  • Direct genetic effects and their estimation from matched case-control data.

    abstract::In genetic association studies, a single marker is often associated with multiple, correlated phenotypes (e.g., obesity and cardiovascular disease, or nicotine dependence and lung cancer). A pervasive question is then whether that marker exerts independent effects on all phenotypes. In this paper, we address this ques...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21660

    authors: Berzuini C,Vansteelandt S,Foco L,Pastorino R,Bernardinelli L

    update_date:2012-09-01 00:00:00

  • On the association analysis of genome-sequencing data: A spatial clustering approach for partitioning the entire genome into nonoverlapping windows.

    abstract::For the association analysis of whole-genome sequencing (WGS) studies, we propose an efficient and fast spatial-clustering algorithm. Compared to existing analysis approaches for WGS data, that define the tested regions either by sliding or consecutive windows of fixed sizes along variants, a meaningful grouping of ne...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22040

    authors: Loehlein Fier H,Prokopenko D,Hecker J,Cho MH,Silverman EK,Weiss ST,Tanzi RE,Lange C

    update_date:2017-05-01 00:00:00

  • Genetic heterogeneity in Alzheimer's disease: a grade of membership analysis.

    abstract::Grade of membership analysis (GoM) may have particular relevance for genetic epidemiology. The method can flexibly relate genetic markers, clinical features, and environmental exposures to possible subtypes of disease termed pure types even when population allele frequencies and penetrance functions are not known. Hen...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370100628

    authors: Corder EH,Woodbury MA

    update_date:1993-01-01 00:00:00

  • The recurrence risks for isolated cases with incompletely penetrant X-linked conditions.

    abstract::The recurrence risks for an X-linked disease with incomplete penetrance are evaluated for a sib given that an isolated proband (male or female) is affected. The derived formulae are applied to the X-linked form of Alport and fragile X syndromes. ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370030508

    authors: Rogatko A

    update_date:1986-01-01 00:00:00

  • Constructing meiotic maps with known error probability.

    abstract::We propose methods to construct meiotic gene maps while controlling the probability of a decision-error. First, a single step gene ordering procedure is presented whose decision-error probability is bounded above by a prespecified threshold. The bound for the error probability is valid under quite general circumstance...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1999)16:3<274::AID-GEPI4>3

    authors: Rogatko A,Babb J,Jordan H,Zacks S

    update_date:1999-01-01 00:00:00

  • Power and sample size calculations for SNP association studies with censored time-to-event outcomes.

    abstract::For many clinical studies in cancer, germline DNA is prospectively collected for the purpose of discovering or validating single-nucleotide polymorphisms (SNPs) associated with clinical outcomes. The primary clinical endpoint for many of these studies are time-to-event outcomes such as time of death or disease progres...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21645

    authors: Owzar K,Li Z,Cox N,Jung SH

    update_date:2012-09-01 00:00:00

  • Equivalence of the mixed and regressive models for genetic analysis. I. Continuous traits.

    abstract::The mixed model of segregation analysis specifies major gene effects and partitions the residual variance into polygenic and environmental components. The model explains familial correlations essentially in terms of genetic causation. The regressive model, on the other hand, is constructed by successively conditioning...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370060505

    authors: Demenais FM,Bonney GE

    update_date:1989-01-01 00:00:00

  • Familial aggregation of breast cancer with early onset lung cancer.

    abstract::Site-specific familial aggregation and evidence supporting Mendelian codominant inheritance have been shown in lung cancer. In characterizing lung cancer families, a number of other cancers have been observed. The current study evaluates whether first-degree relatives of early onset lung cancer cases are at increased ...

    journal_title:Genetic epidemiology

    pub_type: Clinical Trial, Journal Article

    doi:10.1002/(SICI)1098-2272(199911)17:4<274::AID-GEPI3

    authors: Schwartz AG,Siegfried JM,Weiss L

    update_date:1999-11-01 00:00:00

  • Biochemical intermediates in alpha 1-antitrypsin deficiency: residual family resemblance for total alpha 1-antitrypsin, oxidized alpha 1-antitrypsin, and immunoglobulin E after adjustment for the effect of the Pi locus.

    abstract::alpha 1-antitrypsin (alpha 1 AT) deficiency is variably associated with the development of pulmonary emphysema. To gain insight into the process which begins with the Z point mutation at the Protease Inhibitor (Pi) locus and results in the variable development of emphysema, three quantitative phenotypes, including total al...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370070204

    authors: Silverman EK,Province MA,Campbell EJ,Pierce JA,Rao DC

    update_date:1990-01-01 00:00:00

  • Linkage disequilibrium structure and its impact on the localization of a candidate functional mutation.

    abstract::We have used the unblinded MG1/Q1 Genetic Analysis Workshop 12 simulated data as a model system for investigating the use of linkage disequilibrium structure and simple genotype-phenotype associations to identify candidate functional mutations within a gene of interest. Analysis of the pattern of pairwise linkage dise...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s620

    authors: Huang Q,Morrison AC,Boerwinkle E

    update_date:2001-01-01 00:00:00

  • Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene × gene patterns.

    abstract::We construct data exploration tools for recognizing important covariate patterns associated with a phenotype, with particular focus on searching for association with gene-gene patterns. To this end, we propose a new variable selection procedure that employs latent selection weights and compare it to an alternative for...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21661

    authors: Papathomas M,Molitor J,Hoggart C,Hastie D,Richardson S

    update_date:2012-09-01 00:00:00

  • Analysis of two-locus traits under heterogeneity for recessive versus dominant inheritance.

    abstract::Complex traits have been modeled under various modes of two-locus inheritance. One example of a two-locus threshold model is the situation where an individual is susceptible to a disease trait if he or she carries three or more disease alleles. Under this model, if each locus is examined individually the inheritance a...

    journal_title:Genetic epidemiology

    pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial

    doi:10.1002/(SICI)1098-2272(1997)14:6<1097::AID-GEPI89

    authors: Leal SM,Ott J

    update_date:1997-01-01 00:00:00

  • Risk factors for atherosclerosis in twins.

    abstract::We performed multivariate genetic analyses of cardiovascular risk factors from two sets of data on US and Australian female twins. Similar models for body mass index (BMI), serum low density (LDL) and high density (HDL) lipoproteins, including age as a covariate, were fitted successfully to both groups. These suggeste...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370100638

    authors: Duffy DL,O'Connell DL,Heller RF,Martin NG

    update_date:1993-01-01 00:00:00

  • Monte Carlo analysis on a large pedigree.

    abstract::Monte Carlo methods for linkage and segregation analysis are applied to the HGAR1 pedigree. To address these data, the methods are extended in several ways. The results are compared with those provided by PAP. ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370100658

    authors: Thompson EA,Lin S,Olshen AB,Wijsman EM

    update_date:1993-01-01 00:00:00

  • Design of artificial neural network and its applications to the analysis of alcoholism data.

    abstract::Artificial neural networks were applied to the alcoholism data to reveal nonlinear relationships between intermediate phenotypes, marker identity-by-descent sharing, and the affection status. A variable number of hidden units were considered to achieve a balance between the minimal mean-squared error and over-fitting ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370170738

    authors: Li W,Haghighi F,Falk CT

    update_date:1999-01-01 00:00:00

  • Transcriptome-wide association study of breast cancer risk by estrogen-receptor status.

    abstract::Previous transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome-wide association studies (GWAS), but analyses of breast cancer subtype-specific associations have been limited. In this study, we conducted a TWAS using gene ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22288

    authors: Feng H,Gusev A,Pasaniuc B,Wu L,Long J,Abu-Full Z,Aittomäki K,Andrulis IL,Anton-Culver H,Antoniou AC,Arason A,Arndt V,Aronson KJ,Arun BK,Asseryanis E,Auer PL,Azzollini J,Balmaña J,Barkardottir RB,Barnes DR,Barrowda

    update_date:2020-07-01 00:00:00

  • Model selection and Bayesian methods in statistical genetics: summary of group 11 contributions to Genetic Analysis Workshop 15.

    abstract::The research presented in group 11 of the Genetic Analysis Workshop 15 (GAW15) falls into two major themes: Model selection approaches for gene mapping (both Bayesian and Frequentist); and other Bayesian methods. These methods either allow relaxation of some of the common assumptions, such as mode of inheritance, for ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article, Review

    doi:10.1002/gepi.20285

    authors: Swartz MD,Thomas DC,Daw EW,Albers K,Charlesworth JC,Dyer TC,Fridley BL,Govil M,Kraft P,Kwon S,Logue MW,Oh C,Pique-Regi R,Saba L,Schumacher FR,Uh HW

    update_date:2007-01-01 00:00:00

  • The inheritance of pyloric stenosis explained by a multifactorial threshold model with sex dimorphism for liability.

    abstract::The inheritance of pyloric stenosis is explained by a multifactorial threshold model with an underlying assumption that the liability for the disease is distributed in males and females showing a sex dimorphism. From the available data on familial occurrences of pyloric stenosis, it is shown, that an extra maternal ef...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370030102

    authors: Chakraborty R

    update_date:1986-01-01 00:00:00

  • Analysis of multiple phenotypes.

    abstract::The complex etiology of common diseases like cardiovascular disease, diabetes, hypertension, and rheumatoid arthritis has led investigators to focus on the genetics of correlated phenotypes and risk factors. Joint analysis of multiple disease-related phenotypes may reveal genes of pleiotropic effect and increase analy...

    journal_title:Genetic epidemiology

    pub_type:

    doi:10.1002/gepi.20470

    authors: Kent JW Jr

    update_date:2009-01-01 00:00:00

  • Tag SNPs chosen from HapMap perform well in several population isolates.

    abstract::Population isolates may be particularly useful for association studies of complex traits. This utility, however, largely depends on the transferability of tag SNPs chosen from reference samples, such as HapMap, to samples from such populations. Factors that characterize population isolates, such as widespread genetic ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20201

    authors: Service S,International Collaborative Group on Isolated Populations.,Sabatti C,Freimer N

    update_date:2007-04-01 00:00:00

  • The power of iterated generalized least squares (GLS) method to detect direct relationships in the analysis of correlated quantitative traits.

    abstract::We examined the power of the stepwise iterated generalized least squares (GLS) method by modeling the relationship between quantitative traits and other variables using the simulated data for Problem 2A. The comparison between the generating model provided by the workshop and the results of the stepwise iterated GLS m...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:6<797::AID-GEPI39>

    authors: He Q,Nemesure BB,Mendell NR

    update_date:1997-01-01 00:00:00