Genotyping errors, pedigree errors, and missing data.

Abstract:

:Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics of Alcoholism (COGA). The simulated data had no missing genotypes and no genotyping errors, so our group, as a whole, removed data and introduced artificial errors to study the robustness of various techniques. Our teams showed that genotyping errors are less detectable and may have a greater impact on SNPs than on microsatellites, but recently developed methods that account for genotyping errors help reduce false positives, and the assumptions of these methods appear to be supported by observations from repeated genotyping. The ability to detect linkage disequilibrium (LD) was also substantially reduced by missing data; this in turn could affect tagging SNPs chosen to generate haplotypes. In the COGA sample, genotyping measurements were repeated in three ways. First, full-genome screens were performed on three sets of markers: 328 microsatellites, 11,560 SNPs from the Affymetrix GeneChip Mapping 10 K Array marker set, and 4,720 SNPs from the Illumina Linkage III panel. Second, the entire Affymetrix marker set was typed on the same 184 individuals by two different laboratories. Finally, the Affymetrix and Illumina marker panels had 94 SNPs in common. Our teams showed that both SNPs and microsatellites can be readily used to identify pedigree errors, and that SNPs have fewer genotyping errors and a low inconsistency rate. However, a fairly high rate of no-calls, especially for the Affymetrix platform, suggests that the inconsistency rate may be higher than observed.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Hinrichs AL,Suarez BK

doi

10.1002/gepi.20120

subject

Has Abstract

pub_date

2005-01-01 00:00:00

pages

S120-4

eissn

0741-0395

issn

1098-2272

journal_volume

29 Suppl 1

pub_type

杂志文章
  • The impact of improved microarray coverage and larger sample sizes on future genome-wide association studies.

    abstract::Genome-wide association studies (GWAS) have identified many single nucleotide polymorphisms (SNPs) associated with complex traits. However, the genetic heritability of most of these traits remains unexplained. To help guide future studies, we address the crucial question of whether future GWAS can detect new SNP assoc...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21724

    authors: Lindquist KJ,Jorgenson E,Hoffmann TJ,Witte JS

    更新日期:2013-05-01 00:00:00

  • Review of proteomics with applications to genetic epidemiology.

    abstract::Mapping of the human genome has the potential to transform the traditional methods of genetic epidemiology. The complete draft sequence of the 3.3 billion nucleotides comprising the genome is now available over the Internet, including the location and nearly complete sequence of the 26,000 to 31,000 protein-encoding g...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,评审

    doi:10.1002/gepi.10226

    authors: Sellers TA,Yates JR

    更新日期:2003-02-01 00:00:00

  • Risk factors for atherosclerosis in twins.

    abstract::We performed multivariate genetic analyses of cardiovascular risk factors from two sets of data on US and Australian female twins. Similar models for body mass index (BMI), serum low density (LDL) and high density (HDL) lipoproteins, including age as a covariate, were fitted successfully to both groups. These suggeste...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100638

    authors: Duffy DL,O'Connell DL,Heller RF,Martin NG

    更新日期:1993-01-01 00:00:00

  • Adaptive testing for association between two random vectors in moderate to high dimensions.

    abstract::Testing for association between two random vectors is a common and important task in many fields, however, existing tests, such as Escoufier's RV test, are suitable only for low-dimensional data, not for high-dimensional data. In moderate to high dimensions, it is necessary to consider sparse signals, which are often ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22059

    authors: Xu Z,Xu G,Pan W,Alzheimer's Disease Neuroimaging Initiative.

    更新日期:2017-11-01 00:00:00

  • Multifactorial disease risk calculator: Risk prediction for multifactorial disease pedigrees.

    abstract::Construction of multifactorial disease models from epidemiological findings and their application to disease pedigrees for risk prediction is nontrivial for all but the simplest of cases. Multifactorial Disease Risk Calculator is a web tool facilitating this. It provides a user-friendly interface, extending a reported...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22101

    authors: Campbell DD,Li Y,Sham PC

    更新日期:2018-03-01 00:00:00

  • Pedigree disequilibrium tests for multilocus haplotypes.

    abstract::Association tests of multilocus haplotypes are of interest both in linkage disequilibrium mapping and in candidate gene studies. For case-parent trios, I discuss the extension of existing multilocus methods to include ambiguous haplotypes in tests of models which distinguish between the cis and trans phase. A likeliho...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.10252

    authors: Dudbridge F

    更新日期:2003-09-01 00:00:00

  • Testing untyped alleles (TUNA)-applications to genome-wide association studies.

    abstract::The large number of tests performed in analyzing data from genome-wide association studies has a large impact on the power of detecting risk variants, and analytic strategies specifying the optimal set of hypotheses to be tested are necessary. We propose a genome-wide strategy that is based on one degree of freedom te...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20182

    authors: Nicolae DL

    更新日期:2006-12-01 00:00:00

  • Increasing the power of identifying gene x gene interactions in genome-wide association studies.

    abstract::In this paper we investigate the power to identify gene x gene interactions in genome-wide association studies. In our analysis we focus on two-stage analyses: analyses in which we only test for interactions between single nucleotide polymorphisms that show some marginal effect. We give two algorithms to compute signi...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20300

    authors: Kooperberg C,Leblanc M

    更新日期:2008-04-01 00:00:00

  • New simple tests for age-at-onset anticipation: application to panic disorder.

    abstract::Recently, testing for anticipation has received renewed interest. It is well known that standard statistical methods are inappropriate for this purpose due to problems of sampling bias. Few statistical tests have been proposed for comparing mean age of onset in affected parents with mean age of onset in affected child...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20057

    authors: Tsai WY,Heiman GA,Hodge SE

    更新日期:2005-04-01 00:00:00

  • Efficient computation of patterned covariance matrix mixed models in quantitative segregation analysis.

    abstract::The use of patterned covariance matrices in forming pedigree-based mixed models for quantitative traits is discussed. It is suggested that patterned covariance matrix models provide intuitive, theoretically appealing, and flexible genetic modeling devices for pedigree data. It is suggested further that the very great ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370080104

    authors: Schork N

    更新日期:1991-01-01 00:00:00

  • Meta-analysis by combining p-values: simulated linkage studies.

    abstract::Meta-analysis has been little explored to make an overall assessment of linkage from different studies. In practice, it is likely that published linkage studies will only report p-values. We compared the performance of the widely used Fisher method for combining p-values with that of pooling raw data. More loci were c...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,meta分析

    doi:10.1002/gepi.1370170798

    authors: Guerra R,Etzel CJ,Goldstein DR,Sain SR

    更新日期:1999-01-01 00:00:00

  • Adjustment for competing risk in kin-cohort estimation.

    abstract::Kin-cohort design can be used to study the effect of a genetic mutation on the risk of multiple events, using the same study. In this design, the outcome data consist of the event history of the relatives of a sample of genotyped subjects. Existing methods for kin-cohort estimation allow estimation of the risk of one ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.10269

    authors: Chatterjee N,Hartge P,Wacholder S

    更新日期:2003-12-01 00:00:00

  • Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17.

    abstract::As part of Genetic Analysis Workshop 17 (GAW17), our group considered the application of novel and standard approaches to the analysis of genotype-phenotype association in next-generation sequencing data. Our group identified a major issue in the analysis of the GAW17 next-generation sequencing data: type I error and ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20650

    authors: Tintle N,Aschard H,Hu I,Nock N,Wang H,Pugh E

    更新日期:2011-01-01 00:00:00

  • Model selection and Bayesian methods in statistical genetics: summary of group 11 contributions to Genetic Analysis Workshop 15.

    abstract::The research presented in group 11 of the Genetic Analysis Workshop 15 (GAW15) falls into two major themes: Model selection approaches for gene mapping (both Bayesian and Frequentist); and other Bayesian methods. These methods either allow relaxation of some of the common assumptions, such as mode of inheritance, for ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,评审

    doi:10.1002/gepi.20285

    authors: Swartz MD,Thomas DC,Daw EW,Albers K,Charlesworth JC,Dyer TC,Fridley BL,Govil M,Kraft P,Kwon S,Logue MW,Oh C,Pique-Regi R,Saba L,Schumacher FR,Uh HW

    更新日期:2007-01-01 00:00:00

  • Evaluation of path analysis through computer simulation: effect of incorrectly assuming independent distribution of familial correlations.

    abstract::Path analysis of family data has been widely applied to resolve genetic and environmental patterns of familial resemblance. A prevalent statistical approach in path analysis has been, first, to estimate the familial correlations and, second, by assuming these estimates to be independently distributed, define a likelih...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370010305

    authors: McGue M,Wette R,Rao DC

    更新日期:1984-01-01 00:00:00

  • Regressive logistic and proportional hazards disease models for within-family analyses of measured genotypes, with application to a CYP17 polymorphism and breast cancer.

    abstract::Various statistical methods have been proposed to evaluate associations between measured genetic variants and disease, including some using family designs. For breast cancer and rare variants, we applied a modified segregation analysis method that uses the population cancer incidence and population-based case families...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.10222

    authors: Cui JS,Spurdle AB,Southey MC,Dite GS,Venter DJ,McCredie MR,Giles GG,Chenevix-Trench G,Hopper JL

    更新日期:2003-04-01 00:00:00

  • Modelling the major histocompatibility complex susceptibility to RA using the MASC method.

    abstract::To explain the association between HLA-DRB1 gene and rheumatoid arthritis (RA), two main hypotheses have been proposed. The first, the shared epitope hypothesis, assumes a direct role of DRB1 in RA susceptibility. The second hypothesis assumes a recessive disease susceptibility gene in linkage disequilibrium with DRB1...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1998)15:4<419::AID-GEPI7>3

    authors: Génin E,Babron MC,McDermott MF,Mulcahy B,Waldron-Lynch F,Adams C,Clegg DO,Ward RH,Shanahan F,Molloy MG,O'Gara F,Clerget-Darpoux F

    更新日期:1998-01-01 00:00:00

  • A multimarker regression-based test of linkage for affected sib-pairs at two linked loci.

    abstract::We address the analytical problem of evaluating the evidence for linkage at a test locus while taking into account the effect of a known linked disease locus. The method we propose is a multimarker regression approach that models the identity-by-descent states for affected sib-pairs at a series of linked markers in te...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20137

    authors: Barber MJ,Todd JA,Cordell HJ

    更新日期:2006-04-01 00:00:00

  • How can maximum likelihood methods reveal candidate gene effects on a quantitative trait?

    abstract::Different maximum likelihood approaches were used to explore the role of candidate genes in the variability of quantitative trait Q1 while accounting for the effects of age, Q2, and Q3. Segregation analysis, under the class D regressive model, provides evidence for a Mendelian gene effect on the adjusted trait Q1. Res...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370120643

    authors: Martinez M,Abel L,Demenais F

    更新日期:1995-01-01 00:00:00

  • Genetic epidemiology of cleft lip with or without cleft palate in the population of Hawaii.

    abstract::Orientals consisting of Japanese, Chinese, Koreans, and Filipinos are clearly at higher risk for cleft lip with or without cleft palate [CL(P)] than whites, Puerto Ricans, and Hawaiians/part-Hawaiians in Hawaii. Using the model of diallele cross, CL(P) incidences in incrosses and outcrosses involving 564,002 live birt...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370040603

    authors: Chung CS,Mi MP,Beechert AM

    更新日期:1987-01-01 00:00:00

  • Variance component models for X-linked QTLs.

    abstract::This paper discusses the theory and implementation of a model for mapping X-linked quantitative trait loci (QTL). As a result of X inactivation, a female's body is subdivided into a number of patches. In each patch one of her two X chromosomes is randomly switched off. This smooths the allelic contributions in a heter...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20158

    authors: Lange K,Sobel E

    更新日期:2006-07-01 00:00:00

  • A likelihood ratio-based Mann-Whitney approach finds novel replicable joint gene action for type 2 diabetes.

    abstract::The potential importance of the joint action of genes, whether modeled with or without a statistical interaction term, has long been recognized. However, identifying such action has been a great challenge, especially when millions of genetic markers are involved. We propose a likelihood ratio-based Mann-Whitney test t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21651

    authors: Lu Q,Wei C,Ye C,Li M,Elston RC

    更新日期:2012-09-01 00:00:00

  • An efficient study design to test parent-of-origin effects in family trios.

    abstract::Increasing evidence has shown that genes may cause prenatal, neonatal, and pediatric diseases depending on their parental origins. Statistical models that incorporate parent-of-origin effects (POEs) can improve the power of detecting disease-associated genes and help explain the missing heritability of diseases. In ma...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22060

    authors: Yu X,Chen G,Feng R

    更新日期:2017-11-01 00:00:00

  • To type or not to type: the use of unaffected siblings in nonparametric linkage analysis.

    abstract::Unaffected individuals are often disregarded in nonparametric linkage analysis. Because of the presumed high complexity of genetic interactions and the resulting low penetrance of any single genetic effect, the statistical contribution of unaffected sib pairs is thought to be considerably lower than that of the affect...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s522

    authors: Majewski J

    更新日期:2001-01-01 00:00:00

  • Biochemical intermediates in alpha 1-antitrypsin deficiency: residual family resemblance for total alpha 1-antitrypsin, oxidized alpha 1-antitrypsin, and immunoglobulin E after adjustment for the effect of the Pi locus.

    abstract::alpha 1-antitrypsin (alpha 1 AT) deficiency is variably associated with the development of pulmonary emphysema. To gain insight into the process which begins the Z point mutation at the Protease Inhibitor (Pi) locus and results in the variable development of emphysema, three quantitative phenotypes, including total al...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370070204

    authors: Silverman EK,Province MA,Campbell EJ,Pierce JA,Rao DC

    更新日期:1990-01-01 00:00:00

  • Association and linkage analysis of ICD-10 diagnosis for alcoholism.

    abstract::We analyzed the GAW11 data on alcoholism provided by the Collaborative Study on the Genetics of Alcoholism (COGA) using an extension of a new test of linkage and association for quantitative traits developed by George et al. [1999]. This method determines linkage between marker loci and quantitative traits, when allel...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170758

    authors: Tiwari HK,Zhu X,Elston RC,Shu Y,George V

    更新日期:1999-01-01 00:00:00

  • Optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group.

    abstract::Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousan...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20482

    authors: Zhuang JJ,Zondervan K,Nyberg F,Harbron C,Jawaid A,Cardon LR,Barratt BJ,Morris AP

    更新日期:2010-05-01 00:00:00

  • Design of artificial neural network and its applications to the analysis of alcoholism data.

    abstract::Artificial neural networks were applied to the alcoholism data to reveal nonlinear relationships between intermediate phenotypes, marker identity-by-descent sharing, and the affection status. A variable number of hidden units were considered to achieve a balance between the minimal mean-squared error and over-fitting ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170738

    authors: Li W,Haghighi F,Falk CT

    更新日期:1999-01-01 00:00:00

  • Genetic heterogeneity in Alzheimer's disease: a grade of membership analysis.

    abstract::Grade of membership analysis (GoM) may have particular relevance for genetic epidemiology. The method can flexibly relate genetic markers, clinical features, and environmental exposures to possible subtypes of disease termed pure types even when population allele frequencies and penetrance functions are not known. Hen...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100628

    authors: Corder EH,Woodbury MA

    更新日期:1993-01-01 00:00:00

  • Equivalence of the mixed and regressive models for genetic analysis. I. Continuous traits.

    abstract::The mixed model of segregation analysis specifies major gene effects and partitions the residual variance into polygenic and environmental components. The model explains familial correlations essentially in terms of genetic causation. The regressive model, on the other hand, is constructed by successively conditioning...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060505

    authors: Demenais FM,Bonney GE

    更新日期:1989-01-01 00:00:00