Data mining and computationally intensive methods: summary of Group 7 contributions to Genetic Analysis Workshop 13.

Abstract:

:The Framingham Heart Study data, as well as a related simulated data set, were generously provided to the participants of the Genetic Analysis Workshop 13 in order that newly developed and emerging statistical methodologies could be tested on that well-characterized data set. The impetus driving the development of novel methods is to elucidate the contributions of genes, environment, and interactions between and among them, as well as to allow comparison between and validation of methods. The seven papers that comprise this group used data-mining methodologies (tree-based methods, neural networks, discriminant analysis, and Bayesian variable selection) in an attempt to identify the underlying genetics of cardiovascular disease and related traits in the presence of environmental and genetic covariates. Data-mining strategies are gaining popularity because they are extremely flexible and may have greater efficiency and potential in identifying the factors involved in complex disorders. While the methods grouped together here constitute a diverse collection, some papers asked similar questions with very different methods, while others used the same underlying methodology to ask very different questions. This paper briefly describes the data-mining methodologies applied to the Genetic Analysis Workshop 13 data sets and the results of those investigations.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Costello TJ,Falk CT,Ye KQ

doi

10.1002/gepi.10285

subject

Has Abstract

pub_date

2003-01-01 00:00:00

pages

S57-63

eissn

0741-0395

issn

1098-2272

journal_volume

25 Suppl 1

pub_type

杂志文章
  • A flexible and parallelizable approach to genome-wide polygenic risk scores.

    abstract::The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22245

    authors: Newcombe PJ,Nelson CP,Samani NJ,Dudbridge F

    更新日期:2019-10-01 00:00:00

  • Joint analysis of multiple phenotypes using a clustering linear combination method based on hierarchical clustering.

    abstract::Emerging evidence suggests that a genetic variant can affect multiple phenotypes, especially in complex human diseases. Therefore, joint analysis of multiple phenotypes may offer new insights into disease etiology. Recently, many statistical methods have been developed for joint analysis of multiple phenotypes, includ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22263

    authors: Li X,Zhang S,Sha Q

    更新日期:2020-01-01 00:00:00

  • Improving estimates of genetic maps: a meta-analysis-based approach.

    abstract::Inaccurate genetic (or linkage) maps can reduce the power to detect linkage, increase type I error, and distort haplotype and relationship inference. To improve the accuracy of existing maps, I propose a meta-analysis-based method that combines independent map estimates into a single estimate of the linkage map. The m...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20221

    authors: Stewart WC

    更新日期:2007-07-01 00:00:00

  • A hybrid design: case-parent triads supplemented by control-mother dyads.

    abstract::Hybrid designs arose from an effort to combine the benefits of family-based and population-based study designs. A recently proposed hybrid approach augments case-parent triads with population-based control-parent triads, genotyping everyone except the control offspring. Including parents of controls substantially impr...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20365

    authors: Vermeulen SH,Shi M,Weinberg CR,Umbach DM

    更新日期:2009-02-01 00:00:00

  • Identifying genetic interactions in genome-wide data using Bayesian networks.

    abstract::It is believed that interactions among genes (epistasis) may play an important role in susceptibility to common diseases (Moore and Williams [2002]. Ann Med 34:88-95; Ritchie et al. [2001]. Am J Hum Genet 69:138-147). To study the underlying genetic variants of diseases, genome-wide association studies (GWAS) that sim...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20514

    authors: Jiang X,Barmada MM,Visweswaran S

    更新日期:2010-09-01 00:00:00

  • Genetic epidemiology of cleft lip with or without cleft palate in the population of Hawaii.

    abstract::Orientals consisting of Japanese, Chinese, Koreans, and Filipinos are clearly at higher risk for cleft lip with or without cleft palate [CL(P)] than whites, Puerto Ricans, and Hawaiians/part-Hawaiians in Hawaii. Using the model of diallele cross, CL(P) incidences in incrosses and outcrosses involving 564,002 live birt...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370040603

    authors: Chung CS,Mi MP,Beechert AM

    更新日期:1987-01-01 00:00:00

  • Familial aggregation of breast cancer with early onset lung cancer.

    abstract::Site-specific familial aggregation and evidence supporting Mendelian codominant inheritance have been shown in lung cancer. In characterizing lung cancer families, a number of other cancers have been observed. The current study evaluates whether first-degree relatives of early onset lung cancer cases are at increased ...

    journal_title:Genetic epidemiology

    pub_type: 临床试验,杂志文章

    doi:10.1002/(SICI)1098-2272(199911)17:4<274::AID-GEPI3

    authors: Schwartz AG,Siegfried JM,Weiss L

    更新日期:1999-11-01 00:00:00

  • Familial analysis of eosinophilia caused by helminthic parasites.

    abstract::A highly significant familial aggregation of eosinophil levels (X2(3) = 38.00) was detected in a sample from three Brazilian populations with a high incidence of helminthic parasitism. The data were unable to resolve genetic or common environment causation due to the lack of environmental concomitant variables. Result...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370090305

    authors: Moro-Furlani AM,Krieger H

    更新日期:1992-01-01 00:00:00

  • Genetic analysis of IDDM: summary of GAW5 IDDM results.

    abstract::This paper summarizes the analyses by participants in the insulin-dependent diabetes mellitus (IDDM) component of Genetic Analysis Workshop 5 (GAW5). The data were obtained from 94 families with two or more IDDM sibs. Topics treated in the Workshop analysis included the following: methods for detecting associations an...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,评审

    doi:10.1002/gepi.1370060111

    authors: Spielman RS,Baur MP,Clerget-Darpoux F

    更新日期:1989-01-01 00:00:00

  • Power of the linkage test for a heterogeneous disorder due to two independent inherited causes: a simulation study.

    abstract::We have conducted a simulation study in small pedigrees to investigate the power to detect linkage and heterogeneity for a disorder due to either one of two independent disease loci. We have considered a highly polymorphic marker locus (PIC = 70%) linked to one disease locus and unlinked to the second. The power to de...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370070306

    authors: Martinez M,Goldin LR

    更新日期:1990-01-01 00:00:00

  • Relationship between body mass index, cigarette smoking, and plasma sex steroids in normal male twins.

    abstract::Smoking has been observed to affect plasma sex hormones and body mass index. The relationship between smoking, body mass index, and plasma concentration of sex hormones was studied in normal adult male twins. The analyses were performed for between 150 and 159 twin pairs for whom hormonal data were available on both t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060303

    authors: Meikle AW,Bishop DT,Stringham JD,Ford MH,West DW

    更新日期:1989-01-01 00:00:00

  • Multiethnic polygenic risk scores improve risk prediction in diverse populations.

    abstract::Methods for genetic risk prediction have been widely investigated in recent years. However, most available training data involves European samples, and it is currently unclear how to accurately predict disease risk in other populations. Previous studies have used either training data from European samples in large sam...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22083

    authors: Márquez-Luna C,Loh PR,South Asian Type 2 Diabetes (SAT2D) Consortium.,SIGMA Type 2 Diabetes Consortium.,Price AL

    更新日期:2017-12-01 00:00:00

  • Relevance of the genes for bone mass variation to susceptibility to osteoporotic fractures and its implications to gene search for complex human diseases.

    abstract::We investigate the relevance of the genetic determination of bone mineral density (BMD) variation to that of differential risk to osteoporotic fractures (OF). The high heritability (h(2)) of BMD and the significant phenotypic correlations between high BMD and low risk to OF are well known. Little is reported on h(2) f...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1040

    authors: Deng HW,Mahaney MC,Williams JT,Li J,Conway T,Davies KM,Li JL,Deng H,Recker RR

    更新日期:2002-01-01 00:00:00

  • Propensity score-based nonparametric test revealing genetic variants underlying bipolar disorder.

    abstract::Association analysis has led to the identification of many genetic variants for complex diseases. While assessing the association between genes and a disease, other factors can play an important role. The consequence of not considering covariates (such as population stratification and environmental factors) is well-do...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20558

    authors: Jiang Y,Zhang H

    更新日期:2011-02-01 00:00:00

  • Robustness of the unified model to shared environmental effects in the analysis of dichotomous traits.

    abstract::Simulation studies were conducted to assess to what extent the conclusions of segregation analysis, performed under the unified model, can be affected by the presence of unmeasured environmental factors shared by family members. Dichotomous data were generated on six-member nuclear families under two variants of the m...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060140

    authors: Demenais F,Abel L

    更新日期:1989-01-01 00:00:00

  • Linkage analysis in alcohol dependence.

    abstract::Alcohol dependence often is a familial disorder and has a genetic component. Research in causative factors of alcoholism is coordinated by a multi-center program, COGA [The Collaborative Study on the Genetics of Alcoholism, Begleiter et al., 1995]. We analyzed a subset of the COGA family sample, 84 pedigrees of Caucas...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170768

    authors: Windemuth C,Hahn A,Strauch K,Baur MP,Wienker TF

    更新日期:1999-01-01 00:00:00

  • A Bayesian integrative genomic model for pathway analysis of complex traits.

    abstract::With new technologies, multiple types of genomic data are commonly collected on a single set of samples. However, standard analysis methods concentrate on a single data type at a time and ignore the relationships between genes, proteins, and biochemical reactions that give rise to complex phenotypes. In this paper, we...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21628

    authors: Fridley BL,Lund S,Jenkins GD,Wang L

    更新日期:2012-05-01 00:00:00

  • Linkage analysis of Alzheimer's disease with methods using relative pairs.

    abstract::Four relative-pair methods for detecting genetic linkage were applied to familial Alzheimer's disease data. Results obtained using an extended Haseman-Elston test and a weighted rank pairwise correlation test, which both use information from all relative pairs, were consistent with previously published likelihood resu...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100608

    authors: Blossey H,Commenges D,Olson JM

    更新日期:1993-01-01 00:00:00

  • Lifestyle and blood pressure levels in male twins in Utah.

    abstract::Healthy male monozygotic (MZ) and dizygotic (DZ) twin pairs (MZ pairs = 77; DZ pairs = 88) were studied to assess the effect of dietary intake, physical activity, physical fitness, body mass index (BMI), sum of the triceps and subscapular skinfold measurements, alcohol and caffeine consumption, and smoking patterns on...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370050409

    authors: Slattery ML,Bishop DT,French TK,Hunt SC,Meikle AW,Williams RR

    更新日期:1988-01-01 00:00:00

  • Affected relative pairs and simultaneous search for two-locus linkage in the presence of epistasis.

    abstract::It is commonly believed that multiple interacting genes increase the susceptibility of genetically complex diseases, yet few linkage analyses of human diseases scan for more than one locus at a time. To overcome some of the statistical and computational limitations of a simultaneous search for two disease susceptibili...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20223

    authors: Schaid DJ,McDonnell SK,Carlson EE,Thibodeau SN,Ostrander EA,Stanford JL

    更新日期:2007-07-01 00:00:00

  • Trends in prenatal diagnosis of Down syndrome and other autosomal trisomies in Scotland 1990 to 1994, with associated cytogenetic and epidemiological findings.

    abstract::The present report summarizes findings on 670 cases of autosomal trisomy diagnosed in Scotland, with actual or expected dates of delivery in 1990 to 1994 inclusive. Cases were notified by cytogenetic service laboratories. There were 277 prenatal and 369 postnatal diagnoses and 24 spontaneous losses. Excluding the latt...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1999)16:2<179::AID-GEPI5>3

    authors: Carothers AD,Boyd E,Lowther G,Ellis PM,Couzin DA,Faed MJ,Robb A

    更新日期:1999-01-01 00:00:00

  • Genetic analysis of a complex disease in the presence of an environmental risk factor.

    abstract::The role of a gene in a disease may be hidden by the presence of another risk factor such as an environmental factor. In that case, stratifying the data according to this factor strengthens power to detect linkage or association. We followed this strategy on the simulated data provided by GAW11. The transmission/diseq...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170788

    authors: Eichenbaum-Voline S,Baur MP,Knapp M

    更新日期:1999-01-01 00:00:00

  • Monte Carlo analysis on a large pedigree.

    abstract::Monte Carlo methods for linkage and segregation analysis are applied to the HGAR1 pedigree. To address these data, the methods are extended in several ways. The results are compared with those provided by PAP. ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100658

    authors: Thompson EA,Lin S,Olshen AB,Wijsman EM

    更新日期:1993-01-01 00:00:00

  • Meta-Analysis of Rare Variant Association Tests in Multiethnic Populations.

    abstract::Several methods have been proposed to increase power in rare variant association testing by aggregating information from individual rare variants (MAF < 0.005). However, how to best combine rare variants across multiple ethnicities and the relative performance of designs using different ethnic sampling fractions remai...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21939

    authors: Mensah-Ablorh A,Lindstrom S,Haiman CA,Henderson BE,Marchand LL,Lee S,Stram DO,Eliassen AH,Price A,Kraft P

    更新日期:2016-01-01 00:00:00

  • Modeling the HLA component in rheumatoid arthritis: sensitivity to DRB1 allele frequencies.

    abstract::Rheumatoid arthritis is an inflammatory disease for which positive associations have been described with some HLA-DRB1 alleles. The associated alleles share a similar amino acid sequence in the third hypervariable region, the shared epitope, but differ at position 71 and 86. It has been suggested that HLA susceptibili...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/1098-2272(200012)19:4<422::AID-GEPI12>3.0.

    authors: Tézenas du Montcel S,Reviron D,Genin E,Roudier J,Mercier P,Clerget-Darpoux F

    更新日期:2000-12-01 00:00:00

  • Comparison of two linkage inference procedures for genes related to the P300 component of the event related potential.

    abstract::Our goal was to detect genes contributing to the P300 component of the event related potential (ERP). We found that all of the ERP traits were highly correlated. Most of them distinguished alcoholics from nonalcoholics. To have one summary variable for the ERP traits, we calculated the first principal component (PRIN1...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170728

    authors: Goldin LR,Chase GA

    更新日期:1999-01-01 00:00:00

  • POLARIS: Polygenic LD-adjusted risk score approach for set-based analysis of GWAS data.

    abstract::Polygenic risk scores (PRSs) are a method to summarize the additive trait variance captured by a set of SNPs, and can increase the power of set-based analyses by leveraging public genome-wide association study (GWAS) datasets. PRS aims to assess the genetic liability to some phenotype on the basis of polygenic risk fo...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22117

    authors: Baker E,Schmidt KM,Sims R,O'Donovan MC,Williams J,Holmans P,Escott-Price V,Consortium WTG

    更新日期:2018-06-01 00:00:00

  • Presidential address: Six open questions to genetic epidemiologists.

    abstract::Given the rapid pace with which genomics and other -omics disciplines are evolving, it is sometimes necessary to shift down a gear to consider more general scientific questions. In this line, in my presidential address I formulate six questions for genetic epidemiologists to ponder on. These cover the areas of reprodu...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22191

    authors: König IR

    更新日期:2019-04-01 00:00:00

  • Pooling data and linkage analysis in the chromosome 5q candidate region for asthma.

    abstract::We investigated a variety of methods for pooling data from eight data sets (n = 5,424 subjects) to validate evidence for linkage of markers in the cytokine cluster on chromosome 5q31-33 to asthma and asthma-associated phenotypes. Chromosome 5 markers were integrated into current genetic linkage and physical maps, and ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,meta分析

    doi:10.1002/gepi.2001.21.s1.s103

    authors: Jacobs KB,Burton PR,Iyengar SK,Elston RC,Palmer LJ

    更新日期:2001-01-01 00:00:00

  • Linkage disequilibrium between DNA markers at the low-density lipoprotein receptor gene.

    abstract::We determined pairwise linkage disequilibria between 12 restriction fragment length polymorphism (RFLP) markers at or near the low-density lipoprotein receptor (LDLR) locus on chromosome 19p13.2-13.1 in 92 unrelated individuals. Of these 12 RFLPs, two were newly identified under a cosmid-based strategy designed to scr...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370070114

    authors: Hegele RA,Plaetke R,Lalouel JM

    更新日期:1990-01-01 00:00:00