Haplotype variation and genotype imputation in African populations.

Abstract:

:Sub-Saharan Africa has been identified as the part of the world with the greatest human genetic diversity. This high level of diversity causes difficulties for genome-wide association (GWA) studies in African populations-for example, by reducing the accuracy of genotype imputation in African populations compared to non-African populations. Here, we investigate haplotype variation and imputation in Africa, using 253 unrelated individuals from 15 Sub-Saharan African populations. We identify the populations that provide the greatest potential for serving as reference panels for imputing genotypes in the remaining groups. Considering reference panels comprising samples of recent African descent in Phase 3 of the HapMap Project, we identify mixtures of reference groups that produce the maximal imputation accuracy in each of the sampled populations. We find that optimal HapMap mixtures and maximal imputation accuracies identified in detailed tests of imputation procedures can instead be predicted by using simple summary statistics that measure relationships between the pattern of genetic variation in a target population and the patterns in potential reference panels. Our results provide an empirical basis for facilitating the selection of reference panels in GWA studies of diverse human populations, especially those of African ancestry.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Huang L,Jakobsson M,Pemberton TJ,Ibrahim M,Nyambo T,Omar S,Pritchard JK,Tishkoff SA,Rosenberg NA

doi

10.1002/gepi.20626

subject

Has Abstract

pub_date

2011-12-01 00:00:00

pages

766-80

issue

8

eissn

0741-0395

issn

1098-2272

journal_volume

35

pub_type

杂志文章
  • Modeling the HLA component in rheumatoid arthritis: sensitivity to DRB1 allele frequencies.

    abstract::Rheumatoid arthritis is an inflammatory disease for which positive associations have been described with some HLA-DRB1 alleles. The associated alleles share a similar amino acid sequence in the third hypervariable region, the shared epitope, but differ at position 71 and 86. It has been suggested that HLA susceptibili...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/1098-2272(200012)19:4<422::AID-GEPI12>3.0.

    authors: Tézenas du Montcel S,Reviron D,Genin E,Roudier J,Mercier P,Clerget-Darpoux F

    更新日期:2000-12-01 00:00:00

  • SNP selection in genome-wide and candidate gene studies via penalized logistic regression.

    abstract::Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true perti...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20543

    authors: Ayers KL,Cordell HJ

    更新日期:2010-12-01 00:00:00

  • Evaluation of methods accounting for population structure with pedigree data and continuous outcomes.

    abstract::Methods to account for population structure (PS) in genome-wide association studies have been well developed in samples of unrelated individuals, but when a sample is composed of families, the task of finding and accounting for PS is not as straight forward. Family-based tests that condition on parental genotypes or t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20590

    authors: Peloso GM,Dupuis J,Lunetta KL

    更新日期:2011-09-01 00:00:00

  • Comparison of the QTDT analysis for IgE in the CSGA data set.

    abstract::Over the past few years at least 13 transmission/disequilibrium test (TDT)-based tests have been developed for quantitative (Q) traits for the assessment of association or linkage in the presence of the other. A total of six of these QTDT methods were used to analyze log10IgE in the Collaborative Study on the Genetics...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s312

    authors: Page GP,Wilcox MA,Occhiuto J,Adak S,Neuberg D,Bajorunaite R,George V

    更新日期:2001-01-01 00:00:00

  • Quantitative allelic test--a fast test for very large association studies.

    abstract::Advances in high throughput technology have enabled the generation of unprecedented amounts of genomic data (e.g., next-generation sequence data, transcriptomics, metabolomics, and proteomics), which promises to unravel the genetic architecture of complex traits. These discoveries may lead to novel therapeutic targets...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21768

    authors: Lee SM,Karrison TG,Cox NJ,Im HK

    更新日期:2013-12-01 00:00:00

  • Maximum-likelihood estimation of haplotype frequencies in nuclear families.

    abstract::The importance of haplotype analysis in the context of association fine mapping of disease genes has grown steadily over the last years. Since experimental methods to determine haplotypes on a large scale are not available, phase has to be inferred statistically. For individual genotype data, several reconstruction te...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.10323

    authors: Becker T,Knapp M

    更新日期:2004-07-01 00:00:00

  • Pleiotropy and principal components of heritability combine to increase power for association analysis.

    abstract::When many correlated traits are measured the potential exists to discover the coordinated control of these traits via genotyped polymorphisms. A common statistical approach to this problem involves assessing the relationship between each phenotype and each single nucleotide polymorphism (SNP) individually (PHN); and t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20257

    authors: Klei L,Luca D,Devlin B,Roeder K

    更新日期:2008-01-01 00:00:00

  • Hierarchical Bayesian model for rare variant association analysis integrating genotype uncertainty in human sequence data.

    abstract::Next-generation sequencing (NGS) has led to the study of rare genetic variants, which possibly explain the missing heritability for complex diseases. Most existing methods for rare variant (RV) association detection do not account for the common presence of sequencing errors in NGS data. The errors can largely affect ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21871

    authors: He L,Pitkäniemi J,Sarin AP,Salomaa V,Sillanpää MJ,Ripatti S

    更新日期:2015-02-01 00:00:00

  • Gene-environment interaction tests for dichotomous traits in trios and sibships.

    abstract::When testing for genetic effects, failure to account for a gene-environment interaction can mask the true association effects of a genetic marker with disease. Family-based association tests are popular because they are completely robust to population substructure and model misspecification. However, when testing for ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20421

    authors: Hoffmann TJ,Lange C,Vansteelandt S,Laird NM

    更新日期:2009-12-01 00:00:00

  • Design of artificial neural network and its applications to the analysis of alcoholism data.

    abstract::Artificial neural networks were applied to the alcoholism data to reveal nonlinear relationships between intermediate phenotypes, marker identity-by-descent sharing, and the affection status. A variable number of hidden units were considered to achieve a balance between the minimal mean-squared error and over-fitting ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170738

    authors: Li W,Haghighi F,Falk CT

    更新日期:1999-01-01 00:00:00

  • Exploiting pleiotropy to map genes for oligogenic phenotypes using extended pedigree data.

    abstract::We investigated the utility of two approaches for exploiting pleiotropy to search for genes influencing related traits. To do this we first assessed the genetic correlations among a set of five closely related quantitative traits (Q1, Q2, Q3, Q4, Q5). We then used the genetic correlations among these five traits both ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1997)14:6<975::AID-GEPI69>

    authors: Comuzzie AG,Mahaney MC,Almasy L,Dyer TD,Blangero J

    更新日期:1997-01-01 00:00:00

  • The role of environmental heterogeneity in meta-analysis of gene-environment interactions with quantitative traits.

    abstract::With challenges in data harmonization and environmental heterogeneity across various data sources, meta-analysis of gene-environment interaction studies can often involve subtle statistical issues. In this paper, we study the effect of environmental covariate heterogeneity (within and between cohorts) on two approache...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21810

    authors: Li S,Mukherjee B,Taylor JM,Rice KM,Wen X,Rice JD,Stringham HM,Boehnke M

    更新日期:2014-07-01 00:00:00

  • Linkage analysis in alcohol dependence.

    abstract::Alcohol dependence often is a familial disorder and has a genetic component. Research in causative factors of alcoholism is coordinated by a multi-center program, COGA [The Collaborative Study on the Genetics of Alcoholism, Begleiter et al., 1995]. We analyzed a subset of the COGA family sample, 84 pedigrees of Caucas...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170768

    authors: Windemuth C,Hahn A,Strauch K,Baur MP,Wienker TF

    更新日期:1999-01-01 00:00:00

  • Bayesian linkage and segregation analysis: factoring the problem.

    abstract::Complex segregation analysis and linkage methods are mathematical techniques for the genetic dissection of complex diseases. They are used to delineate complex modes of familial transmission and to localize putative disease susceptibility loci to specific chromosomal locations. The computational problem of Bayesian li...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/1098-2272(2000)19:1+<::AID-GEPI8>3.0.CO;2-

    authors: Matthysse S

    更新日期:2000-01-01 00:00:00

  • Linkage analysis of candidate obesity genes among the Mexican-American population of Starr County, Texas.

    abstract::Recent advances in the molecular basis of body fat regulation have identified several genes in which genetic variation may influence obesity and related measures in human populations. Genes that have been shown to have a regulatory function in the control of body fat utilization, eating behavior, and/or metabolic rate...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1999)16:4<397::AID-GEPI6>3

    authors: Bray MS,Boerwinkle E,Hanis CL

    更新日期:1999-01-01 00:00:00

  • Genetic epidemiology of breast cancer: segregation analysis of 389 Icelandic pedigrees.

    abstract::A genetic epidemiologic investigation of breast cancer involving 389 breast cancer pedigrees including information on 14,721 individuals from the Icelandic population-based cancer registry is presented. Probands were women born in or after 1920 and reported to have breast cancer in the cancer registry. The average age...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(200001)18:1<81::AID-GEPI6>

    authors: Baffoe-Bonnie AB,Beaty TH,Bailey-Wilson JE,Kiemeney LA,Sigvaldason H,Olafsdóttir G,Tryggvadóttir L,Tulinius H

    更新日期:2000-01-01 00:00:00

  • Improving power in genome-wide association studies: weights tip the scale.

    abstract::The potential of genome-wide association analysis can only be realized when they have power to detect signals despite the detrimental effect of multiple testing on power. We develop a weighted multiple testing procedure that facilitates the input of prior information in the form of groupings of tests. For each group a...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20237

    authors: Roeder K,Devlin B,Wasserman L

    更新日期:2007-11-01 00:00:00

  • Bias in parameter estimates due to omitting gene-environment interaction terms in case-control studies.

    abstract::Genetic studies are continuing to generate volumes and variety of data that can be used to examine the genetic effects. Often the effect of a genetic variant varies by nongenetic measures, what is traditionally defined as gene-environment interaction (G×E). If the G×E term is neglected, estimates of the main effects c...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22154

    authors: Lobach I

    更新日期:2018-12-01 00:00:00

  • Investigation of a candidate gene, environment, and G x E interaction using case-control and case-parent study designs.

    abstract::We investigated the independent contributions of a candidate gene and an environmental factor, and the presence of gene x environment (G x E) interaction, in the etiology of a disease in the Genetic Analysis Workshop (GAW) 12 problem 2 simulated data using a two-stage approach utilizing both case-control and case-pare...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s843

    authors: Norris JM,Selinger-Leneman H,Génin E

    更新日期:2001-01-01 00:00:00

  • Entropy-supported marker selection and Mantel statistics for haplotype sharing analysis.

    abstract::Haplotype sharing analysis is a well-established option for the investigation of the etiology of complex diseases. The statistical power of haplotype association methods depends strongly on how the information of unobserved haplotypes can be captured by multilocus genotypes. In this study we combine an entropy-based m...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20491

    authors: Schulz A,Fischer C,Chang-Claude J,Beckmann L

    更新日期:2010-05-01 00:00:00

  • Testing for association in SLE families.

    abstract::Systemic lupus erythematosus (SLE) is a complex disease which is partly determined by genetic factors which influence susceptibility to the disease phenotype. In this association study we try to define the high risk haplotypes which are responsible for this disease, together with other environmental factors. In many o...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370080607

    authors: Seuchter SA,Knapp M,Hartung K,Coldewey R,Kalden JR,Lakomek HJ,Peter HH,Deicher H,Baur MP

    更新日期:1991-01-01 00:00:00

  • Pooling data and linkage analysis in the chromosome 5q candidate region for asthma.

    abstract::We investigated a variety of methods for pooling data from eight data sets (n = 5,424 subjects) to validate evidence for linkage of markers in the cytokine cluster on chromosome 5q31-33 to asthma and asthma-associated phenotypes. Chromosome 5 markers were integrated into current genetic linkage and physical maps, and ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,meta分析

    doi:10.1002/gepi.2001.21.s1.s103

    authors: Jacobs KB,Burton PR,Iyengar SK,Elston RC,Palmer LJ

    更新日期:2001-01-01 00:00:00

  • Linkage analysis of Alzheimer's disease with methods using relative pairs.

    abstract::Four relative-pair methods for detecting genetic linkage were applied to familial Alzheimer's disease data. Results obtained using an extended Haseman-Elston test and a weighted rank pairwise correlation test, which both use information from all relative pairs, were consistent with previously published likelihood resu...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100608

    authors: Blossey H,Commenges D,Olson JM

    更新日期:1993-01-01 00:00:00

  • Effect of linkage disequilibrium between markers in linkage and association analyses.

    abstract::Contributions to Group 17 of the Genetic Analysis Workshop 15 considered dense markers in linkage disequilibrium (LD) in the context of either linkage or association analysis. Three contributions reported on methods for modeling LD or selecting a subset of markers in linkage equilibrium to perform linkage analysis. Wh...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20291

    authors: Dupuis J,Albers K,Allen-Brady K,Cho K,Elston RC,Kappen HJ,Tang H,Thomas A,Thomson G,Tsung E,Yang Q,Zhang W,Zhao K,Zheng G,Ziegler JT

    更新日期:2007-01-01 00:00:00

  • Rare-variant association tests in longitudinal studies, with an application to the Multi-Ethnic Study of Atherosclerosis (MESA).

    abstract::Over the past few years, an increasing number of studies have identified rare variants that contribute to trait heritability. Due to the extreme rarity of some individual variants, gene-based association tests have been proposed to aggregate the genetic variants within a gene, pathway, or specific genomic region as op...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22081

    authors: He Z,Lee S,Zhang M,Smith JA,Guo X,Palmas W,Kardia SLR,Ionita-Laza I,Mukherjee B

    更新日期:2017-12-01 00:00:00

  • Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene × gene patterns.

    abstract::We construct data exploration tools for recognizing important covariate patterns associated with a phenotype, with particular focus on searching for association with gene-gene patterns. To this end, we propose a new variable selection procedure that employs latent selection weights and compare it to an alternative for...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21661

    authors: Papathomas M,Molitor J,Hoggart C,Hastie D,Richardson S

    更新日期:2012-09-01 00:00:00

  • Estimation of genetic and environmental components in colorectal and lung cancer and melanoma.

    abstract::Cancer has predominant environmental and somatic causes but the assessment of hereditary (genetic) causes is difficult, except for highly penetrant single-gene causes. Family studies are only partially informative in this regard because family members share diet and life-styles. Twin studies have been classically used...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/1098-2272(200101)20:1<107::AID-GEPI9>3.0.C

    authors: Hemminki K,Lönnstedt I,Vaittinen P,Lichtenstein P

    更新日期:2001-01-01 00:00:00

  • Increasing the power of identifying gene x gene interactions in genome-wide association studies.

    abstract::In this paper we investigate the power to identify gene x gene interactions in genome-wide association studies. In our analysis we focus on two-stage analyses: analyses in which we only test for interactions between single nucleotide polymorphisms that show some marginal effect. We give two algorithms to compute signi...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20300

    authors: Kooperberg C,Leblanc M

    更新日期:2008-04-01 00:00:00

  • Population-based family study designs: an interdisciplinary research framework for genetic epidemiology.

    abstract::Most complex traits such as cancer and coronary heart diseases are attributed either to heritable factors or to environmental factors or to both. Dissecting the genetic and environmental etiology of complex traits thus requires an interdisciplinary research strategy. Genetic studies generally involve families and inve...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,评审

    doi:10.1002/(SICI)1098-2272(1997)14:4<365::AID-GEPI3>3

    authors: Zhao LP,Hsu L,Davidov O,Potter J,Elston RC,Prentice RL

    更新日期:1997-01-01 00:00:00

  • Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.

    abstract::A key aim for current genome-wide association studies (GWAS) is to interrogate the full spectrum of genetic variation underlying human traits, including rare variants, across populations. Deep whole-genome sequencing is the gold standard to fully capture genetic variation, but remains prohibitively expensive for large...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22326

    authors: Quick C,Anugu P,Musani S,Weiss ST,Burchard EG,White MJ,Keys KL,Cucca F,Sidore C,Boehnke M,Fuchsberger C

    更新日期:2020-09-01 00:00:00