Clustering by genetic ancestry using genome-wide SNP data.

Abstract:

BACKGROUND:Population stratification can cause spurious associations in a genome-wide association study (GWAS), and occurs when differences in allele frequencies of single nucleotide polymorphisms (SNPs) are due to ancestral differences between cases and controls rather than the trait of interest. Principal components analysis (PCA) is the established approach to detect population substructure using genome-wide data and to adjust the genetic association for stratification by including the top principal components in the analysis. An alternative solution is genetic matching of cases and controls that requires, however, well defined population strata for appropriate selection of cases and controls. RESULTS:We developed a novel algorithm to cluster individuals into groups with similar ancestral backgrounds based on the principal components computed by PCA. We demonstrate the effectiveness of our algorithm in real and simulated data, and show that matching cases and controls using the clusters assigned by the algorithm substantially reduces population stratification bias. Through simulation we show that the power of our method is higher than adjustment for PCs in certain situations. CONCLUSIONS:In addition to reducing population stratification bias and improving power, matching creates a clean dataset free of population stratification which can then be used to build prediction models without including variables to adjust for ancestry. The cluster assignments also allow for the estimation of genetic heterogeneity by examining cluster specific effects.

journal_name

BMC Genet

journal_title

BMC genetics

authors

Solovieff N,Hartley SW,Baldwin CT,Perls TT,Steinberg MH,Sebastiani P

doi

10.1186/1471-2156-11-108

subject

Has Abstract

pub_date

2010-12-09 00:00:00

pages

108

issn

1471-2156

pii

1471-2156-11-108

journal_volume

11

pub_type

杂志文章
  • Development of Cymbidium ensifolium genic-SSR markers and their utility in genetic diversity and population structure analysis in cymbidiums.

    abstract:BACKGROUND:Cymbidium is a genus of 68 species in the orchid family, with extremely high ornamental value. Marker-assisted selection has proven to be an effective strategy in accelerating plant breeding for many plant species. Analysis of cymbidiums genetic background by molecular markers can be of great value in assist...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-014-0124-5

    authors: Li X,Jin F,Jin L,Jackson A,Huang C,Li K,Shu X

    更新日期:2014-12-05 00:00:00

  • GpnmbR150X allele must be present in bone marrow derived cells to mediate DBA/2J glaucoma.

    abstract:BACKGROUND:The Gpnmb gene encodes a transmembrane protein whose function(s) remain largely unknown. Here, we assess if a mutant allele of Gpnmb confers susceptibility to glaucoma by altering immune functions. DBA/2J mice have a mutant Gpnmb gene and they develop a form of glaucoma preceded by a pigment dispersing iris ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-30

    authors: Anderson MG,Nair KS,Amonoo LA,Mehalow A,Trantow CM,Masli S,John SW

    更新日期:2008-04-10 00:00:00

  • Polymorphisms of two loci at the oxytocin receptor gene in populations of Africa, Asia and South Europe.

    abstract:BACKGROUND:The oxytocin (OT) system is known to be implicated in the regulation of complex social behavior, particularly empathy and parenting. The goal of this study was to estimate the gender and population differences in polymorphisms of two oxytocin receptor gene SNPs, rs53576 and rs2254298, in four populations. R...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-015-0323-8

    authors: Butovskaya PR,Lazebny OE,Sukhodolskaya EM,Vasiliev VA,Dronova DA,Fedenok JN,Rosa A,Peletskaya EN,Ryskov AP,Butovskaya ML

    更新日期:2016-01-06 00:00:00

  • Genetic insights into dispersal distance and disperser fitness of African lions (Panthera leo) from the latitudinal extremes of the Kruger National Park, South Africa.

    abstract:BACKGROUND:Female lions generally do not disperse far beyond their natal range, while males can disperse distances of over 200 km. However, in bush-like ecosystems dispersal distances less than 25 km are reported. Here, we investigate dispersal in lions sampled from the northern and southern extremes of Kruger National...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-018-0607-x

    authors: van Hooft P,Keet DF,Brebner DK,Bastos ADS

    更新日期:2018-04-03 00:00:00

  • Comparison of the accuracy of methods of computational haplotype inference using a large empirical dataset.

    abstract:BACKGROUND:Analyses of genetic data at the level of haplotypes provide increased accuracy and power to infer genotype-phenotype correlations and evolutionary history of a locus. However, empirical determination of haplotypes is expensive and laborious. Therefore, several methods of inferring haplotypes from unphased ge...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-5-22

    authors: Adkins RM

    更新日期:2004-08-03 00:00:00

  • Effects of single nucleotide polymorphism marker density on degree of genetic variance explained and genomic evaluation for carcass traits in Japanese Black beef cattle.

    abstract:BACKGROUND:Japanese Black cattle are a beef breed whose meat is well known to excel in meat quality, especially in marbling, and whose effective population size is relatively low in Japan. Unlike dairy cattle, the accuracy of genomic evaluation (GE) for carcass traits in beef cattle, including this breed, has been poor...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-15-15

    authors: Ogawa S,Matsuda H,Taniguchi Y,Watanabe T,Nishimura S,Sugimoto Y,Iwaisaki H

    更新日期:2014-02-03 00:00:00

  • Coronary risk in relation to genetic variation in MEOX2 and TCF15 in a Flemish population.

    abstract:BACKGROUND:In mice MEOX2/TCF15 heterodimers are highly expressed in heart endothelial cells and are involved in the transcriptional regulation of lipid transport. In a general population, we investigated whether genetic variation in these genes predicted coronary heart disease (CHD). RESULTS:In 2027 participants rando...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-015-0272-2

    authors: Yang WY,Petit T,Thijs L,Zhang ZY,Jacobs L,Hara A,Wei FF,Salvi E,Citterio L,Delli Carpini S,Gu YM,Knez J,Cauwenberghs N,Barcella M,Barlassina C,Manunta P,Coppiello G,Aranguren XL,Kuznetsova T,Cusi D,Verhamme P,Lu

    更新日期:2015-10-01 00:00:00

  • Quantitative trait loci in Anopheles gambiae controlling the encapsulation response against Plasmodium cynomolgi Ceylon.

    abstract:BACKGROUND:Anopheles gambiae females are the world's most successful vectors of human malaria. However, a fraction of these mosquitoes is refractory to Plasmodium development. L3-5, a laboratory selected refractory strain, encapsulates transforming ookinetes/early oocysts of a wide variety of Plasmodium species. Previo...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-4-16

    authors: Zheng L,Wang S,Romans P,Zhao H,Luna C,Benedict MQ

    更新日期:2003-10-24 00:00:00

  • Population substructure in Finland and Sweden revealed by the use of spatial coordinates and a small number of unlinked autosomal SNPs.

    abstract:BACKGROUND:Despite several thousands of years of close contacts, there are genetic differences between the neighbouring countries of Finland and Sweden. Within Finland, signs of an east-west duality have been observed, whereas the population structure within Sweden has been suggested to be more subtle. With a fine-scal...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-54

    authors: Hannelius U,Salmela E,Lappalainen T,Guillot G,Lindgren CM,von Döbeln U,Lahermo P,Kere J

    更新日期:2008-08-19 00:00:00

  • Composite selection signals can localize the trait specific genomic regions in multi-breed populations of cattle and sheep.

    abstract:BACKGROUND:Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-15-34

    authors: Randhawa IA,Khatkar MS,Thomson PC,Raadsma HW

    更新日期:2014-03-17 00:00:00

  • Identification of major QTLs underlying tomato spotted wilt virus resistance in peanut cultivar Florida-EP(TM) '113'.

    abstract:BACKGROUND:Spotted wilt caused by tomato spotted wilt virus (TSWV) is one of the major peanut (Arachis hypogaea L.) diseases in the southeastern United States. Occurrence, severity, and symptoms of spotted wilt disease are highly variable from season to season, making it difficult to efficiently evaluate breeding popul...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-016-0435-9

    authors: Tseng YC,Tillman BL,Peng Z,Wang J

    更新日期:2016-09-06 00:00:00

  • A copy number variation in human NCF1 and its pseudogenes.

    abstract:BACKGROUND:Neutrophil cytosolic factor-1 (NCF1) is a component of NADPH oxidase. The NCF1 gene colocalizes with two pseudogenes (NCF1B and NCF1C). These two pseudogenes have a GT deletion in exon 2, resulting in a frameshift and an early stop codon. Here, we report a copy number variation (CNV) of the NCF1 pseudogenes ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-11-13

    authors: Brunson T,Wang Q,Chambers I,Song Q

    更新日期:2010-02-23 00:00:00

  • Further identification of a 140bp sequence from amid intron 9 of human FMR1 gene as a new exon.

    abstract:BACKGROUND:The disease gene of fragile X syndrome, FMR1 gene, encodes fragile X mental retardation protein (FMRP). The alternative splicing (AS) of FMR1 can affect the structure and function of FMRP. However, the biological functions of alternatively spliced isoforms remain elusive. In a previous study, we identified a...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-020-00870-2

    authors: Yang WJ,Yan AZ,Xu YJ,Guo XY,Fu XG,Li D,Liao J,Zhang D,Lan FH

    更新日期:2020-06-18 00:00:00

  • Insertion-deletion polymorphisms (indels) as genetic markers in natural populations.

    abstract:BACKGROUND:We introduce the use of short insertion-deletion polymorphisms (indels) for genetic analysis of natural populations. RESULTS:Sequence reads from light shot-gun sequencing efforts of different dog breeds were aligned to the dog genome reference sequence and gaps corresponding to indels were identified. One h...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-8

    authors: Väli U,Brandström M,Johansson M,Ellegren H

    更新日期:2008-01-22 00:00:00

  • Bayesian shrinkage mapping of quantitative trait loci in variance component models.

    abstract:BACKGROUND:In this article, I propose a model-selection-free method to map multiple quantitative trait loci (QTL) in variance component model, which is useful in outbred populations. The new method can estimate the variance of zero-effect QTL infinitely to zero, but nearly unbiased for non-zero-effect QTL. It is analog...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-11-30

    authors: Fang M

    更新日期:2010-04-29 00:00:00

  • The use of the SLC16A1 gene as a potential marker to predict race performance in Arabian horses.

    abstract:BACKGROUND:Arabian horses are commonly believed to be one of the oldest and the most popular horse breeds in the world, characterized by favourable stamina traits and exercise phenotypes. During intensive training, the rates of lactate production and utilization are critical to avoid muscle fatigue and a decrease in ex...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-019-0774-4

    authors: Ropka-Molik K,Stefaniuk-Szmukier M,Szmatoła T,Piórkowska K,Bugno-Poniewierska M

    更新日期:2019-09-11 00:00:00

  • Statistically efficient association analysis of quantitative traits with haplotypes and untyped SNPs in family studies.

    abstract:BACKGROUND:Associations between haplotypes and quantitative traits provide valuable information about the genetic basis of complex human diseases. Haplotypes also provide an effective way to deal with untyped SNPs. Two major challenges arise in haplotype-based association analysis of family data. First, haplotypes may ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-020-00902-x

    authors: Diao G,Lin DY

    更新日期:2020-09-07 00:00:00

  • Genome-wide linkage scan for genes affecting longitudinal trends in systolic blood pressure.

    abstract::Only one genome scan to date has attempted to make use of the longitudinal data available in the Framingham Heart Study, and this attempt yielded evidence of linkage to a gene for mean systolic blood pressure. We show how the additional information available in these longitudinal data can be utilized to examine linkag...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-4-S1-S82

    authors: Jacobs KB,Gray-McGuire C,Cartier KC,Elston RC

    更新日期:2003-12-31 00:00:00

  • Genomic characterisation, chromosomal assignment and in vivo localisation of the canine high mobility group A1 (HMGA1) gene.

    abstract:BACKGROUND:The high mobility group A1 proteins (HMGA1a/HMGA1b) are highly conserved between mammalian species and widely described as participating in various cellular processes. By inducing DNA conformation changes the HMGA1 proteins indirectly influence the binding of various transcription factors and therefore effec...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-49

    authors: Beuing C,Soller JT,Muth M,Wagner S,Dolf G,Schelling C,Richter A,Willenbrock S,Reimann-Berg N,Winkler S,Nolte I,Bullerdiek J,Murua Escobar H

    更新日期:2008-07-23 00:00:00

  • A single nucleotide polymorphism in CAPN1 associated with marbling score in Korean cattle.

    abstract:BACKGROUND:Marbling score (MS) is the major quantitative trait that affects carcass quality in beef cattle. In this study, we examined the association between genetic polymorphisms of the micromolar calcium-activated neutral protease gene (micro-calpain, CAPN1) and carcass traits in Korean cattle (also known as Hanwoo)...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-33

    authors: Cheong HS,Yoon DH,Park BL,Kim LH,Bae JS,Namgoong S,Lee HW,Han CS,Kim JO,Cheong IC,Shin HD

    更新日期:2008-04-19 00:00:00

  • Genome-wide analysis of long non-coding RNAs in Catalpa bungei and their potential function in floral transition using high-throughput sequencing.

    abstract:BACKGROUND:Long non-coding RNAs (lncRNAs) have crucial roles in various biological regulatory processes. However, the study of lncRNAs is limited in woody plants. Catalpa bungei is a valuable ornamental tree with a long cultivation history in China, and a deeper understanding of the floral transition mechanism in C. bu...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-018-0671-2

    authors: Wang Z,Zhu T,Ma W,Wang N,Qu G,Zhang S,Wang J

    更新日期:2018-09-20 00:00:00

  • On different approximations to multilocus identity-by-descent calculations and the resulting power of variance component-based linkage analysis.

    abstract::An empirical comparison between three different methods for estimation of pair-wise identity-by-descent (IBD) sharing at marker loci was conducted in order to quantify the resulting differences in power and localization precision in variance components-based linkage analysis. On the examined simulated, error-free data...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-4-S1-S72

    authors: Göring HH,Williams JT,Dyer TD,Blangero J

    更新日期:2003-12-31 00:00:00

  • Nucleotide variability and linkage disequilibrium patterns in the porcine MUC4 gene.

    abstract:BACKGROUND:MUC4 is a type of membrane anchored glycoprotein and serves as the major constituent of mucus that covers epithelial surfaces of many tissues such as trachea, colon and cervix. MUC4 plays important roles in the lubrication and protection of the surface epithelium, cell proliferation and differentiation, immu...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-13-57

    authors: Yang M,Yang B,Yan X,Ouyang J,Zeng W,Ai H,Ren J,Huang L

    更新日期:2012-07-13 00:00:00

  • The association of the PON1 Q192R polymorphism with coronary heart disease: findings from the British Women's Heart and Health cohort study and a meta-analysis.

    abstract:BACKGROUND:There have been inconsistent results from case-control studies assessing the association of the PON1 Q192R polymorphism with coronary heart disease (CHD). Most studies have included predominantly men and the association in women is unclear. Since lipid levels vary between the sexes the antioxidant effect of ...

    journal_title:BMC genetics

    pub_type: 杂志文章,meta分析,评审

    doi:10.1186/1471-2156-5-17

    authors: Lawlor DA,Day IN,Gaunt TR,Hinks LJ,Briggs PJ,Kiessling M,Timpson N,Smith GD,Ebrahim S

    更新日期:2004-06-23 00:00:00

  • Screening of the arrestin gene in dogs afflicted with generalized progressive retinal atrophy.

    abstract:BACKGROUND:Intronic DNA sequences of the canine arrestin (SAG) gene was screened to identify potential disease causing mutations in dogs with generalized progressive retinal atrophy (gPRA). The intronic sequences flanking each of the 16 exons were obtained from clones of a canine genomic library. RESULTS:Using polymer...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-3-12

    authors: Dekomien G,Epplen JT

    更新日期:2002-07-17 00:00:00

  • Characterization of the late embryogenesis abundant (LEA) proteins family and their role in drought stress tolerance in upland cotton.

    abstract:BACKGROUND:Late embryogenesis abundant (LEA) proteins are large groups of hydrophilic proteins with major role in drought and other abiotic stresses tolerance in plants. In-depth study and characterization of LEA protein families have been carried out in other plants, but not in upland cotton. The main aim of this rese...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-017-0596-1

    authors: Magwanga RO,Lu P,Kirungu JN,Lu H,Wang X,Cai X,Zhou Z,Zhang Z,Salih H,Wang K,Liu F

    更新日期:2018-01-15 00:00:00

  • Characterization of the chromosomal inversion associated with the Koa mutation in the mouse revealed the cause of skeletal abnormalities.

    abstract:BACKGROUND:Koala (Koa) is a dominant mutation in mice causing bushy muzzle and pinna, and is associated with a chromosomal inversion on the distal half of chromosome 15. To identify the gene responsible for the Koa phenotypes, we investigated phenotypes of Koa homozygous mice and determined the breakpoints of the inver...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-10-60

    authors: Katayama K,Miyamoto S,Furuno A,Akiyama K,Takahashi S,Suzuki H,Tsuji T,Kunieda T

    更新日期:2009-09-22 00:00:00

  • GM and KM immunoglobulin allotypes in the Galician population: new insights into the peopling of the Iberian Peninsula.

    abstract:BACKGROUND:The current genetic structure of Iberian populations has presumably been affected by the complex orography of its territory, the different people and civilizations that settled there, its ancient and complex history, the diverse and persistent sociocultural patterns in its different regions, and also by the ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-8-37

    authors: Calderón R,Lodeiro R,Varela TA,Fariña J,Ambrosio B,Guitard E,González-Martín A,Dugoujon JM

    更新日期:2007-06-27 00:00:00

  • A simple nonparametric multipoint procedure to test for linkage through mothers or fathers as well as imprinting effects in the presence of linkage.

    abstract::A simple multipoint procedure to test for parent-of-origin effects in samples of affected siblings is discussed. The procedure consists of artificially changing all full sibs to half-sibs, with distinct mothers or fathers depending on the parental origin to be evaluated, then analyzing these families with commonly use...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-6-S1-S159

    authors: Lemire M

    更新日期:2005-12-30 00:00:00

  • Probability genotype imputation method and integrated weighted lasso for QTL identification.

    abstract:BACKGROUND:Many QTL studies have two common features: (1) often there is missing marker information, (2) among many markers involved in the biological process only a few are causal. In statistics, the second issue falls under the headings "sparsity" and "causal inference". The goal of this work is to develop a two-step...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-14-125

    authors: Demetrashvili N,Van den Heuvel ER,Wit EC

    更新日期:2013-12-30 00:00:00