Simultaneous SNP selection and adjustment for population structure in high dimensional prediction models.

Abstract:

:Complex traits are known to be influenced by a combination of environmental factors and rare and common genetic variants. However, detection of such multivariate associations can be compromised by low statistical power and confounding by population structure. Linear mixed effects models (LMM) can account for correlations due to relatedness but have not been applicable in high-dimensional (HD) settings where the number of fixed effect predictors greatly exceeds the number of samples. False positives or false negatives can result from two-stage approaches, where the residuals estimated from a null model adjusted for the subjects' relationship structure are subsequently used as the response in a standard penalized regression model. To overcome these challenges, we develop a general penalized LMM with a single random effect called ggmix for simultaneous SNP selection and adjustment for population structure in high dimensional prediction models. We develop a blockwise coordinate descent algorithm with automatic tuning parameter selection which is highly scalable, computationally efficient and has theoretical guarantees of convergence. Through simulations and three real data examples, we show that ggmix leads to more parsimonious models compared to the two-stage approach or principal component adjustment with better prediction accuracy. Our method performs well even in the presence of highly correlated markers, and when the causal SNPs are included in the kinship matrix. ggmix can be used to construct polygenic risk scores and select instrumental variables in Mendelian randomization studies. Our algorithms are available in an R package available on CRAN (https://cran.r-project.org/package=ggmix).

journal_name

PLoS Genet

journal_title

PLoS genetics

authors

Bhatnagar SR,Yang Y,Lu T,Schurr E,Loredo-Osti JC,Forest M,Oualkacha K,Greenwood CMT

doi

10.1371/journal.pgen.1008766

subject

Has Abstract

pub_date

2020-05-04 00:00:00

pages

e1008766

issue

5

eissn

1553-7390

issn

1553-7404

pii

PGENETICS-D-19-01153

journal_volume

16

pub_type

杂志文章
  • Binary addition in a living cell based on riboregulation.

    abstract::Synthetic biology aims at (re-)programming living cells like computers to perform new functions for a variety of applications. Initial work rested on transcription factors, but regulatory RNAs have recently gained much attention due to their high programmability. However, functional circuits mainly implemented with re...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1007548

    authors: Rosado A,Cordero T,Rodrigo G

    更新日期:2018-07-19 00:00:00

  • Correction: Protein Poly(ADP-ribosyl)ation Regulates Arabidopsis Immune Gene Expression and Defense Responses.

    abstract::[This corrects the article DOI: 10.1371/journal.pgen.1004936.]. ...

    journal_title:PLoS genetics

    pub_type: 已发布勘误

    doi:10.1371/journal.pgen.1005294

    authors: Feng B,Liu C,de Oliveira MV,Intorne AC,Li B,Babilonia K,de Souza Filho GA,Shan L,He P

    更新日期:2016-09-09 00:00:00

  • Preferential binding to Elk-1 by SLE-associated IL10 risk allele upregulates IL10 expression.

    abstract::Immunoregulatory cytokine interleukin-10 (IL-10) is elevated in sera from patients with systemic lupus erythematosus (SLE) correlating with disease activity. The established association of IL10 with SLE and other autoimmune diseases led us to fine map causal variant(s) and to explore underlying mechanisms. We assessed...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1003870

    authors: Sakurai D,Zhao J,Deng Y,Kelly JA,Brown EE,Harley JB,Bae SC,Alarcόn-Riquelme ME,BIOLUPUS and GENLES networks.,Edberg JC,Kimberly RP,Ramsey-Goldman R,Petri MA,Reveille JD,Vilá LM,Alarcón GS,Kaufman KM,Vyse TJ,Jacob CO,

    更新日期:2013-01-01 00:00:00

  • Transcriptional mutagenesis induced by 8-oxoguanine in mammalian cells.

    abstract::Most of the somatic cells of adult metazoans, including mammals, do not undergo continuous cycles of replication. Instead, they are quiescent and devote most of their metabolic activity to gene expression. The mutagenic consequences of exposure to DNA-damaging agents are well documented, but less is known about the im...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1000577

    authors: Brégeon D,Peignon PA,Sarasin A

    更新日期:2009-07-01 00:00:00

  • DNA dynamics during early double-strand break processing revealed by non-intrusive imaging of living cells.

    abstract::Chromosome breakage is a major threat to genome integrity. The most accurate way to repair DNA double strand breaks (DSB) is homologous recombination (HR) with an intact copy of the broken locus. Mobility of the broken DNA has been seen to increase during the search for a donor copy. Observing chromosome dynamics duri...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1004187

    authors: Saad H,Gallardo F,Dalvai M,Tanguy-le-Gac N,Lane D,Bystricky K

    更新日期:2014-03-13 00:00:00

  • A candidate gene approach identifies the CHRNA5-A3-B4 region as a risk factor for age-dependent nicotine addiction.

    abstract::People who begin daily smoking at an early age are at greater risk of long-term nicotine addiction. We tested the hypothesis that associations between nicotinic acetylcholine receptor (nAChR) genetic variants and nicotine dependence assessed in adulthood will be stronger among smokers who began daily nicotine exposure...

    journal_title:PLoS genetics

    pub_type: 杂志文章,随机对照试验

    doi:10.1371/journal.pgen.1000125

    authors: Weiss RB,Baker TB,Cannon DS,von Niederhausern A,Dunn DM,Matsunami N,Singh NA,Baird L,Coon H,McMahon WM,Piper ME,Fiore MC,Scholand MB,Connett JE,Kanner RE,Gahring LC,Rogers SW,Hoidal JR,Leppert MF

    更新日期:2008-07-11 00:00:00

  • ------Widespread conservation and lineage-specific diversification of genome-wide DNA methylation patterns across arthropods.

    abstract::Cytosine methylation is an ancient epigenetic modification yet its function and extent within genomes is highly variable across eukaryotes. In mammals, methylation controls transposable elements and regulates the promoters of genes. In insects, DNA methylation is generally restricted to a small subset of transcribed g...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1008864

    authors: Lewis SH,Ross L,Bain SA,Pahita E,Smith SA,Cordaux R,Miska EA,Lenhard B,Jiggins FM,Sarkies P

    更新日期:2020-06-25 00:00:00

  • Unraveling the genetics of human obesity.

    abstract::The use of modern molecular biology tools in deciphering the perturbed biochemistry and physiology underlying the obese state has proven invaluable. Identifying the hypothalamic leptin/melanocortin pathway as critical in many cases of monogenic obesity has permitted targeted, hypothesis-driven experiments to be perfor...

    journal_title:PLoS genetics

    pub_type: 杂志文章,评审

    doi:10.1371/journal.pgen.0020188

    authors: Mutch DM,Clément K

    更新日期:2006-12-29 00:00:00

  • Comparative analysis of regulatory elements between Escherichia coli and Klebsiella pneumoniae by genome-wide transcription start site profiling.

    abstract::Genome-wide transcription start site (TSS) profiles of the enterobacteria Escherichia coli and Klebsiella pneumoniae were experimentally determined through modified 5' RACE followed by deep sequencing of intact primary mRNA. This identified 3,746 and 3,143 TSSs for E. coli and K. pneumoniae, respectively. Experimental...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1002867

    authors: Kim D,Hong JS,Qiu Y,Nagarajan H,Seo JH,Cho BK,Tsai SF,Palsson BØ

    更新日期:2012-01-01 00:00:00

  • Leaf shedding as an anti-bacterial defense in Arabidopsis cauline leaves.

    abstract::Plants utilize an innate immune system to protect themselves from disease. While many molecular components of plant innate immunity resemble the innate immunity of animals, plants also have evolved a number of truly unique defense mechanisms, particularly at the physiological level. Plant's flexible developmental prog...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1007132

    authors: Patharkar OR,Gassmann W,Walker JC

    更新日期:2017-12-18 00:00:00

  • Coronary heart disease-associated variation in TCF21 disrupts a miR-224 binding site and miRNA-mediated regulation.

    abstract::Genome-wide association studies (GWAS) have identified chromosomal loci that affect risk of coronary heart disease (CHD) independent of classical risk factors. One such association signal has been identified at 6q23.2 in both Caucasians and East Asians. The lead CHD-associated polymorphism in this region, rs12190287, ...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1004263

    authors: Miller CL,Haas U,Diaz R,Leeper NJ,Kundu RK,Patlolla B,Assimes TL,Kaiser FJ,Perisic L,Hedin U,Maegdefessel L,Schunkert H,Erdmann J,Quertermous T,Sczakiel G

    更新日期:2014-03-27 00:00:00

  • Trends in selenium utilization in marine microbial world revealed through the analysis of the global ocean sampling (GOS) project.

    abstract::Selenium is an important trace element that occurs in proteins in the form of selenocysteine (Sec) and in tRNAs in the form of selenouridine. Recent large-scale metagenomics projects provide an opportunity for understanding global trends in trace element utilization. Herein, we characterized the selenoproteome of the ...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1000095

    authors: Zhang Y,Gladyshev VN

    更新日期:2008-06-13 00:00:00

  • Meiosis-specific stable binding of augmin to acentrosomal spindle poles promotes biased microtubule assembly in oocytes.

    abstract::In the oocytes of many animals including humans, the meiotic spindle assembles without centrosomes. It is still unclear how multiple pathways contribute to spindle microtubule assembly, and whether they are regulated differently in mitosis and meiosis. Augmin is a γ-tubulin recruiting complex which "amplifies" spindle...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1003562

    authors: Colombié N,Głuszek AA,Meireles AM,Ohkura H

    更新日期:2013-06-01 00:00:00

  • HSPB7 prevents cardiac conduction system defect through maintaining intercalated disc integrity.

    abstract::HSPB7 is a member of the small heat-shock protein (HSPB) family and is expressed in the cardiomyocytes from cardiogenesis onwards. A dramatic increase in HSPB7 is detected in the heart and blood plasma immediately after myocardial infarction. Additionally, several single-nucleotide polymorphisms of HSPB7 have been ide...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1006984

    authors: Liao WC,Juo LY,Shih YL,Chen YH,Yan YT

    更新日期:2017-08-21 00:00:00

  • PCNA ubiquitylation ensures timely completion of unperturbed DNA replication in fission yeast.

    abstract::PCNA ubiquitylation on lysine 164 is required for DNA damage tolerance. In many organisms PCNA is also ubiquitylated in unchallenged S phase but the significance of this has not been established. Using Schizosaccharomyces pombe, we demonstrate that lysine 164 ubiquitylation of PCNA contributes to efficient DNA replica...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1006789

    authors: Daigaku Y,Etheridge TJ,Nakazawa Y,Nakayama M,Watson AT,Miyabe I,Ogi T,Osborne MA,Carr AM

    更新日期:2017-05-08 00:00:00

  • Glucocorticoid receptor-dependent gene regulatory networks.

    abstract::While the molecular mechanisms of glucocorticoid regulation of transcription have been studied in detail, the global networks regulated by the glucocorticoid receptor (GR) remain unknown. To address this question, we performed an orthogonal analysis to identify direct targets of the GR. First, we analyzed the expressi...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.0010016

    authors: Phuc Le P,Friedman JR,Schug J,Brestelli JE,Parker JB,Bochkis IM,Kaestner KH

    更新日期:2005-08-01 00:00:00

  • High expression in maize pollen correlates with genetic contributions to pollen fitness as well as with coordinated transcription from neighboring transposable elements.

    abstract::In flowering plants, gene expression in the haploid male gametophyte (pollen) is essential for sperm delivery and double fertilization. Pollen also undergoes dynamic epigenetic regulation of expression from transposable elements (TEs), but how this process interacts with gene expression is not clearly understood. To e...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1008462

    authors: Warman C,Panda K,Vejlupkova Z,Hokin S,Unger-Wallace E,Cole RA,Chettoor AM,Jiang D,Vollbrecht E,Evans MMS,Slotkin RK,Fowler JE

    更新日期:2020-04-01 00:00:00

  • Determinants beyond both complementarity and cleavage govern microR159 efficacy in Arabidopsis.

    abstract::Plant microRNAs (miRNAs) are critical regulators of gene expression, however little attention has been given to the principles governing miRNA silencing efficacy. Here, we utilize the highly conserved Arabidopsis miR159-MYB33/MYB65 regulatory module to explore these principles. Firstly, we show that perfect central co...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1004232

    authors: Li J,Reichel M,Millar AA

    更新日期:2014-03-13 00:00:00

  • A positive feedback loop links opposing functions of P-TEFb/Cdk9 and histone H2B ubiquitylation to regulate transcript elongation in fission yeast.

    abstract::Transcript elongation by RNA polymerase II (RNAPII) is accompanied by conserved patterns of histone modification. Whereas histone modifications have established roles in transcription initiation, their functions during elongation are not understood. Mono-ubiquitylation of histone H2B (H2Bub1) plays a key role in coord...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1002822

    authors: Sansó M,Lee KM,Viladevall L,Jacques PÉ,Pagé V,Nagy S,Racine A,St Amour CV,Zhang C,Shokat KM,Schwer B,Robert F,Fisher RP,Tanny JC

    更新日期:2012-01-01 00:00:00

  • Evolution of the Auxin Response Factors from charophyte ancestors.

    abstract::Auxin is a major developmental regulator in plants and the acquisition of a transcriptional response to auxin likely contributed to developmental innovations at the time of water-to-land transition. Auxin Response Factors (ARFs) Transcription Factors (TFs) that mediate auxin-dependent transcriptional changes are divid...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1008400

    authors: Martin-Arevalillo R,Thévenon E,Jégu F,Vinos-Poyo T,Vernoux T,Parcy F,Dumas R

    更新日期:2019-09-25 00:00:00

  • Compensatory interactions between Sir3p and the nucleosomal LRS surface imply their direct interaction.

    abstract::The previously identified LRS (Loss of rDNA Silencing) domain of the nucleosome is critically important for silencing at both ribosomal DNA and telomeres. To understand the function of the LRS surface in silencing, we performed an EMS mutagenesis screen to identify suppressors of the H3 A75V LRS allele. We identified ...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1000301

    authors: Norris A,Bianchet MA,Boeke JD

    更新日期:2008-12-01 00:00:00

  • Heteroduplex DNA position defines the roles of the Sgs1, Srs2, and Mph1 helicases in promoting distinct recombination outcomes.

    abstract::The contributions of the Sgs1, Mph1, and Srs2 DNA helicases during mitotic double-strand break (DSB) repair in yeast were investigated using a gap-repair assay. A diverged chromosomal substrate was used as a repair template for the gapped plasmid, allowing mismatch-containing heteroduplex DNA (hDNA) formed during reco...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1003340

    authors: Mitchel K,Lehner K,Jinks-Robertson S

    更新日期:2013-01-01 00:00:00

  • Parallelism in eco-morphology and gene expression despite variable evolutionary and genomic backgrounds in a Holarctic fish.

    abstract::Understanding the extent to which ecological divergence is repeatable is essential for predicting responses of biodiversity to environmental change. Here we test the predictability of evolution, from genotype to phenotype, by studying parallel evolution in a salmonid fish, Arctic charr (Salvelinus alpinus), across ele...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1008658

    authors: Jacobs A,Carruthers M,Yurchenko A,Gordeeva NV,Alekseyev SS,Hooker O,Leong JS,Minkley DR,Rondeau EB,Koop BF,Adams CE,Elmer KR

    更新日期:2020-04-17 00:00:00

  • Interplay between UNG and AID governs intratumoral heterogeneity in mature B cell lymphoma.

    abstract::Most B cell lymphomas originate from B cells that have germinal center (GC) experience and bear chromosome translocations and numerous point mutations. GC B cells remodel their immunoglobulin (Ig) genes by somatic hypermutation (SHM) and class switch recombination (CSR) in their Ig genes. Activation Induced Deaminase ...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1008960

    authors: Delgado P,Álvarez-Prado ÁF,Marina-Zárate E,Sernandez IV,Mur SM,de la Barrera J,Sanchez-Cabo F,Cañamero M,de Molina A,Belver L,de Yébenes VG,Ramiro AR

    更新日期:2020-12-23 00:00:00

  • Correction: TOR-autophagy branch signaling via Imp1 dictates plant-microbe biotrophic interface longevity.

    abstract::[This corrects the article DOI: 10.1371/journal.pgen.1007814.]. ...

    journal_title:PLoS genetics

    pub_type: 杂志文章,已发布勘误

    doi:10.1371/journal.pgen.1008016

    authors: Sun G,Elowsky C,Li G,Wilson RA

    更新日期:2019-02-28 00:00:00

  • Endogenous viral elements in animal genomes.

    abstract::Integration into the nuclear genome of germ line cells can lead to vertical inheritance of retroviral genes as host alleles. For other viruses, germ line integration has only rarely been documented. Nonetheless, we identified endogenous viral elements (EVEs) derived from ten non-retroviral families by systematic in si...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1001191

    authors: Katzourakis A,Gifford RJ

    更新日期:2010-11-18 00:00:00

  • Stress-induced nuclear RNA degradation pathways regulate yeast bromodomain factor 2 to promote cell survival.

    abstract::Bromodomain proteins are key regulators of gene expression. How the levels of these factors are regulated in specific environmental conditions is unknown. Previous work has established that expression of yeast Bromodomain factor 2 (BDF2) is limited by spliceosome-mediated decay (SMD). Here we show that BDF2 is subject...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1004661

    authors: Roy K,Chanfreau G

    更新日期:2014-09-18 00:00:00

  • Convergent evolution of linked mating-type loci in basidiomycete fungi.

    abstract::Sexual development is a key evolutionary innovation of eukaryotes. In many species, mating involves interaction between compatible mating partners that can undergo cell and nuclear fusion and subsequent steps of development including meiosis. Mating compatibility in fungi is governed by the mating type (MAT) loci. In ...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1008365

    authors: Sun S,Coelho MA,Heitman J,Nowrousian M

    更新日期:2019-09-06 00:00:00

  • The DSIF subunits Spt4 and Spt5 have distinct roles at various phases of immunoglobulin class switch recombination.

    abstract::Class-switch recombination (CSR), induced by activation-induced cytidine deaminase (AID), can be divided into two phases: DNA cleavage of the switch (S) regions and the joining of the cleaved ends of the different S regions. Here, we show that the DSIF complex (Spt4 and Spt5), a transcription elongation factor, is req...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1002675

    authors: Stanlie A,Begum NA,Akiyama H,Honjo T

    更新日期:2012-01-01 00:00:00

  • The CFTR Met 470 allele is associated with lower birth rates in fertile men from a population isolate.

    abstract::Although little is known about the role of the cystic fibrosis transmembrane regulator (CFTR) gene in reproductive physiology, numerous variants in this gene have been implicated in etiology of male infertility due to congenital bilateral absence of the vas deferens (CBAVD). Here, we studied the fertility effects of t...

    journal_title:PLoS genetics

    pub_type: 杂志文章

    doi:10.1371/journal.pgen.1000974

    authors: Kosova G,Pickrell JK,Kelley JL,McArdle PF,Shuldiner AR,Abney M,Ober C

    更新日期:2010-06-03 00:00:00