A test for detecting differential indirect trans effects between two groups of samples.

Abstract:

:Integrative analysis of copy number and gene expression data can help in understanding the cis and trans effect of copy number aberrations on transcription levels of genes involved in a pathway. To analyse how these copy number mediated gene-gene interactions differ between groups of samples we propose a new method, named dNET. Our method uses ridge regression to model the network topology involving one gene's expression level, its gene dosage and the expression levels of other genes in the network. The interaction parameters are estimated by fitting the model per gene for all samples together. However, instead of testing for differential network topology per gene, dNET tests for an overall difference in estimated parameters between two groups of samples and produces a single p-value. With the help of several simulation studies, we show that dNET can detect differential network nodes with high accuracy and low rate of false positives even in the presence of differential cis effects. We also apply dNET to publicly available TCGA cancer datasets and identify pathways where copy number mediated gene-gene interactions differ between samples with cancer stage lower than stage 3 and samples with cancer stage 3 or above.

authors

Chaturvedi N,Menezes RX,Goeman JJ,Wieringen WV

doi

10.1515/sagmb-2017-0058

subject

Has Abstract

pub_date

2018-07-31 00:00:00

issue

5

eissn

2194-6302

issn

1544-6115

pii

/j/sagmb.ahead-of-print/sagmb-2017-0058/sagmb-2017

journal_volume

17

pub_type

杂志文章
  • Empirical bayes microarray ANOVA and grouping cell lines by equal expression levels.

    abstract::In the exploding field of gene expression techniques such as DNA microarrays, there are still few general probabilistic methods for analysis of variance. Linear models and ANOVA are heavily used tools in many other disciplines of scientific research. The usual F-statistic is unsatisfactory for microarray data, which e...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1125

    authors: Lönnstedt I,Rimini R,Nilsson P

    更新日期:2005-01-01 00:00:00

  • Second order optimization for the inference of gene regulatory pathways.

    abstract::With the increasing availability of experimental data on gene interactions, modeling of gene regulatory pathways has gained special attention. Gradient descent algorithms have been widely used for regression and classification applications. Unfortunately, results obtained after training a model by gradient descent are...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2012-0021

    authors: Das M,Murthy CA,De RK

    更新日期:2014-02-01 00:00:00

  • Node sampling for protein complex estimation in bait-prey graphs.

    abstract::In cellular biology, node-and-edge graph or "network" data collection often uses bait-prey technologies such as co-immunoprecipitation (CoIP). Bait-prey technologies assay relationships or "interactions" between protein pairs, with CoIP specifically measuring protein complex co-membership. Analyses of CoIP data freque...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2015-0007

    authors: Scholtens DM,Spencer BD

    更新日期:2015-08-01 00:00:00

  • Semi-parametric differential expression analysis via partial mixture estimation.

    abstract::We develop an approach for microarray differential expression analysis, i.e. identifying genes whose expression levels differ between two or more groups. Current approaches to inference rely either on full parametric assumptions or on permutation-based techniques for sampling under the null distribution. In some situa...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1333

    authors: Rossell D,Guerra R,Scott C

    更新日期:2008-01-01 00:00:00

  • Asymptotic optimality of likelihood-based cross-validation.

    abstract::Likelihood-based cross-validation is a statistical tool for selecting a density estimate based on n i.i.d. observations from the true density among a collection of candidate density estimators. General examples are the selection of a model indexing a maximum likelihood estimator, and the selection of a bandwidth index...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1036

    authors: van der Laan MJ,Dudoit S,Keles S

    更新日期:2004-01-01 00:00:00

  • On an extended interpretation of linkage disequilibrium in genetic case-control association studies.

    abstract::We are concerned with statistical inference for 2 × C × K contingency tables in the context of genetic case-control association studies. Multivariate methods based on asymptotic Gaussianity of vectors of test statistics require information about the asymptotic correlation structure among these test statistics under th...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2015-0024

    authors: Dickhaus T,Stange J,Demirhan H

    更新日期:2015-11-01 00:00:00

  • Model selection based on FDR-thresholding optimizing the area under the ROC-curve.

    abstract::We evaluate variable selection by multiple tests controlling the false discovery rate (FDR) to build a linear score for prediction of clinical outcome in high-dimensional data. Quality of prediction is assessed by the receiver operating characteristic curve (ROC) for prediction in independent patients. Thus we try to ...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1462

    authors: Graf AC,Bauer P

    更新日期:2009-01-01 00:00:00

  • Addressing the shortcomings of three recent Bayesian methods for detecting interspecific recombination in DNA sequence alignments.

    abstract::We address a potential shortcoming of three probabilistic models for detecting interspecific recombination in DNA sequence alignments: the multiple change-point model (MCP) of Suchard et al. (2003), the dual multiple change-point model (DMCP) of Minin et al. (2005), and the phylogenetic factorial hidden Markov model (...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1399

    authors: Husmeier D,Mantzaris AV

    更新日期:2008-01-01 00:00:00

  • Transmission disequilibrium test power and sample size in the presence of locus heterogeneity.

    abstract::Locus heterogeneity is one of the most important issues in gene mapping and can cause significant reductions in statistical power for gene mapping, yet no research to date has provided power and sample size calculations for family-based association methods in the presence of locus heterogeneity. The purpose of this re...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1501

    authors: Chen C,Yang G,Buyske S,Matise T,Finch SJ,Gordon D

    更新日期:2009-01-01 00:00:00

  • MLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions.

    abstract::Accurately measuring epigenetic marks such as 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) at the single-nucleotide level, requires combining data from DNA processing methods including traditional (BS), oxidative (oxBS) or Tet-Assisted (TAB) bisulfite conversion. We introduce the R package MLML2R, which...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2018-0031

    authors: Kiihl SF,Martinez-Garrido MJ,Domingo-Relloso A,Bermudez J,Tellez-Plaza M

    更新日期:2019-01-17 00:00:00

  • The relative inefficiency of sequence weights approaches in determining a nucleotide position weight matrix.

    abstract::Approaches based upon sequence weights, to construct a position weight matrix of nucleotides from aligned inputs, are popular but little effort has been expended to measure their quality. We derive optimal sequence weights that minimize the sum of the variances of the estimators of base frequency parameters for sequen...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1135

    authors: Newberg LA,McCue LA,Lawrence CE

    更新日期:2005-01-01 00:00:00

  • Accommodating uncertainty in a tree set for function estimation.

    abstract::Multiple branching trees have been used to model the acquisition of HIV drug resistance mutations, and several different algorithms have been developed to construct the tree set that best describes the data. These algorithms have mainly focused on the structure of the tree set. The focal point of this paper is estimat...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1324

    authors: Healy BC,DeGruttola VG,Hu C

    更新日期:2008-01-01 00:00:00

  • Accounting for undetected compounds in statistical analyses of mass spectrometry 'omic studies.

    abstract::Mass spectrometry is an important high-throughput technique for profiling small molecular compounds in biological samples and is widely used to identify potential diagnostic and prognostic compounds associated with disease. Commonly, this data generated by mass spectrometry has many missing values resulting when a com...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2013-0021

    authors: Taylor SL,Leiserowitz GS,Kim K

    更新日期:2013-12-01 00:00:00

  • Weighted-LASSO for structured network inference from time course data.

    abstract::We present a weighted-LASSO method to infer the parameters of a first-order vector auto-regressive model that describes time course expression data generated by directed gene-to-gene regulation networks. These networks are assumed to own prior internal structures of connectivity which drive the inference method. This ...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1519

    authors: Charbonnier C,Chiquet J,Ambroise C

    更新日期:2010-01-01 00:00:00

  • Predicting protein concentrations with ELISA microarray assays, monotonic splines and Monte Carlo simulation.

    abstract::Making sound proteomic inferences using ELISA microarray assay requires both an accurate prediction of protein concentration and a credible estimate of its error. We present a method using monotonic spline statistical models (MS), penalized constrained least squares fitting (PCLS) and Monte Carlo simulation (MC) to pr...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1364

    authors: Daly DS,Anderson KK,White AM,Gonzalez RM,Varnum SM,Zangar RC

    更新日期:2008-01-01 00:00:00

  • Dimension reduction for classification with gene expression microarray data.

    abstract::An important application of gene expression microarray data is classification of biological samples or prediction of clinical and other outcomes. One necessary part of multivariate statistical analysis in such applications is dimension reduction. This paper provides a comparison study of three dimension reduction tech...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1147

    authors: Dai JJ,Lieu L,Rocke D

    更新日期:2006-01-01 00:00:00

  • On the operational characteristics of the Benjamini and Hochberg False Discovery Rate procedure.

    abstract::Multiple testing procedures are commonly used in gene expression studies for the detection of differential expression, where typically thousands of genes are measured over at least two experimental conditions. Given the need for powerful testing procedures, and the attendant danger of false positives in multiple testi...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1302

    authors: Green GH,Diggle PJ

    更新日期:2007-01-01 00:00:00

  • Modeling, simulation and analysis of methylation profiles from reduced representation bisulfite sequencing experiments.

    abstract::The ENCODE project has funded the generation of a diverse collection of methylation profiles using reduced representation bisulfite sequencing (RRBS) technology, enabling the analysis of epigenetic variation on a genomic scale at single-site resolution. A standard application of RRBS experiments is in the location of ...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2013-0027

    authors: Lacey MR,Baribault C,Ehrlich M

    更新日期:2013-12-01 00:00:00

  • A Bayesian approach to estimation and testing in time-course microarray experiments.

    abstract::The objective of the present paper is to develop a truly functional Bayesian method specifically designed for time series microarray data. The method allows one to identify differentially expressed genes in a time-course microarray experiment, to rank them and to estimate their expression profiles. Each gene expressio...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1299

    authors: Angelini C,De Canditiis D,Mutarelli M,Pensky M

    更新日期:2007-01-01 00:00:00

  • Polyunphased: an extension to polytomous outcomes of the Unphased package for family-based genetic association analysis.

    abstract::Polytomous phenotypes arise when a disease has multiple subtypes or when two dichotomous phenotypes are analyzed simultaneously. Few software programs offer the option to analyze such phenotypes in family studies, and none implements conditional polytomous logistic regression for within-family analysis robust to popul...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2016-0035

    authors: Bureau A,Croteau J

    更新日期:2017-03-01 00:00:00

  • BayesMendel: an R environment for Mendelian risk prediction.

    abstract::Several important syndromes are caused by deleterious germline mutations of individual genes. In both clinical and research applications it is useful to evaluate the probability that an individual carries an inherited genetic variant of these genes, and to predict the risk of disease for that individual, using informa...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1063

    authors: Chen S,Wang W,Broman KW,Katki HA,Parmigiani G

    更新日期:2004-01-01 00:00:00

  • Fully Bayesian mixture model for differential gene expression: simulations and model checks.

    abstract::We present a Bayesian hierarchical model for detecting differentially expressed genes using a mixture prior on the parameters representing differential effects. We formulate an easily interpretable 3-component mixture to classify genes as over-expressed, under-expressed and non-differentially expressed, and model gene...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1314

    authors: Lewin A,Bochkina N,Richardson S

    更新日期:2007-01-01 00:00:00

  • Approximate maximum likelihood estimation for population genetic inference.

    abstract::In many population genetic problems, parameter estimation is obstructed by an intractable likelihood function. Therefore, approximate estimation methods have been developed, and with growing computational power, sampling-based methods became popular. However, these methods such as Approximate Bayesian Computation (ABC...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2017-0016

    authors: Bertl J,Ewing G,Kosiol C,Futschik A

    更新日期:2017-11-27 00:00:00

  • Approximating the variance of the conditional probability of the state of a hidden Markov model.

    abstract::In a hidden Markov model, one "estimates" the state of the hidden Markov chain at t by computing via the forwards-backwards algorithm the conditional distribution of the state vector given the observed data. The covariance matrix of this conditional distribution measures the information lost by failure to observe dire...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章,评审

    doi:10.2202/1544-6115.1296

    authors: Siegmund DO,Yakir B

    更新日期:2007-01-01 00:00:00

  • Sparse inverse of covariance matrix of QTL effects with incomplete marker data.

    abstract::Gametic models for fitting breeding values at QTL as random effects in outbred populations have become popular because they require few assumptions about the number and distribution of QTL alleles segregating. The covariance matrix of the gametic effects has an inverse that is sparse and can be constructed rapidly by ...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1048

    authors: Thallman RM,Hanford KJ,Kachman SD,Van Vleck LD

    更新日期:2004-01-01 00:00:00

  • Mapping quantitative trait loci in a non-equilibrium population.

    abstract::The genetic control of a complex trait can be studied by testing and mapping the genotypes of the underlying quantitative trait loci (QTLs) through their associations with observable marker genotypes. All existing statistical methods for QTL mapping assume an equilibrium population, allowing marker-QTL associations to...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1578

    authors: Wu S,Yang J,Wu R

    更新日期:2010-01-01 00:00:00

  • Reproducibility of biomarker identifications from mass spectrometry proteomic data in cancer studies.

    abstract::Reproducibility of disease signatures and clinical biomarkers in multi-omics disease analysis has been a key challenge due to a multitude of factors. The heterogeneity of the limited sample, various biological factors such as environmental confounders, and the inherent experimental and technical noises, compounded wit...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.1515/sagmb-2018-0039

    authors: Liang Y,Kelemen A,Kelemen A

    更新日期:2019-05-11 00:00:00

  • A method to increase the power of multiple testing procedures through sample splitting.

    abstract::Consider the standard multiple testing problem where many hypotheses are to be tested, each hypothesis is associated with a test statistic, and large test statistics provide evidence against the null hypotheses. One proposal to provide probabilistic control of Type-I errors is the use of procedures ensuring that the e...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1148

    authors: Rubin D,Dudoit S,van der Laan M

    更新日期:2006-01-01 00:00:00

  • M-quantile regression analysis of temporal gene expression data.

    abstract::In this paper, we explore the use of M-quantile regression and M-quantile coefficients to detect statistical differences between temporal curves that belong to different experimental conditions. In particular, we consider the application of temporal gene expression data. Here, the aim is to detect genes whose temporal...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1452

    authors: Vinciotti V,Yu K

    更新日期:2009-01-01 00:00:00

  • The cyclohedron test for finding periodic genes in time course expression studies.

    abstract::The problem of finding periodically expressed genes from time course microarray experiments is at the center of numerous efforts to identify the molecular components of biological clocks. We present a new approach to this problem based on the cyclohedron test, which is a rank test inspired by recent advances in algebr...

    journal_title:Statistical applications in genetics and molecular biology

    pub_type: 杂志文章

    doi:10.2202/1544-6115.1286

    authors: Morton J,Pachter L,Shiu A,Sturmfels B

    更新日期:2007-01-01 00:00:00