当前位置： SCI文献检索 > Statistical Applications in Genetics and Molecular Biology期刊下所有文献 > Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting.

Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting.

Abstract：

:Unraveling interactions among variables such as genetic, clinical, demographic and environmental factors is essential to understand the development of common and complex diseases. To increase the power to detect such variables interactions associated with clinical time-to-events outcomes, we borrowed established concepts from random survival forest (RSF) models. We introduce a novel RSF-based pairwise interaction estimator and derive a randomization method with bootstrap confidence intervals for inferring interaction significance. Using various linear and nonlinear time-to-events survival models in simulation studies, we first show the efficiency of our approach: true pairwise interaction-effects between variables are uncovered, while they may not be accompanied with their corresponding main-effects, and may not be detected by standard semi-parametric regression modeling and test statistics used in survival analysis. Moreover, using a RSF-based cross-validation scheme for generating prediction estimators, we show that informative predictors may be inferred. We applied our approach to an HIV cohort study recording key host gene polymorphisms and their association with HIV change of tropism or AIDS progression. Altogether, this shows how linear or nonlinear pairwise statistical interactions of variables may be efficiently detected with a predictive value in observational studies with time-to-event outcomes.

journal_name

Stat Appl Genet Mol Biol

journal_title

Statistical applications in genetics and molecular biology

authors

Dazard JE,Ishwaran H,Mehlotra R,Weinberg A,Zimmerman P

doi

10.1515/sagmb-2017-0038

subject

Has Abstract

pub_date

2018-02-17 00:00:00

issue

1

eissn

2194-6302

issn

1544-6115

pii

/j/sagmb.2018.17.issue-1/sagmb-2017-0038/sagmb-201

journal_volume

17

pub_type

杂志文章

相关文献

Statistical Applications in Genetics and Molecular Biology文献大全

Polyunphased: an extension to polytomous outcomes of the Unphased package for family-based genetic association analysis.
abstract：:Polytomous phenotypes arise when a disease has multiple subtypes or when two dichotomous phenotypes are analyzed simultaneously. Few software programs offer the option to analyze such phenotypes in family studies, and none implements conditional polytomous logistic regression for within-family analysis robust to popul...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2016-0035

authors： Bureau A,Croteau J

更新日期：2017-03-01 00:00:00
Sampling correction in pedigree analysis.
abstract：:Usually, a pedigree is sampled and included in the sample that is analyzed after following a predefined non-random sampling design comprising several specific procedures. To obtain a pedigree analysis result free from the bias caused by the sampling procedures, a correction is applied to the pedigree likelihood. The s...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1003

authors： Ginsburg E,Malkin I,Elston RC

更新日期：2003-01-01 00:00:00
On the operational characteristics of the Benjamini and Hochberg False Discovery Rate procedure.
abstract：:Multiple testing procedures are commonly used in gene expression studies for the detection of differential expression, where typically thousands of genes are measured over at least two experimental conditions. Given the need for powerful testing procedures, and the attendant danger of false positives in multiple testi...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1302

authors： Green GH,Diggle PJ

更新日期：2007-01-01 00:00:00
Surveying the manifold divergence of an entire protein class for statistical clues to underlying biochemical mechanisms.
abstract：:Certain residues have no known function yet are co-conserved across distantly related protein families and diverse organisms, suggesting that they perform critical roles associated with as-yet-unidentified molecular properties and mechanisms. This raises the question of how to obtain additional clues regarding these m...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1666

authors： Neuwald AF

更新日期：2011-01-01 00:00:00
On an extended interpretation of linkage disequilibrium in genetic case-control association studies.
abstract：:We are concerned with statistical inference for 2 × C × K contingency tables in the context of genetic case-control association studies. Multivariate methods based on asymptotic Gaussianity of vectors of test statistics require information about the asymptotic correlation structure among these test statistics under th...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2015-0024

authors： Dickhaus T,Stange J,Demirhan H

更新日期：2015-11-01 00:00:00
Modeling, simulation and analysis of methylation profiles from reduced representation bisulfite sequencing experiments.
abstract：:The ENCODE project has funded the generation of a diverse collection of methylation profiles using reduced representation bisulfite sequencing (RRBS) technology, enabling the analysis of epigenetic variation on a genomic scale at single-site resolution. A standard application of RRBS experiments is in the location of ...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2013-0027

authors： Lacey MR,Baribault C,Ehrlich M

更新日期：2013-12-01 00:00:00
A multiple testing approach to high-dimensional association studies with an application to the detection of associations between risk factors of heart disease and genetic polymorphisms.
abstract：:We present an approach to association studies involving a dozen or so ;response' variables and a few hundred ;explanatory' variables which emphasizes transparency, simplicity, and protection against spurious results. The methods proposed are largely non-parametric, and they are systematically rounded-off by the Benjam...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1420

authors： Ferreira JA,Berkhof J,Souverein O,Zwinderman K

更新日期：2009-01-01 00:00:00
A test for detecting differential indirect trans effects between two groups of samples.
abstract：:Integrative analysis of copy number and gene expression data can help in understanding the cis and trans effect of copy number aberrations on transcription levels of genes involved in a pathway. To analyse how these copy number mediated gene-gene interactions differ between groups of samples we propose a new method, n...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2017-0058

authors： Chaturvedi N,Menezes RX,Goeman JJ,Wieringen WV

更新日期：2018-07-31 00:00:00
M-quantile regression analysis of temporal gene expression data.
abstract：:In this paper, we explore the use of M-quantile regression and M-quantile coefficients to detect statistical differences between temporal curves that belong to different experimental conditions. In particular, we consider the application of temporal gene expression data. Here, the aim is to detect genes whose temporal...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1452

authors： Vinciotti V,Yu K

更新日期：2009-01-01 00:00:00
The cyclohedron test for finding periodic genes in time course expression studies.
abstract：:The problem of finding periodically expressed genes from time course microarray experiments is at the center of numerous efforts to identify the molecular components of biological clocks. We present a new approach to this problem based on the cyclohedron test, which is a rank test inspired by recent advances in algebr...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1286

authors： Morton J,Pachter L,Shiu A,Sturmfels B

更新日期：2007-01-01 00:00:00
Addressing the shortcomings of three recent Bayesian methods for detecting interspecific recombination in DNA sequence alignments.
abstract：:We address a potential shortcoming of three probabilistic models for detecting interspecific recombination in DNA sequence alignments: the multiple change-point model (MCP) of Suchard et al. (2003), the dual multiple change-point model (DMCP) of Minin et al. (2005), and the phylogenetic factorial hidden Markov model (...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1399

authors： Husmeier D,Mantzaris AV

更新日期：2008-01-01 00:00:00
Dimension reduction for classification with gene expression microarray data.
abstract：:An important application of gene expression microarray data is classification of biological samples or prediction of clinical and other outcomes. One necessary part of multivariate statistical analysis in such applications is dimension reduction. This paper provides a comparison study of three dimension reduction tech...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1147

authors： Dai JJ,Lieu L,Rocke D

更新日期：2006-01-01 00:00:00
Transmission disequilibrium test power and sample size in the presence of locus heterogeneity.
abstract：:Locus heterogeneity is one of the most important issues in gene mapping and can cause significant reductions in statistical power for gene mapping, yet no research to date has provided power and sample size calculations for family-based association methods in the presence of locus heterogeneity. The purpose of this re...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1501

authors： Chen C,Yang G,Buyske S,Matise T,Finch SJ,Gordon D

更新日期：2009-01-01 00:00:00
Genetic association test based on principal component analysis.
abstract：:Many gene- and pathway-based association tests have been proposed in the literature. Among them, the SKAT is widely used, especially for rare variants association studies. In this paper, we investigate the connection between SKAT and a principal component analysis. This investigation leads to a procedure that encompas...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2016-0061

authors： Chen Z,Han S,Wang K

更新日期：2017-07-26 00:00:00
Genetic linkage analysis in the presence of germline mosaicism.
abstract：:Germline mosaicism is a genetic condition in which some germ cells of an individual contain a mutation. This condition violates the assumptions underlying classic genetic analysis and may lead to failure of such analysis. In this work we extend the statistical model used for genetic linkage analysis in order to incorp...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1709

authors： Weissbrod O,Geiger D

更新日期：2011-10-04 00:00:00
Combining nearest neighbor classifiers versus cross-validation selection.
abstract：:Various discriminant methods have been applied for classification of tumors based on gene expression profiles, among which the nearest neighbor (NN) method has been reported to perform relatively well. Usually cross-validation (CV) is used to select the neighbor size as well as the number of variables for the NN metho...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1054

authors： Paik M,Yang Y

更新日期：2004-01-01 00:00:00
Discrete Wavelet Packet Transform Based Discriminant Analysis for Whole Genome Sequences.
abstract：:In recent years, alignment-free methods have been widely applied in comparing genome sequences, as these methods compute efficiently and provide desirable phylogenetic analysis results. These methods have been successfully combined with hierarchical clustering methods for finding phylogenetic trees. However, it may no...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2018-0045

authors： Huang HH,Girimurugan SB

更新日期：2019-02-15 00:00:00
Variance and covariance heterogeneity analysis for detection of metabolites associated with cadmium exposure.
abstract：:In this study, we propose a novel statistical framework for detecting progressive changes in molecular traits as response to a pathogenic stimulus. In particular, we propose to employ Bayesian hierarchical models to analyse changes in mean level, variance and correlation of metabolic traits in relation to covariates. ...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2013-0041

authors： Salamanca BV,Ebbels TM,Iorio MD

更新日期：2014-04-01 00:00:00
Second order optimization for the inference of gene regulatory pathways.
abstract：:With the increasing availability of experimental data on gene interactions, modeling of gene regulatory pathways has gained special attention. Gradient descent algorithms have been widely used for regression and classification applications. Unfortunately, results obtained after training a model by gradient descent are...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2012-0021

authors： Das M,Murthy CA,De RK

更新日期：2014-02-01 00:00:00
The relative inefficiency of sequence weights approaches in determining a nucleotide position weight matrix.
abstract：:Approaches based upon sequence weights, to construct a position weight matrix of nucleotides from aligned inputs, are popular but little effort has been expended to measure their quality. We derive optimal sequence weights that minimize the sum of the variances of the estimators of base frequency parameters for sequen...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1135

authors： Newberg LA,McCue LA,Lawrence CE

更新日期：2005-01-01 00:00:00
Fully Bayesian mixture model for differential gene expression: simulations and model checks.
abstract：:We present a Bayesian hierarchical model for detecting differentially expressed genes using a mixture prior on the parameters representing differential effects. We formulate an easily interpretable 3-component mixture to classify genes as over-expressed, under-expressed and non-differentially expressed, and model gene...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1314

authors： Lewin A,Bochkina N,Richardson S

更新日期：2007-01-01 00:00:00
Asymptotic optimality of likelihood-based cross-validation.
abstract：:Likelihood-based cross-validation is a statistical tool for selecting a density estimate based on n i.i.d. observations from the true density among a collection of candidate density estimators. General examples are the selection of a model indexing a maximum likelihood estimator, and the selection of a bandwidth index...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1036

authors： van der Laan MJ,Dudoit S,Keles S

更新日期：2004-01-01 00:00:00
Node sampling for protein complex estimation in bait-prey graphs.
abstract：:In cellular biology, node-and-edge graph or "network" data collection often uses bait-prey technologies such as co-immunoprecipitation (CoIP). Bait-prey technologies assay relationships or "interactions" between protein pairs, with CoIP specifically measuring protein complex co-membership. Analyses of CoIP data freque...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2015-0007

authors： Scholtens DM,Spencer BD

更新日期：2015-08-01 00:00:00
MLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions.
abstract：:Accurately measuring epigenetic marks such as 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) at the single-nucleotide level, requires combining data from DNA processing methods including traditional (BS), oxidative (oxBS) or Tet-Assisted (TAB) bisulfite conversion. We introduce the R package MLML2R, which...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2018-0031

authors： Kiihl SF,Martinez-Garrido MJ,Domingo-Relloso A,Bermudez J,Tellez-Plaza M

更新日期：2019-01-17 00:00:00
Likelihood-based inference for multi-color optical mapping.
abstract：:Multi-color optical mapping is a new technique being developed to obtain detailed physical maps (indicating relative positions of various recognition sites) of DNA molecules. We consider a study design in which the data consist of noisy observations of multiple copies of a DNA molecule marked with colors at recognitio...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1266

authors： Tong L,Mets L,McPeek MS

更新日期：2007-01-01 00:00:00
Multiple testing in candidate gene situations: a comparison of classical, discrete, and resampling-based procedures.
abstract：:In candidate gene association studies, usually several elementary hypotheses are tested simultaneously using one particular set of data. The data normally consist of partly correlated SNP information. Every SNP can be tested for association with the disease, e.g., using the Cochran-Armitage test for trend. To account ...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1729

authors： Elsäβer A,Victor A,Hommel G

更新日期：2011-01-01 00:00:00
BayesMendel: an R environment for Mendelian risk prediction.
abstract：:Several important syndromes are caused by deleterious germline mutations of individual genes. In both clinical and research applications it is useful to evaluate the probability that an individual carries an inherited genetic variant of these genes, and to predict the risk of disease for that individual, using informa...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1063

authors： Chen S,Wang W,Broman KW,Katki HA,Parmigiani G

更新日期：2004-01-01 00:00:00
Accounting for undetected compounds in statistical analyses of mass spectrometry 'omic studies.
abstract：:Mass spectrometry is an important high-throughput technique for profiling small molecular compounds in biological samples and is widely used to identify potential diagnostic and prognostic compounds associated with disease. Commonly, this data generated by mass spectrometry has many missing values resulting when a com...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2013-0021

authors： Taylor SL,Leiserowitz GS,Kim K

更新日期：2013-12-01 00:00:00
Semi-parametric differential expression analysis via partial mixture estimation.
abstract：:We develop an approach for microarray differential expression analysis, i.e. identifying genes whose expression levels differ between two or more groups. Current approaches to inference rely either on full parametric assumptions or on permutation-based techniques for sampling under the null distribution. In some situa...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.2202/1544-6115.1333

authors： Rossell D,Guerra R,Scott C

更新日期：2008-01-01 00:00:00
Reproducibility of biomarker identifications from mass spectrometry proteomic data in cancer studies.
abstract：:Reproducibility of disease signatures and clinical biomarkers in multi-omics disease analysis has been a key challenge due to a multitude of factors. The heterogeneity of the limited sample, various biological factors such as environmental confounders, and the inherent experimental and technical noises, compounded wit...

journal_title：Statistical applications in genetics and molecular biology

pub_type： 杂志文章

doi：10.1515/sagmb-2018-0039

authors： Liang Y,Kelemen A,Kelemen A

更新日期：2019-05-11 00:00:00