Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.

Abstract:

:We consider a semiparametric regression model that relates a normal outcome to covariates and a genetic pathway, where the covariate effects are modeled parametrically and the pathway effect of multiple gene expressions is modeled parametrically or nonparametrically using least-squares kernel machines (LSKMs). This unified framework allows a flexible function for the joint effect of multiple genes within a pathway by specifying a kernel function and allows for the possibility that each gene expression effect might be nonlinear and the genes within the same pathway are likely to interact with each other in a complicated way. This semiparametric model also makes it possible to test for the overall genetic pathway effect. We show that the LSKM semiparametric regression can be formulated using a linear mixed model. Estimation and inference hence can proceed within the linear mixed model framework using standard mixed model software. Both the regression coefficients of the covariate effects and the LSKM estimator of the genetic pathway effect can be obtained using the best linear unbiased predictor in the corresponding linear mixed model formulation. The smoothing parameter and the kernel parameter can be estimated as variance components using restricted maximum likelihood. A score test is developed to test for the genetic pathway effect. Model/variable selection within the LSKM framework is discussed. The methods are illustrated using a prostate cancer data set and evaluated using simulations.

journal_name

Biometrics

journal_title

Biometrics

authors

Liu D,Lin X,Ghosh D

doi

10.1111/j.1541-0420.2007.00799.x

subject

Has Abstract

pub_date

2007-12-01 00:00:00

pages

1079-88

issue

4

eissn

0006-341X

issn

1541-0420

pii

BIOM799

journal_volume

63

pub_type

杂志文章
  • Robustness of group testing in the estimation of proportions.

    abstract::In binomial group testing, unlike one-at-a-time testing, the test unit consists of a group of individuals, and each group is declared to be defective or nondefective. A defective group is one that is presumed to include one or more defective (e.g., infected, positive) individuals and a nondefective group to contain on...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00231.x

    authors: Hung M,Swallow WH

    更新日期:1999-03-01 00:00:00

  • Bayesian modeling of multiple lesion onset and growth from interval-censored data.

    abstract::In studying rates of occurrence and progression of lesions (or tumors), it is typically not possible to obtain exact onset times for each lesion. Instead, data consist of the number of lesions that reach a detectable size between screening examinations, along with measures of the size/severity of individual lesions at...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2004.00217.x

    authors: Dunson DB,Holloman C,Calder C,Gunn LH

    更新日期:2004-09-01 00:00:00

  • Regression analysis of multivariate grouped survival data.

    abstract::Multivariate failure time data arise when each study subject may experience several types of event or when there are clusterings of observational units such that failure times within the same cluster are correlated. The failure times are often subject to interval grouping or have truly discrete measurements. In this p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Guo SW,Lin DY

    更新日期:1994-09-01 00:00:00

  • A Bayesian goodness of fit test and semiparametric generalization of logistic regression with measurement data.

    abstract::Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12007

    authors: Schörgendorfer A,Branscum AJ,Hanson TE

    更新日期:2013-06-01 00:00:00

  • An implicitly defined parametric model for censored survival data and covariates.

    abstract::Parametric survival functions are usually defined as explicit functions of time and covariates. However, consideration of some simple differential equations describing certain survival curves leads to a descriptive equation which cannot be explicitly solved for the survival function. Nevertheless, the resulting surviv...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Piantadosi S,Crowley J

    更新日期:1995-03-01 00:00:00

  • Functional multiple indicators, multiple causes measurement error models.

    abstract::Objective measures of oxygen consumption and carbon dioxide production by mammals are used to predict their energy expenditure. Since energy expenditure is not directly observable, it can be viewed as a latent construct with multiple physical indirect measures such as respiratory quotient, volumetric oxygen consumptio...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12706

    authors: Tekwe CD,Zoh RS,Bazer FW,Wu G,Carroll RJ

    更新日期:2018-03-01 00:00:00

  • Maximum likelihood estimation for N-mixture models.

    abstract::The focus of this article is on the nature of the likelihood associated with N-mixture models for repeated count data. It is shown that the infinite sum embedded in the likelihood associated with the Poisson mixing distribution can be expressed in terms of a hypergeometric function and, thence, in closed form. The res...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12521

    authors: Haines LM

    更新日期:2016-12-01 00:00:00

  • Simple test for the Hardy-Weinberg law for HLA data with no observed double blanks.

    abstract::Eguchi and Matsuura (1990, Biometrics 46, 415-426) noted that the generalized Stevens test statistic for the Hardy-Weinberg law for human leukocyte antigen (HLA) data yields an excessively large value when no double blanks are observed. In this paper, we investigated this aberrant case. The inflated value of the test ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Nam J

    更新日期:1995-03-01 00:00:00

  • Small sample inference for fixed effects from restricted maximum likelihood.

    abstract::Restricted maximum likelihood (REML) is now well established as a method for estimating the parameters of the general Gaussian linear model with a structured covariance matrix, in particular for mixed linear models. Conventionally, estimates of precision and inference for fixed effects are based on their asymptotic di...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Kenward MG,Roger JH

    更新日期:1997-09-01 00:00:00

  • A mixed model for repeated dilution assays.

    abstract::We propose a generalized linear mixed model to estimate and test marginal effects on titers repeatedly measured by serial dilution assays. The link is log-log and the titer is assumed to follow a gamma distribution. The parameters are estimated by generalized estimating equations. The marginal effects are tested by me...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Bloch J,Chavance M

    更新日期:1998-06-01 00:00:00

  • Receiver operating characteristic curves and confidence bands for support vector machines.

    abstract::Many problems that appear in biomedical decision-making, such as diagnosing disease and predicting response to treatment, can be expressed as binary classification problems. The support vector machine (SVM) is a popular classification technique that is robust to model misspecification and effectively handles high-dime...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13365

    authors: Luckett DJ,Laber EB,El-Kamary SS,Fan C,Jhaveri R,Perou CM,Shebl FM,Kosorok MR

    更新日期:2020-08-31 00:00:00

  • Sequential model selection-based segmentation to detect DNA copy number variation.

    abstract::Array-based CGH experiments are designed to detect genomic aberrations or regions of DNA copy-number variation that are associated with an outcome, typically a state of disease. Most of the existing statistical methods target on detecting DNA copy number variations in a single sample or array. We focus on the detectio...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12478

    authors: Hu J,Zhang L,Wang HJ

    更新日期:2016-09-01 00:00:00

  • Sequential equivalence testing and repeated confidence intervals, with applications to normal and binary responses.

    abstract::We propose group sequential tests of the equivalence of two treatments based on ideas related to repeated confidence intervals. These tests adapt readily to unpredictable group sizes, to the possibility of continuing even though a boundary has been crossed, and to nonnormal observations. In comparing two binomial dist...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Jennison C,Turnbull BW

    更新日期:1993-03-01 00:00:00

  • Efficient analysis of Weibull survival data from experiments on heterogeneous patient populations.

    abstract::An efficient method is presented for analyses of death rated in one-way or cross-classified experiments where expected survival time for a patient at time of entry on trial is a function of observable covariates. The survival-time distribution used is a Weibull form of Cox's (1972) model. The analysis proceeds in two ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Williams JS

    更新日期:1978-06-01 00:00:00

  • Impact of time to start treatment following infection with application to initiating HAART in HIV-positive patients.

    abstract::We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data arises from biases associated with the nonrandom assignment of treatment, for example bias induced by d...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01738.x

    authors: Lok JJ,DeGruttola V

    更新日期:2012-09-01 00:00:00

  • Applications of likelihood asymptotics for nonlinear regression in herbicide bioassays.

    abstract::Dose-response models are intensively used in herbicide bioassays. Despite recent advancements in the development of new herbicides, statistical analyses are commonly based on asymptotic approximations that are sometimes poor. This paper presents the use of recent results in higher order asymptotics for likelihood-base...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.01204.x

    authors: Bellio R,Jensen JE,Seiden P

    更新日期:2000-12-01 00:00:00

  • Interval estimation of the risk ratio between a secondary infection, given a primary infection, and the primary infection.

    abstract::This paper discusses interval estimation of the risk ratio (RR) between a secondary infection, given a primary infection, and the primary infection. Three asymptotic closed-form interval estimators are developed using Wald's test statistic, the logarithmic transformation, and Fieller's theorem. The performance of thes...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lui KJ

    更新日期:1998-06-01 00:00:00

  • Assessing the causal effect of organ transplantation on the distribution of residual lifetime.

    abstract::Because the number of patients waiting for organ transplants exceeds the number of organs available, a better understanding of how transplantation affects the distribution of residual lifetime is needed to improve organ allocation. However, there has been little work to assess the survival benefit of transplantation f...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12084

    authors: Vock DM,Tsiatis AA,Davidian M,Laber EB,Tsuang WM,Finlen Copeland CA,Palmer SM

    更新日期:2013-12-01 00:00:00

  • A biological marker model for predicting disease transitions.

    abstract::For patients with chronic myelogenous leukemia (CML), the effect of elevated blood levels of adenosine deaminase (ADA) is studied as a marker for transitions from stable disease to blast crisis and then to death. Data in the form of snapshots over time, with day, state of disease, and ADA level, are analyzed for 55 pa...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Klein JP,Klotz JH,Grever MR

    更新日期:1984-12-01 00:00:00

  • Operating characteristics of a rank correlation test for publication bias.

    abstract::An adjusted rank correlation test is proposed as a technique for identifying publication bias in a meta-analysis, and its operating characteristics are evaluated via simulations. The test statistic is a direct statistical analogue of the popular "funnel-graph." The number of component studies in the meta-analysis, the...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Begg CB,Mazumdar M

    更新日期:1994-12-01 00:00:00

  • On pooling across strata when frequency matching has been followed in a cohort study.

    abstract::In a study designed to assess the relationship between a dichotomous exposure and the eventual occurrence of a dichotomous outcome, frequency matching has been proposed as a way to balance the exposure cohorts with respect to the sampling distribution of potential confounding factors. This paper discusses the pooled e...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Weinberg CR

    更新日期:1985-03-01 00:00:00

  • Discriminant diagnostics.

    abstract::I discuss diagnostic methods for discriminant analysis. The equivalence with linear regression is noted and regression diagnostics are considered. The leverage is a function of the linear discriminant function and the Mahalanobis distance of the observation from the group mean. The distribution of this distance is app...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lachenbruch PA

    更新日期:1997-12-01 00:00:00

  • Effects of exposure misclassification on regression analyses of epidemiologic follow-up study data.

    abstract::In epidemiologic studies, subjects are often misclassified as to their level of exposure. Ignoring this misclassification error in the analysis introduces bias in the estimates of certain parameters and invalidates many hypothesis tests. For situations in which there is misclassification of exposure in a follow-up stu...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Reade-Christopher SJ,Kupper LL

    更新日期:1991-06-01 00:00:00

  • Confidence intervals and P-values for meta-analysis with publication bias.

    abstract::We study publication bias in meta-analysis by supposing there is a population (y, sigma) of studies which give treatment effect estimates y approximately N(theta, sigma(2)). A selection function describes the probability that each study is selected for review. The overall estimate of theta depends on the studies selec...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00705.x

    authors: Henmi M,Copas JB,Eguchi S

    更新日期:2007-06-01 00:00:00

  • An improved life table method.

    abstract::A life table estimates probabilities of surviving and of dying as well as death rates, as these would apply in a stationary population with the same underlying continuous mortality curve as the observed population. We have derived approximations to the probability of surviving that require no iteration, do not depend ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Keyfitz N,Frauenthal J

    更新日期:1975-12-01 00:00:00

  • Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies.

    abstract::Quantitative traits analyzed in Genome-Wide Association Studies (GWAS) are often nonnormally distributed. For such traits, association tests based on standard linear regression are subject to reduced power and inflated type I error in finite samples. Applying the rank-based inverse normal transformation (INT) to nonno...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13214

    authors: McCaw ZR,Lane JM,Saxena R,Redline S,Lin X

    更新日期:2020-12-01 00:00:00

  • Designing phase II studies in the context of a programme of clinical research.

    abstract::Conventional statistical determinations of sample size in phase II studies typically lead to sample sizes of the order of 25 (Schoenfeld, 1980, International Journal of Radiation Oncology, Biology and Physics 6, 371-374). When the development of new treatments is proceeding rapidly relative to the recruitment of suita...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Whitehead J

    更新日期:1985-06-01 00:00:00

  • Hypothesis testing of matrix graph model with application to brain connectivity analysis.

    abstract::Brain connectivity analysis is now at the foreground of neuroscience research. A connectivity network is characterized by a graph, where nodes represent neural elements such as neurons and brain regions, and links represent statistical dependence that is often encoded in terms of partial correlation. Such a graph is i...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12633

    authors: Xia Y,Li L

    更新日期:2017-09-01 00:00:00

  • A signed-rank test for clustered data.

    abstract::We consider the problem of comparing two outcome measures when the pairs are clustered. Using the general principle of within-cluster resampling, we obtain a novel signed-rank test for clustered paired data. We show by a simple informative cluster size simulation model that only our test maintains the correct size und...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2007.00923.x

    authors: Datta S,Satten GA

    更新日期:2008-06-01 00:00:00

  • Multi-subgroup gene screening using semi-parametric hierarchical mixture models and the optimal discovery procedure: Application to a randomized clinical trial in multiple myeloma.

    abstract::This article proposes an efficient approach to screening genes associated with a phenotypic variable of interest in genomic studies with subgroups. In order to capture and detect various association profiles across subgroups, we flexibly estimate the underlying effect size distribution across subgroups using a semi-pa...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12716

    authors: Matsui S,Noma H,Qu P,Sakai Y,Matsui K,Heuck C,Crowley J

    更新日期:2018-03-01 00:00:00