Feature-specific penalized latent class analysis for genomic data.

Abstract:

Genomic data are often characterized by a moderate to large number of categorical variables observed for relatively few subjects. Some of the variables may be missing or noninformative. An example of such data is loss of heterozygosity (LOH), a dichotomous variable, observed on a moderate number of genetic markers. We first consider a latent class model where, conditional on unobserved membership in one of k classes, the variables are independent with probabilities determined by a regression model of low dimension q. Using a family of penalties including the ridge and LASSO, we extend this model to address higher-dimensional problems. Finally, we present an orthogonal map that transforms marker space to a space of "features" for which the constrained model has better predictive power. We demonstrate these methods on LOH data collected at 19 markers from 93 brain tumor patients. For this data set, the existing unpenalized latent class methodology does not produce estimates. Additionally, we show that posterior classes obtained from this method are associated with survival for these patients.

journal_name

Biometrics

journal_title

Biometrics

authors

Houseman EA,Coull BA,Betensky RA

doi

10.1111/j.1541-0420.2006.00566.x

subject

Has Abstract

pub_date

2006-12-01 00:00:00

pages

1062-70

issue

4

eissn

1541-0420

issn

0006-341X

pii

BIOM566

journal_volume

62

pub_type

Journal Article
  • Spatial-temporal modeling of the association between air pollution exposure and preterm birth: identifying critical windows of exposure.

    abstract::Exposure to high levels of air pollution during the pregnancy is associated with increased probability of preterm birth (PTB), a major cause of infant morbidity and mortality. New statistical methodology is required to specifically determine when a particular pollutant impacts the PTB outcome, to determine the role of...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2012.01774.x

    authors: Warren J,Fuentes M,Herring A,Langlois P

    update_date:2012-12-01 00:00:00

  • Sample size determination for establishing equivalence/noninferiority via ratio of two proportions in matched-pair design.

    abstract::In this article, we propose approximate sample size formulas for establishing equivalence or noninferiority of two treatments in a matched-pair design. Using the ratio of two proportions as the equivalence measure, we derive sample size formulas based on a score statistic for two types of analyses: hypothesis testing and...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2002.00957.x

    authors: Tang ML,Tang NS,Chan IS,Chan BP

    update_date:2002-12-01 00:00:00

  • Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial.

    abstract::In controlled clinical trials there are usually several prognostic factors known or thought to influence the patient's ability to respond to treatment. Therefore, the method of sequential treatment assignment needs to be designed so that treatment balance is simultaneously achieved across all such patient factors. Tra...

    journal_title:Biometrics

    pub_type: Clinical Trial,Journal Article

    doi:

    authors: Pocock SJ,Simon R

    update_date:1975-03-01 00:00:00

  • The effect of screening on some pretest-posttest test variances.

    abstract::The clinical trial design in which the endpoint is measured both at baseline and at the end of the study is used in a variety of situations. For two-group designs, tests such as the t test or analysis of covariance are commonly used to evaluate treatment efficacy. Often such pretest-posttest trials restrict participati...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Follmann DA

    update_date:1991-06-01 00:00:00

  • Semiparametric estimation of proportional mean residual life model in presence of censoring.

    abstract::A mean residual life function is the average remaining life of a surviving subject, as it varies with time. The proportional mean residual life model was proposed by Oakes and Dasu (1990, Biometrika 77, 409-410) in regression analysis to study its association with related covariates in absence of censoring. In this art...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2005.030224.x

    authors: Chen YQ,Jewell NP,Lei X,Cheng SC

    update_date:2005-03-01 00:00:00

  • False discovery rate estimation for frequentist pharmacovigilance signal detection methods.

    abstract::Pharmacovigilance systems aim at early detection of adverse effects of marketed drugs. They maintain large spontaneous reporting databases for which several automatic signaling methods have been developed. One limit of those methods is that the decision rules for the signal generation are based on arbitrary thresholds...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01262.x

    authors: Ahmed I,Dalmasso C,Haramburu F,Thiessard F,Broët P,Tubert-Bitter P

    update_date:2010-03-01 00:00:00

  • Drawing inferences for high-dimensional linear models: A selection-assisted partial regression and smoothing approach.

    abstract::Drawing inferences for high-dimensional models is challenging as regular asymptotic theories are not applicable. This article proposes a new framework of simultaneous estimation and inferences for high-dimensional linear models. By smoothing over partial regression estimates based on a given variable selection scheme,...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13013

    authors: Fei Z,Zhu J,Banerjee M,Li Y

    update_date:2019-06-01 00:00:00

  • Fitting nonlinear and constrained generalized estimating equations with optimization software.

    abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.01268.x

    authors: Contreras M,Ryan LM

    update_date:2000-12-01 00:00:00

  • A Bayesian approach to dose-response assessment and synergy and its application to in vitro dose-response studies.

    abstract::In this article, we propose a Bayesian approach to dose-response assessment and the assessment of synergy between two combined agents. We consider the case of an in vitro ovarian cancer research study aimed at investigating the antiproliferative activities of four agents, alone and paired, in two human ovarian cancer ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2010.01403.x

    authors: Hennessey VG,Rosner GL,Bast RC Jr,Chen MY

    update_date:2010-12-01 00:00:00

  • A one-step-ahead pseudo-DIC for comparison of Bayesian state-space models.

    abstract::In the context of state-space modeling, conventional usage of the deviance information criterion (DIC) evaluates the ability of the model to predict an observation at time t given the underlying state at time t. Motivated by the failure of conventional DIC to clearly choose between competing multivariate nonlinear Bay...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12237

    authors: Millar RB,McKechnie S

    update_date:2014-12-01 00:00:00

  • Alternative hypotheses for the effects of drugs in small-scale clinical studies.

    abstract::New drugs that will be investigated in the future are expected to deal with chronic diseases, where the number of patients available for controlled clinical trials will be small and where the long-term sequelae that it is hoped will be ameliorated take a long time to occur. Thus, it would be useful to construct powerf...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Salsburg D

    update_date:1986-09-01 00:00:00

  • Multipoint linkage analysis via Metropolis jumping kernels.

    abstract::Multipoint linkage analysis is being performed routinely in medical genetic studies to localize disease genes. This likelihood-based method is very computationally intensive. Exact computations are thus formidable for problems with a large number of genetic markers and complex pedigrees. This paper proposes a Monte Carl...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Lin S

    update_date:1996-12-01 00:00:00

  • An improved life table method.

    abstract::A life table estimates probabilities of surviving and of dying as well as death rates, as these would apply in a stationary population with the same underlying continuous mortality curve as the observed population. We have derived approximations to the probability of surviving that require no iteration, do not depend ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Keyfitz N,Frauenthal J

    update_date:1975-12-01 00:00:00

  • Multiple imputation for model checking: completed-data plots with missing and latent data.

    abstract::In problems with missing or latent data, a standard approach is to first impute the unobserved data, then perform all statistical analyses on the completed dataset--corresponding to the observed data and imputed unobserved data--using standard procedures for complete-data inference. Here, we extend this approach to mo...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2005.031010.x

    authors: Gelman A,Van Mechelen I,Verbeke G,Heitjan DF,Meulders M

    update_date:2005-03-01 00:00:00

  • A Bayesian integrative approach for multi-platform genomic data: A kidney cancer case study.

    abstract::Integration of genomic data from multiple platforms has the capability to increase precision, accuracy, and statistical power in the identification of prognostic biomarkers. A fundamental problem faced in many multi-platform studies is unbalanced sample sizes due to the inability to obtain measurements from all the pl...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12587

    authors: Chekouo T,Stingo FC,Doecke JD,Do KA

    update_date:2017-06-01 00:00:00

  • Exact two-sample inference with missing data.

    abstract::When comparing follow-up measurements from two independent populations, missing records may arise due to censoring by events whose occurrence is associated with baseline covariates. In these situations, inferences based only on the completely followed observations may be biased if the follow-up measurements and the co...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2005.00332.x

    authors: Cheung YK

    update_date:2005-06-01 00:00:00

  • A proportional hazards model for multivariate interval-censored failure time data.

    abstract::This paper focuses on the methodology developed for analyzing a multivariate interval-censored data set from an AIDS observational study. A purpose of the study was to determine the natural history of the opportunistic infection cytomegalovirus (CMV) in an HIV-infected individual. For this observational study, laborato...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.00940.x

    authors: Goggins WB,Finkelstein DM

    update_date:2000-09-01 00:00:00

  • Impact of time to start treatment following infection with application to initiating HAART in HIV-positive patients.

    abstract::We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data arises from biases associated with the nonrandom assignment of treatment, for example bias induced by d...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01738.x

    authors: Lok JJ,DeGruttola V

    update_date:2012-09-01 00:00:00

  • Variable selection and prediction using a nested, matched case-control study: Application to hospital acquired pneumonia in stroke patients.

    abstract::Matched case-control designs are commonly used in epidemiologic studies for increased efficiency. These designs have recently been introduced to the setting of modern imaging and genomic studies, which are characterized by high-dimensional covariates. However, appropriate statistical analyses that adjust for the match...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12113

    authors: Qian J,Payabvash S,Kemmling A,Lev MH,Schwamm LH,Betensky RA

    update_date:2014-03-01 00:00:00

  • A permutation approach for selecting the penalty parameter in penalized model selection.

    abstract::We describe a simple, computationally efficient, permutation-based procedure for selecting the penalty parameter in LASSO-penalized regression. The procedure, permutation selection, is intended for applications where variable selection is the primary focus, and can be applied in a variety of structural settings, inclu...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12359

    authors: Sabourin JA,Valdar W,Nobel AB

    update_date:2015-12-01 00:00:00

  • Order-restricted tests for stratified comparisons of binomial proportions.

    abstract::The data set presented relates a binomial response to ordered levels of an explanatory variable, representing doses of a drug, with data collected at several centers. A study goal is to test independence of the response and the ordinal factor, assuming under the alternative only that the binomial parameter is a monoto...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Agresti A,Coull BA

    update_date:1996-09-01 00:00:00

  • Analysis of ordered categorical data: two score-independent approaches.

    abstract::A trend test is often employed to analyze ordered categorical data, in which a set of increasing scores is assigned a priori. There is a drawback in this approach, because how to choose a set of scores is not clear. There have been debates on which scores should be used (e.g., Graubard and Korn, 1987, Biometric...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.00992.x

    authors: Zheng G

    update_date:2008-12-01 00:00:00

  • Bayesian calibration of a stochastic kinetic computer model using multiple data sources.

    abstract::In this article, we describe a Bayesian approach to the calibration of a stochastic computer model of chemical kinetics. As with many applications in the biological sciences, the data available to calibrate the model come from different sources. Furthermore, these data appear to provide somewhat conflicting informatio...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01245.x

    authors: Henderson DA,Boys RJ,Wilkinson DJ

    update_date:2010-03-01 00:00:00

  • Power and sample size for testing homogeneity of relative risks in prospective studies.

    abstract::Power and sample-size formulas for testing the homogeneity of relative risks using the score method are presented. The homogeneity score test (Gart, 1985, Biometrika 72, 673-677) is formally equivalent to the Pearson chi-square test, although they look different. Results of this paper may be useful in assessing the va...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.1999.00289.x

    authors: Nam JM

    update_date:1999-03-01 00:00:00

  • Hierarchical Bayes estimation of hunting success rates with spatial correlations.

    abstract::A Bayesian hierarchical generalized linear model is used to estimate hunting success rates at the subarea level for postseason harvest surveys. The model includes fixed week effects, random geographic effects, and spatial correlations between neighboring subareas. The computation is done by Gibbs sampling and adaptive...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.00360.x

    authors: He Z,Sun D

    update_date:2000-06-01 00:00:00

  • A multilevel mixed effects varying coefficient model with multilevel predictors and random effects for modeling hospitalization risk in patients on dialysis.

    abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13205

    authors: Li Y,Nguyen DV,Kürüm E,Rhee CM,Chen Y,Kalantar-Zadeh K,Şentürk D

    update_date:2020-09-01 00:00:00

  • Model checking for ROC regression analysis.

    abstract::The receiver operating characteristic (ROC) curve is a prominent tool for characterizing the accuracy of a continuous diagnostic test. To account for factors that might influence the test accuracy, various ROC regression methods have been proposed. However, as in any regression analysis, when the assumed models do not...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00620.x

    authors: Cai T,Zheng Y

    update_date:2007-03-01 00:00:00

  • Extension of the rank sum test for clustered data: two-group comparisons with group membership defined at the subunit level.

    abstract::The Wilcoxon rank sum test is widely used for two-group comparisons for nonnormal data. An assumption of this test is independence of sampling units both between and within groups. In ophthalmology, data are often collected on two eyes of an individual, which are highly correlated. In ophthalmological clinical trials,...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00582.x

    authors: Rosner B,Glynn RJ,Lee ML

    update_date:2006-12-01 00:00:00

  • Likelihood ratio tests for a dose-response effect using multiple nonlinear regression models.

    abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12563

    authors: Gutjahr G,Bornkamp B

    update_date:2017-03-01 00:00:00

  • Growth curve models of repeated binary response.

    abstract::Experimental designs that include repeated measures of binary response variables over time and under different conditions are common in biology. In such settings, it is often desirable to characterize the response pattern over time. When response variables are continuous, this characterization can be made in terms of ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Stanek EJ 3rd,Diehl SR

    update_date:1988-12-01 00:00:00