Feature-specific penalized latent class analysis for genomic data.

Abstract:

:Genomic data are often characterized by a moderate to large number of categorical variables observed for relatively few subjects. Some of the variables may be missing or noninformative. An example of such data is loss of heterozygosity (LOH), a dichotomous variable, observed on a moderate number of genetic markers. We first consider a latent class model where, conditional on unobserved membership in one of k classes, the variables are independent with probabilities determined by a regression model of low dimension q. Using a family of penalties including the ridge and LASSO, we extend this model to address higher-dimensional problems. Finally, we present an orthogonal map that transforms marker space to a space of "features" for which the constrained model has better predictive power. We demonstrate these methods on LOH data collected at 19 markers from 93 brain tumor patients. For this data set, the existing unpenalized latent class methodology does not produce estimates. Additionally, we show that posterior classes obtained from this method are associated with survival for these patients.

journal_name

Biometrics

journal_title

Biometrics

authors

Houseman EA,Coull BA,Betensky RA

doi

10.1111/j.1541-0420.2006.00566.x

subject

Has Abstract

pub_date

2006-12-01 00:00:00

pages

1062-70

issue

4

eissn

0006-341X

issn

1541-0420

pii

BIOM566

journal_volume

62

pub_type

杂志文章
  • A multilevel mixed effects varying coefficient model with multilevel predictors and random effects for modeling hospitalization risk in patients on dialysis.

    abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13205

    authors: Li Y,Nguyen DV,Kürüm E,Rhee CM,Chen Y,Kalantar-Zadeh K,Şentürk D

    更新日期:2020-09-01 00:00:00

  • Time series models based on generalized linear models: some further results.

    abstract::This paper considers the problem of extending the classical moving average models to time series with conditional distributions given by generalized linear models. These models have the advantage of easy construction and estimation. Statistical modelling techniques are also proposed. Some simulation results and an ill...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Li WK

    更新日期:1994-06-01 00:00:00

  • Comparing the performances of Diggle's tests of spatial randomness for small samples with and without edge-effect correction: application to ecological data.

    abstract::Diggle's tests of spatial randomness based on empirical distributions of interpoint distances can be performed with and without edge-effect correction. We present here numerical results illustrating that tests without the edge-effect correction proposed by Diggle (1979, Biometrics 35, 87-101) have a higher power for s...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00156.x

    authors: Gignoux J,Duby C,Barot S

    更新日期:1999-03-01 00:00:00

  • A moving blocks empirical likelihood method for longitudinal data.

    abstract::In the analysis of longitudinal or panel data, neglecting the serial correlations among the repeated measurements within subjects may lead to inefficient inference. In particular, when the number of repeated measurements is large, it may be desirable to model the serial correlations more generally. An appealing approa...

    journal_title:Biometrics

    pub_type: 杂志文章,随机对照试验

    doi:10.1111/biom.12317

    authors: Qiu J,Wu L

    更新日期:2015-09-01 00:00:00

  • Dose-finding designs for HIV studies.

    abstract::We present a class of simple designs that can be used in early dose-finding studies in HIV. Such designs, in contrast with Phase I designs in cancer, have a lot of the Phase II flavor about them. Information on efficacy is obtained during the trial and is as important as that relating to toxicity. The designs proposed...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.01018.x

    authors: O'Quigley J,Hughes MD,Fenton T

    更新日期:2001-12-01 00:00:00

  • Sample size determination for establishing equivalence/noninferiority via ratio of two proportions in matched-pair design.

    abstract::In this article, we propose approximate sample size formulas for establishing equivalence or noninferiority of two treatments in match-pairs design. Using the ratio of two proportions as the equivalence measure, we derive sample size formulas based on a score statistic for two types of analyses: hypothesis testing and...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2002.00957.x

    authors: Tang ML,Tang NS,Chan IS,Chan BP

    更新日期:2002-12-01 00:00:00

  • Marginal analysis of correlated failure time data with informative cluster sizes.

    abstract::We consider modeling correlated survival data when cluster sizes may be informative to the outcome of interest based on a within-cluster resampling (WCR) approach and a weighted score function (WSF) method. We derive the large sample properties for the WCR estimators under the Cox proportional hazards model. We establ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00730.x

    authors: Cong XJ,Yin G,Shen Y

    更新日期:2007-09-01 00:00:00

  • Calculating sample size for studies with expected all-or-none nonadherence and selection bias.

    abstract:SUMMARY:We develop sample size formulas for studies aiming to test mean differences between a treatment and control group when all-or-none nonadherence (noncompliance) and selection bias are expected. Recent work by Fay, Halloran, and Follmann (2007, Biometrics 63, 465-474) addressed the increased variances within grou...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01114.x

    authors: Shardell MD,El-Kamary SS

    更新日期:2009-06-01 00:00:00

  • Nonparametric comparison of two survival-time distributions in the presence of dependent censoring.

    abstract::When testing the null hypothesis that treatment arm-specific survival-time distributions are equal, the log-rank test is asymptotically valid when the distribution of time to censoring is conditionally independent of randomized treatment group given survival time. We introduce a test of the null hypothesis for use whe...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/1541-0420.00059

    authors: DiRienzo AG

    更新日期:2003-09-01 00:00:00

  • A multi-source adaptive platform design for testing sequential combinatorial therapeutic strategies.

    abstract::Traditional paradigms for clinical translation are challenged in settings where multiple contemporaneous therapeutic strategies have been identified as potentially beneficial. Platform trials have emerged as an approach for sequentially comparing multiple trials using a single protocol. The Ebola virus disease outbrea...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12841

    authors: Kaizer AM,Hobbs BP,Koopmeiners JS

    更新日期:2018-09-01 00:00:00

  • Bayesian experimental design for nonlinear mixed-effects models with application to HIV dynamics.

    abstract::Bayesian experimental design is investigated for Bayesian analysis of nonlinear mixed-effects models. Existence of the posterior risk for parameter estimation is shown. When the same prior distribution is used for both design and inference, existence of the preposterior risk for design is also proven. If the prior dis...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2004.00148.x

    authors: Han C,Chaloner K

    更新日期:2004-03-01 00:00:00

  • A stochastic model for the analysis of bivariate longitudinal AIDS data.

    abstract::We present a model for multivariate repeated measures that incorporates random effects, correlated stochastic processes, and measurement errors. The model is a multivariate generalization of the model for univariate longitudinal data given by Taylor, Cumberland, and Sy (1994, Journal of the American Statistical Associ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Sy JP,Taylor JM,Cumberland WG

    更新日期:1997-06-01 00:00:00

  • Impact of time to start treatment following infection with application to initiating HAART in HIV-positive patients.

    abstract::We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data arises from biases associated with the nonrandom assignment of treatment, for example bias induced by d...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01738.x

    authors: Lok JJ,DeGruttola V

    更新日期:2012-09-01 00:00:00

  • Extraction of food consumption systems by nonnegative matrix factorization (NMF) for the assessment of food choices.

    abstract::In Western countries where food supply is satisfactory, consumers organize their diets around a large combination of foods. It is the purpose of this article to examine how recent nonnegative matrix factorization (NMF) techniques can be applied to food consumption data to understand these combinations. Such data are n...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01588.x

    authors: Zetlaoui M,Feinberg M,Verger P,Clémençon S

    更新日期:2011-12-01 00:00:00

  • Partially supervised learning using an EM-boosting algorithm.

    abstract::Training data in a supervised learning problem consist of the class label and its potential predictors for a set of observations. Constructing effective classifiers from training data is the goal of supervised learning. In biomedical sciences and other scientific applications, class labels may be subject to errors. We...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2004.00156.x

    authors: Yasui Y,Pepe M,Hsu L,Adam BL,Feng Z

    更新日期:2004-03-01 00:00:00

  • Additive gamma frailty models with applications to competing risks in related individuals.

    abstract::Epidemiological studies of related individuals are often complicated by the fact that follow-up on the event type of interest is incomplete due to the occurrence of other events. We suggest a class of frailty models with cause-specific hazards for correlated competing events in related individuals. The frailties are b...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12326

    authors: Eriksson F,Scheike T

    更新日期:2015-09-01 00:00:00

  • Likelihood ratio tests for a dose-response effect using multiple nonlinear regression models.

    abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12563

    authors: Gutjahr G,Bornkamp B

    更新日期:2017-03-01 00:00:00

  • Estimating predictors for long- or short-term survivors.

    abstract::Suppose that the response variable in a well-executed clinical or observational study to evaluate a treatment is the time to a certain event, and a set of baseline covariates or predictors was collected for each study patient. Furthermore, suppose that a significant number of study patients had nontrivial, long-term a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2003.00116.x

    authors: Tian L,Wang W,Wei LJ

    更新日期:2003-12-01 00:00:00

  • Hypothesis testing under mixture models: application to genetic linkage analysis.

    abstract::In this paper we propose a new class of statistics to test a simple hypothesis against a family of alternatives characterized by a mixture model. Unlike the likelihood ratio statistic, whose large sample distribution is still unknown in this situation, these new statistics have a simple asymptotic distribution to whic...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00065.x

    authors: Liang KY,Rathouz PJ

    更新日期:1999-03-01 00:00:00

  • Use of historical marker data for assessing treatment effects in phase I/II trials when subject selection is determined by baseline marker level.

    abstract::Although the primary focus of Phase I clinical trials is to assess clinical pharmacology and possible toxicities, any information on the potential effect of treatment would be useful in helping to determine priorities between treatments for further study. We consider the scenario where data are routinely collected on ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lin HM,Hughes MD

    更新日期:1995-09-01 00:00:00

  • Unbiased and locally efficient estimation of genetic effect on quantitative trait in the presence of population admixture.

    abstract::Population admixture can be a confounding factor in genetic association studies. Family-based methods (Rabinowitz and Larid, 2000, Human Heredity 50, 211-223) have been proposed in both testing and estimation settings to adjust for this confounding, especially in case-only association studies. The family-based methods...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01454.x

    authors: Wang Y,Yang Q,Rabinowitz D

    更新日期:2011-06-01 00:00:00

  • Statistical inference for serial dilution assay data.

    abstract::Serial dilution assays are widely employed for estimating substance concentrations and minimum inhibitory concentrations. The Poisson-Bernoulli model for such assays is appropriate for count data but not for continuous measurements that are encountered in applications involving substance concentrations. This paper pre...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.01215.x

    authors: Lee ML,Whitmore GA

    更新日期:1999-12-01 00:00:00

  • Symmetrically dependent models arising in visual assessment data.

    abstract::Given data from bilateral visual assessments on N subjects at k occasions, we consider inference for contralateral correlations (C) between fellow eyes and lateral correlations (L) among p different assessments of the same eye. Under permutation symmetric dependence structure between observations from fellow eyes and ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.01188.x

    authors: Viana M,Olkin I

    更新日期:2000-12-01 00:00:00

  • Optimal designs when the variance is a function of the mean.

    abstract::We develop locally D-optimal designs for nonlinear models when the variance of the response is a function of its mean. Using the two-parameter Michaelis-Menten model as an example, we show that the optimal design depends on both the type of heteroscedasticity and the magnitude of the variation. In addition, our result...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00925.x

    authors: Dette H,Wong WK

    更新日期:1999-09-01 00:00:00

  • A note on the conditional approach to interval estimation in the calibration problem.

    abstract::In the calibration problem, the need to construct a confidence interval to estimate the unknown chi 0 arises when the null hypothesis of zero slope is rejected. Otherwise, the resulting confidence interval will be infinite to reflect the fact that the slope of the regression line may be zero. Under the condition of re...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lee JJ

    更新日期:1991-12-01 00:00:00

  • A statistical method for detecting differentially expressed SNVs based on next-generation RNA-seq data.

    abstract::In this article, we propose a new statistical method-MutRSeq-for detecting differentially expressed single nucleotide variants (SNVs) based on RNA-seq data. Specifically, we focus on nonsynonymous mutations and employ a hierarchical likelihood approach to jointly model observed mutation events as well as read count me...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12548

    authors: Fu R,Wang P,Ma W,Taguchi A,Wong CH,Zhang Q,Gazdar A,Hanash SM,Zhou Q,Zhong H,Feng Z

    更新日期:2017-03-01 00:00:00

  • A note on permutation tests for variance components in multilevel generalized linear mixed models.

    abstract::In many applications of generalized linear mixed models to multilevel data, it is of interest to test whether a random effects variance component is zero. It is well known that the usual asymptotic chi-square distribution of the likelihood ratio and score statistics under the null does not necessarily hold. In this no...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2007.00775.x

    authors: Fitzmaurice GM,Lipsitz SR,Ibrahim JG

    更新日期:2007-09-01 00:00:00

  • Case-control analysis with partial knowledge of exposure misclassification probabilities.

    abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00598.x

    authors: Gustafson P,Le ND,Saskin R

    更新日期:2001-06-01 00:00:00

  • Randomization model methods for evaluating treatment efficacy in multicenter clinical trials.

    abstract::This paper studies randomization model methods for analyzing data from a multicenter study comparing the effectiveness of two treatments. The Mantel-Haenszel mean score statistic, which can be used for continuous or ordered categorical response variables, is shown to be a useful nonparametric alternative to standard l...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Davis CS,Chung Y

    更新日期:1995-09-01 00:00:00

  • Hypothesis testing of matrix graph model with application to brain connectivity analysis.

    abstract::Brain connectivity analysis is now at the foreground of neuroscience research. A connectivity network is characterized by a graph, where nodes represent neural elements such as neurons and brain regions, and links represent statistical dependence that is often encoded in terms of partial correlation. Such a graph is i...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12633

    authors: Xia Y,Li L

    更新日期:2017-09-01 00:00:00