Abstract:
:Genomic data are often characterized by a moderate to large number of categorical variables observed for relatively few subjects. Some of the variables may be missing or noninformative. An example of such data is loss of heterozygosity (LOH), a dichotomous variable, observed on a moderate number of genetic markers. We first consider a latent class model where, conditional on unobserved membership in one of k classes, the variables are independent with probabilities determined by a regression model of low dimension q. Using a family of penalties including the ridge and LASSO, we extend this model to address higher-dimensional problems. Finally, we present an orthogonal map that transforms marker space to a space of "features" for which the constrained model has better predictive power. We demonstrate these methods on LOH data collected at 19 markers from 93 brain tumor patients. For this data set, the existing unpenalized latent class methodology does not produce estimates. Additionally, we show that posterior classes obtained from this method are associated with survival for these patients.
journal_name
Biometricsjournal_title
Biometricsauthors
Houseman EA,Coull BA,Betensky RAdoi
10.1111/j.1541-0420.2006.00566.xsubject
Has Abstractpub_date
2006-12-01 00:00:00pages
1062-70issue
4eissn
0006-341Xissn
1541-0420pii
BIOM566journal_volume
62pub_type
杂志文章相关文献
BIOMETRICS文献大全abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13205
更新日期:2020-09-01 00:00:00
abstract::This paper considers the problem of extending the classical moving average models to time series with conditional distributions given by generalized linear models. These models have the advantage of easy construction and estimation. Statistical modelling techniques are also proposed. Some simulation results and an ill...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1994-06-01 00:00:00
abstract::Diggle's tests of spatial randomness based on empirical distributions of interpoint distances can be performed with and without edge-effect correction. We present here numerical results illustrating that tests without the edge-effect correction proposed by Diggle (1979, Biometrics 35, 87-101) have a higher power for s...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.1999.00156.x
更新日期:1999-03-01 00:00:00
abstract::In the analysis of longitudinal or panel data, neglecting the serial correlations among the repeated measurements within subjects may lead to inefficient inference. In particular, when the number of repeated measurements is large, it may be desirable to model the serial correlations more generally. An appealing approa...
journal_title:Biometrics
pub_type: 杂志文章,随机对照试验
doi:10.1111/biom.12317
更新日期:2015-09-01 00:00:00
abstract::We present a class of simple designs that can be used in early dose-finding studies in HIV. Such designs, in contrast with Phase I designs in cancer, have a lot of the Phase II flavor about them. Information on efficacy is obtained during the trial and is as important as that relating to toxicity. The designs proposed...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2001.01018.x
更新日期:2001-12-01 00:00:00
abstract::In this article, we propose approximate sample size formulas for establishing equivalence or noninferiority of two treatments in match-pairs design. Using the ratio of two proportions as the equivalence measure, we derive sample size formulas based on a score statistic for two types of analyses: hypothesis testing and...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2002.00957.x
更新日期:2002-12-01 00:00:00
abstract::We consider modeling correlated survival data when cluster sizes may be informative to the outcome of interest based on a within-cluster resampling (WCR) approach and a weighted score function (WSF) method. We derive the large sample properties for the WCR estimators under the Cox proportional hazards model. We establ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00730.x
更新日期:2007-09-01 00:00:00
abstract:SUMMARY:We develop sample size formulas for studies aiming to test mean differences between a treatment and control group when all-or-none nonadherence (noncompliance) and selection bias are expected. Recent work by Fay, Halloran, and Follmann (2007, Biometrics 63, 465-474) addressed the increased variances within grou...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2008.01114.x
更新日期:2009-06-01 00:00:00
abstract::When testing the null hypothesis that treatment arm-specific survival-time distributions are equal, the log-rank test is asymptotically valid when the distribution of time to censoring is conditionally independent of randomized treatment group given survival time. We introduce a test of the null hypothesis for use whe...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/1541-0420.00059
更新日期:2003-09-01 00:00:00
abstract::Traditional paradigms for clinical translation are challenged in settings where multiple contemporaneous therapeutic strategies have been identified as potentially beneficial. Platform trials have emerged as an approach for sequentially comparing multiple trials using a single protocol. The Ebola virus disease outbrea...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12841
更新日期:2018-09-01 00:00:00
abstract::Bayesian experimental design is investigated for Bayesian analysis of nonlinear mixed-effects models. Existence of the posterior risk for parameter estimation is shown. When the same prior distribution is used for both design and inference, existence of the preposterior risk for design is also proven. If the prior dis...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341X.2004.00148.x
更新日期:2004-03-01 00:00:00
abstract::We present a model for multivariate repeated measures that incorporates random effects, correlated stochastic processes, and measurement errors. The model is a multivariate generalization of the model for univariate longitudinal data given by Taylor, Cumberland, and Sy (1994, Journal of the American Statistical Associ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1997-06-01 00:00:00
abstract::We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data arises from biases associated with the nonrandom assignment of treatment, for example bias induced by d...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2011.01738.x
更新日期:2012-09-01 00:00:00
abstract::In Western countries where food supply is satisfactory, consumers organize their diets around a large combination of foods. It is the purpose of this article to examine how recent nonnegative matrix factorization (NMF) techniques can be applied to food consumption data to understand these combinations. Such data are n...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2011.01588.x
更新日期:2011-12-01 00:00:00
abstract::Training data in a supervised learning problem consist of the class label and its potential predictors for a set of observations. Constructing effective classifiers from training data is the goal of supervised learning. In biomedical sciences and other scientific applications, class labels may be subject to errors. We...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341X.2004.00156.x
更新日期:2004-03-01 00:00:00
abstract::Epidemiological studies of related individuals are often complicated by the fact that follow-up on the event type of interest is incomplete due to the occurrence of other events. We suggest a class of frailty models with cause-specific hazards for correlated competing events in related individuals. The frailties are b...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12326
更新日期:2015-09-01 00:00:00
abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12563
更新日期:2017-03-01 00:00:00
abstract::Suppose that the response variable in a well-executed clinical or observational study to evaluate a treatment is the time to a certain event, and a set of baseline covariates or predictors was collected for each study patient. Furthermore, suppose that a significant number of study patients had nontrivial, long-term a...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2003.00116.x
更新日期:2003-12-01 00:00:00
abstract::In this paper we propose a new class of statistics to test a simple hypothesis against a family of alternatives characterized by a mixture model. Unlike the likelihood ratio statistic, whose large sample distribution is still unknown in this situation, these new statistics have a simple asymptotic distribution to whic...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.1999.00065.x
更新日期:1999-03-01 00:00:00
abstract::Although the primary focus of Phase I clinical trials is to assess clinical pharmacology and possible toxicities, any information on the potential effect of treatment would be useful in helping to determine priorities between treatments for further study. We consider the scenario where data are routinely collected on ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1995-09-01 00:00:00
abstract::Population admixture can be a confounding factor in genetic association studies. Family-based methods (Rabinowitz and Larid, 2000, Human Heredity 50, 211-223) have been proposed in both testing and estimation settings to adjust for this confounding, especially in case-only association studies. The family-based methods...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2010.01454.x
更新日期:2011-06-01 00:00:00
abstract::Serial dilution assays are widely employed for estimating substance concentrations and minimum inhibitory concentrations. The Poisson-Bernoulli model for such assays is appropriate for count data but not for continuous measurements that are encountered in applications involving substance concentrations. This paper pre...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.1999.01215.x
更新日期:1999-12-01 00:00:00
abstract::Given data from bilateral visual assessments on N subjects at k occasions, we consider inference for contralateral correlations (C) between fellow eyes and lateral correlations (L) among p different assessments of the same eye. Under permutation symmetric dependence structure between observations from fellow eyes and ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.01188.x
更新日期:2000-12-01 00:00:00
abstract::We develop locally D-optimal designs for nonlinear models when the variance of the response is a function of its mean. Using the two-parameter Michaelis-Menten model as an example, we show that the optimal design depends on both the type of heteroscedasticity and the magnitude of the variation. In addition, our result...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.1999.00925.x
更新日期:1999-09-01 00:00:00
abstract::In the calibration problem, the need to construct a confidence interval to estimate the unknown chi 0 arises when the null hypothesis of zero slope is rejected. Otherwise, the resulting confidence interval will be infinite to reflect the fact that the slope of the regression line may be zero. Under the condition of re...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1991-12-01 00:00:00
abstract::In this article, we propose a new statistical method-MutRSeq-for detecting differentially expressed single nucleotide variants (SNVs) based on RNA-seq data. Specifically, we focus on nonsynonymous mutations and employ a hierarchical likelihood approach to jointly model observed mutation events as well as read count me...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12548
更新日期:2017-03-01 00:00:00
abstract::In many applications of generalized linear mixed models to multilevel data, it is of interest to test whether a random effects variance component is zero. It is well known that the usual asymptotic chi-square distribution of the likelihood ratio and score statistics under the null does not necessarily hold. In this no...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2007.00775.x
更新日期:2007-09-01 00:00:00
abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2001.00598.x
更新日期:2001-06-01 00:00:00
abstract::This paper studies randomization model methods for analyzing data from a multicenter study comparing the effectiveness of two treatments. The Mantel-Haenszel mean score statistic, which can be used for continuous or ordered categorical response variables, is shown to be a useful nonparametric alternative to standard l...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1995-09-01 00:00:00
abstract::Brain connectivity analysis is now at the foreground of neuroscience research. A connectivity network is characterized by a graph, where nodes represent neural elements such as neurons and brain regions, and links represent statistical dependence that is often encoded in terms of partial correlation. Such a graph is i...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12633
更新日期:2017-09-01 00:00:00