Abstract:
:Genomic data are often characterized by a moderate to large number of categorical variables observed for relatively few subjects. Some of the variables may be missing or noninformative. An example of such data is loss of heterozygosity (LOH), a dichotomous variable, observed on a moderate number of genetic markers. We first consider a latent class model where, conditional on unobserved membership in one of k classes, the variables are independent with probabilities determined by a regression model of low dimension q. Using a family of penalties including the ridge and LASSO, we extend this model to address higher-dimensional problems. Finally, we present an orthogonal map that transforms marker space to a space of "features" for which the constrained model has better predictive power. We demonstrate these methods on LOH data collected at 19 markers from 93 brain tumor patients. For this data set, the existing unpenalized latent class methodology does not produce estimates. Additionally, we show that posterior classes obtained from this method are associated with survival for these patients.
journal_name
Biometricsjournal_title
Biometricsauthors
Houseman EA,Coull BA,Betensky RAdoi
10.1111/j.1541-0420.2006.00566.xsubject
Has Abstractpub_date
2006-12-01 00:00:00pages
1062-70issue
4eissn
0006-341Xissn
1541-0420pii
BIOM566journal_volume
62pub_type
杂志文章相关文献
BIOMETRICS文献大全abstract::Exposure to high levels of air pollution during the pregnancy is associated with increased probability of preterm birth (PTB), a major cause of infant morbidity and mortality. New statistical methodology is required to specifically determine when a particular pollutant impacts the PTB outcome, to determine the role of...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2012.01774.x
更新日期:2012-12-01 00:00:00
abstract::In this article, we propose approximate sample size formulas for establishing equivalence or noninferiority of two treatments in match-pairs design. Using the ratio of two proportions as the equivalence measure, we derive sample size formulas based on a score statistic for two types of analyses: hypothesis testing and...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2002.00957.x
更新日期:2002-12-01 00:00:00
abstract::In controlled clinical trials there are usually several prognostic factors known or thought to influence the patient's ability to respond to treatment. Therefore, the method of sequential treatment assignment needs to be designed so that treatment balance is simultaneously achieved across all such patients factor. Tra...
journal_title:Biometrics
pub_type: 临床试验,杂志文章
doi:
更新日期:1975-03-01 00:00:00
abstract::The clinical trial design in which the endpoint is measured both at baseline and at the end of the study is used in a variety of situations. For two-group designs, test such as the t test or analysis of covariance are commonly used to evaluate treatment efficacy. Often such pretest-posttest trials restrict participati...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1991-06-01 00:00:00
abstract::A mean residual life function is the average remaining life of a surviving subject, as it varies with time. The proportional mean residual life model was proposed by Oakes and Dasu (1990, Biometrika77, 409-410) in regression analysis to study its association with related covariates in absence of censoring. In this art...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341X.2005.030224.x
更新日期:2005-03-01 00:00:00
abstract::Pharmacovigilance systems aim at early detection of adverse effects of marketed drugs. They maintain large spontaneous reporting databases for which several automatic signaling methods have been developed. One limit of those methods is that the decision rules for the signal generation are based on arbitrary thresholds...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2009.01262.x
更新日期:2010-03-01 00:00:00
abstract::Drawing inferences for high-dimensional models is challenging as regular asymptotic theories are not applicable. This article proposes a new framework of simultaneous estimation and inferences for high-dimensional linear models. By smoothing over partial regression estimates based on a given variable selection scheme,...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13013
更新日期:2019-06-01 00:00:00
abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.01268.x
更新日期:2000-12-01 00:00:00
abstract::In this article, we propose a Bayesian approach to dose-response assessment and the assessment of synergy between two combined agents. We consider the case of an in vitro ovarian cancer research study aimed at investigating the antiproliferative activities of four agents, alone and paired, in two human ovarian cancer ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2010.01403.x
更新日期:2010-12-01 00:00:00
abstract::In the context of state-space modeling, conventional usage of the deviance information criterion (DIC) evaluates the ability of the model to predict an observation at time t given the underlying state at time t. Motivated by the failure of conventional DIC to clearly choose between competing multivariate nonlinear Bay...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12237
更新日期:2014-12-01 00:00:00
abstract::New drugs that will be investigated in the future are expected to deal with chronic diseases, where the number of patients available for controlled clinical trials will be small and where the long-term sequelae that it is hoped will be ameliorated take a long time to occur. Thus, it would be useful to construct powerf...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1986-09-01 00:00:00
abstract::Multipoint linkage analysis is being performed routinely in medical genetic studies to localize disease genes. This likelihood-based method is very computationally intensive. Exact computations are thus formidable for problems with large number of genetic markers and complex pedigrees. This paper proposes a Monte Carl...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1996-12-01 00:00:00
abstract::A life table estimates probabilities of surviving and of dying as well as death rates, as these would apply in a stationary population with the same underlying continuous mortality curve as the observed population. We have derived approximations to the probability of surviving that require no iteration, do not depend ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1975-12-01 00:00:00
abstract::In problems with missing or latent data, a standard approach is to first impute the unobserved data, then perform all statistical analyses on the completed dataset--corresponding to the observed data and imputed unobserved data--using standard procedures for complete-data inference. Here, we extend this approach to mo...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341X.2005.031010.x
更新日期:2005-03-01 00:00:00
abstract::Integration of genomic data from multiple platforms has the capability to increase precision, accuracy, and statistical power in the identification of prognostic biomarkers. A fundamental problem faced in many multi-platform studies is unbalanced sample sizes due to the inability to obtain measurements from all the pl...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12587
更新日期:2017-06-01 00:00:00
abstract::When comparing follow-up measurements from two independent populations, missing records may arise due to censoring by events whose occurrence is associated with baseline covariates. In these situations, inferences based only on the completely followed observations may be biased if the follow-up measurements and the co...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2005.00332.x
更新日期:2005-06-01 00:00:00
abstract::This paper focuses on the methodology developed for analyzing a multivariate interval-censored data set from an AIDS observational study. A purpose of the study was to determine the natural history of the opportunistic infection cytomeglovirus (CMV) in an HIV-infected individual. For this observational study, laborato...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00940.x
更新日期:2000-09-01 00:00:00
abstract::We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data arises from biases associated with the nonrandom assignment of treatment, for example bias induced by d...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2011.01738.x
更新日期:2012-09-01 00:00:00
abstract::Matched case-control designs are commonly used in epidemiologic studies for increased efficiency. These designs have recently been introduced to the setting of modern imaging and genomic studies, which are characterized by high-dimensional covariates. However, appropriate statistical analyses that adjust for the match...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12113
更新日期:2014-03-01 00:00:00
abstract::We describe a simple, computationally efficient, permutation-based procedure for selecting the penalty parameter in LASSO-penalized regression. The procedure, permutation selection, is intended for applications where variable selection is the primary focus, and can be applied in a variety of structural settings, inclu...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12359
更新日期:2015-12-01 00:00:00
abstract::The data set presented relates a binomial response to ordered levels of an explanatory variable, representing doses of a drug, with data collected at several centers. A study goal is to test independence of the response and the ordinal factor, assuming under the alternative only that the binomial parameter is a monoto...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1996-09-01 00:00:00
abstract:SUMMARY:A trend test is often employed to analyze ordered categorical data, in which a set of increasing scores is assigned a priori. There is a drawback in this approach, because how to choose a set of scores is not clear. There have been debates on which scores should be used (e.g., Graubard and Korn, 1987, Biometric...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2008.00992.x
更新日期:2008-12-01 00:00:00
abstract::In this article, we describe a Bayesian approach to the calibration of a stochastic computer model of chemical kinetics. As with many applications in the biological sciences, the data available to calibrate the model come from different sources. Furthermore, these data appear to provide somewhat conflicting informatio...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2009.01245.x
更新日期:2010-03-01 00:00:00
abstract::Power and sample-size formulas for testing the homogeneity of relative risks using the score method are presented. The homogeneity score test (Gart, 1985, Biometrika 72, 673-677) is formally equivalent to the Pearson chi-square test, although they look different. Results of this paper may be useful in assessing the va...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.1999.00289.x
更新日期:1999-03-01 00:00:00
abstract::A Bayesian hierarchical generalized linear model is used to estimate hunting success rates at the subarea level for postseason harvest surveys. The model includes fixed week effects, random geographic effects, and spatial correlations between neighboring subareas. The computation is done by Gibbs sampling and adaptive...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00360.x
更新日期:2000-06-01 00:00:00
abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13205
更新日期:2020-09-01 00:00:00
abstract::The receiver operating characteristic (ROC) curve is a prominent tool for characterizing the accuracy of a continuous diagnostic test. To account for factors that might influence the test accuracy, various ROC regression methods have been proposed. However, as in any regression analysis, when the assumed models do not...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00620.x
更新日期:2007-03-01 00:00:00
abstract::The Wilcoxon rank sum test is widely used for two-group comparisons for nonnormal data. An assumption of this test is independence of sampling units both between and within groups. In ophthalmology, data are often collected on two eyes of an individual, which are highly correlated. In ophthalmological clinical trials,...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00582.x
更新日期:2006-12-01 00:00:00
abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12563
更新日期:2017-03-01 00:00:00
abstract::Experimental designs that include repeated measures of binary response variables over time and under different conditions are common in biology. In such settings, it is often desirable to characterize the response pattern over time. When response variables are continuous, this characterization can be made in terms of ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1988-12-01 00:00:00