Abstract:
High-throughput technologies, such as gene expression arrays and protein mass spectrometry, allow one to simultaneously evaluate thousands of potential biomarkers that could distinguish different tissue types. Of particular interest here is distinguishing between cancerous and normal organ tissues. We consider statistical methods to rank genes (or proteins) with regard to differential expression between tissues. Various statistical measures are considered, and we argue that two measures related to the receiver operating characteristic (ROC) curve are particularly suitable for this purpose. We also propose that sampling variability in the gene rankings be quantified, and suggest using the "selection probability function," the probability distribution of rankings for each gene, which is estimated via the bootstrap. A real dataset, derived from gene expression arrays of 23 normal and 30 ovarian cancer tissues, is analyzed. Simulation studies are also used to assess the relative performance of different statistical gene ranking measures and our quantification of sampling variability. Our approach leads naturally to a procedure for sample-size calculations, appropriate for exploratory studies that seek to identify differentially expressed genes.
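The bootstrap idea in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name its two ROC-related measures, so the empirical area under the ROC curve (AUC) stands in as the ranking statistic; tissues are assumed to be resampled with replacement separately within the cancer and normal groups; and the selection probability is summarized as the chance that a gene ranks in the top k. The function names, the value of k, and the synthetic data are all hypothetical.

```python
import numpy as np

def empirical_auc(cases, controls):
    """Empirical AUC per gene: P(case > control) + 0.5 * P(tie).

    cases: array (n_cases, n_genes); controls: array (n_controls, n_genes).
    """
    # Compare every case tissue with every control tissue, gene by gene.
    greater = (cases[:, None, :] > controls[None, :, :]).mean(axis=(0, 1))
    ties = (cases[:, None, :] == controls[None, :, :]).mean(axis=(0, 1))
    return greater + 0.5 * ties

def selection_probability(cases, controls, k, n_boot=200, seed=0):
    """Bootstrap estimate, per gene, of the probability of ranking in the top k."""
    rng = np.random.default_rng(seed)
    hits = np.zeros(cases.shape[1])
    for _ in range(n_boot):
        # Assumption: resample tissues with replacement within each group.
        b_cases = cases[rng.integers(0, len(cases), len(cases))]
        b_controls = controls[rng.integers(0, len(controls), len(controls))]
        auc = empirical_auc(b_cases, b_controls)
        top_k = np.argsort(-auc)[:k]  # indices of the k largest AUC values
        hits[top_k] += 1
    return hits / n_boot

# Synthetic data only (not the ovarian cancer dataset): 30 "cancer" and
# 23 "normal" tissues, 50 genes, with gene 0 shifted upward in the cancers.
rng = np.random.default_rng(1)
cancer = rng.normal(size=(30, 50))
cancer[:, 0] += 3.0
normal = rng.normal(size=(23, 50))
probs = selection_probability(cancer, normal, k=5)
```

Ranking by AUC in this form favors genes that are overexpressed in cancer tissue; a different statistic can be swapped in by replacing `empirical_auc`. Evaluating `selection_probability` over a range of k values gives the full distribution of rankings for each gene.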
journal_name: Biometrics
journal_title: Biometrics
authors: Pepe MS, Longton G, Anderson GL, Schummer M
doi: 10.1111/1541-0420.00016
subject: Has Abstract
pub_date: 2003-03-01 00:00:00
pages: 133-42
issue: 1
eissn: 0006-341X
issn: 1541-0420
journal_volume: 59
pub_type: Journal Article
Related literature
BIOMETRICS literature collection
abstract::The situation considered is the prediction of a future observation from a simple exponential survival distribution in a hierarchical Bayes context. It is shown that when the hyperparameters need to be estimated from the data, a sample reuse approach is superior to maximum likelihood and method of moments estimation pr...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1990-03-01 00:00:00
abstract::Complex issues arise when investigating the association between longitudinal immunologic measures and time to an event, such as time to relapse, in cancer vaccine trials. Unlike many clinical trials, we may encounter patients who are cured and no longer susceptible to the time-to-event endpoint. If there are cured pat...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/1541-0420.00079
update_date:2003-09-01 00:00:00
abstract::In multivariate recurrent event data regression, observation of recurrent events is usually terminated by other events that are associated with the recurrent event processes, resulting in informative censoring. Additionally, some covariates could be measured with errors. In some applications, an instrumental variable ...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/biom.12857
update_date:2018-09-01 00:00:00
abstract:SUMMARY:We consider the analysis of clinical trials that involve randomization to an active treatment (T = 1) or a control treatment (T = 0), when the active treatment is subject to all-or-nothing compliance. We compare three approaches to estimating treatment efficacy in this situation: as-treated analysis, per-protoc...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2008.01066.x
update_date:2009-06-01 00:00:00
abstract::Case-control studies offer a rapid and efficient way to evaluate hypotheses. On the other hand, proper selection of the controls is challenging, and the potential for selection bias is a major weakness. Valid inferences about parameters of interest cannot be drawn if selection bias exists. Furthermore, the selection b...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.2001.01106.x
update_date:2001-12-01 00:00:00
abstract::In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables....
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2006.00567.x
update_date:2006-12-01 00:00:00
abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.2000.01268.x
update_date:2000-12-01 00:00:00
abstract::The feasibility and cost-effectiveness of estimation of kappa using a case-control method of sampling, proposed by Jannarone, Macera, and Garrison (1987, Biometrics 43, 433-437), is provided support. However, in this article unrealistic assumptions in their presentation are identified and more general results for more...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1990-03-01 00:00:00
abstract::Genetic variants associated with disease outcomes can be used to develop personalized treatment. To reach this precision medicine goal, hundreds of large-scale genome-wide association studies (GWAS) have been conducted in the past decade to search for promising genetic variants associated with various traits. They hav...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/biom.13076
update_date:2019-12-01 00:00:00
abstract::We extend the linear random-effects growth curve model (REGCM) (Laird and Ware, 1982, Biometrics 38, 963-974) to study the effects of population covariates on one or more characteristics of the growth curve when the characteristics are expressed as linear combinations of the growth curve parameters. This definition in...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1997-06-01 00:00:00
abstract::In the context of group testing screening, McMahan, Tebbs, and Bilder (2012, Biometrics 68, 287-296) proposed a two-stage procedure in a heterogeneous population in the presence of misclassification. In earlier work published in Biometrics, Kim, Hudgens, Dreyfuss, Westreich, and Pilcher (2007, Biometrics 63, 1152-1162)...
journal_title:Biometrics
pub_type: Comment, Journal Article
doi:10.1111/biom.12385
update_date:2016-03-01 00:00:00
abstract::A simple method for comparing independent groups of clustered binary data with group-specific covariates is proposed. It is based on the concepts of design effect and effective sample size widely used in sample surveys, and assumes no specific models for the intracluster correlations. It can be implemented using any s...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1992-06-01 00:00:00
abstract::Laska and Meisner (1989, Biometrics 45, 1139-1151) dealt with the problem of testing whether an identified treatment belonging to a set of k + 1 treatments is better than each of the other k treatments. They calculated sample size tables for k = 2 when using multiple t-tests or Wilcoxon-Mann-Whitney tests, both under ...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.2000.00879.x
update_date:2000-09-01 00:00:00
abstract::A procedure is given for estimating the ratio of the means of two populations using the data from two independent random samples when the observations are normally distributed with population variances that are related to the population means. ...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1985-03-01 00:00:00
abstract::Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable a...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/biom.12007
update_date:2013-06-01 00:00:00
abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/biom.12563
update_date:2017-03-01 00:00:00
abstract::Receiver operating characteristic (ROC) curves are frequently used to assess the usefulness of diagnostic markers. When several diagnostic markers are available, they can be combined by a best linear combination: that is, when the area under the ROC curve of this combination is maximized among all possible linear comb...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1997-06-01 00:00:00
abstract::In most research on smoothing splines the focus has been on estimation, while inference, especially hypothesis testing, has received less attention. By defining design matrices for fixed and random effects and the structure of the covariance matrices of random errors in an appropriate way, the cubic smoothing spline a...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2010.01537.x
update_date:2011-09-01 00:00:00
abstract::In this article, we propose a Bayesian approach to phase I/II dose-finding oncology trials by jointly modeling a binary toxicity outcome and a continuous biomarker expression outcome. We apply our method to a clinical trial of a new gene therapy for bladder cancer patients. In this trial, the biomarker expression indi...
journal_title:Biometrics
pub_type: Clinical Trial, Journal Article
doi:10.1111/j.1541-0420.2005.00314.x
update_date:2005-06-01 00:00:00
abstract::A formula is derived for determining the number of observations necessary to test the equality of two survival distributions when concomitant information is incorporated. This formula should be useful in designing clinical trials with a heterogeneous patient population. Schoenfeld (1981, Biometrika 68, 316-319) derive...
journal_title:Biometrics
pub_type: Clinical Trial, Journal Article
doi:
update_date:1983-06-01 00:00:00
abstract::In many applications of generalized linear mixed models to multilevel data, it is of interest to test whether a random effects variance component is zero. It is well known that the usual asymptotic chi-square distribution of the likelihood ratio and score statistics under the null does not necessarily hold. In this no...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2007.00775.x
update_date:2007-09-01 00:00:00
abstract::Serial dilution assays are widely employed for estimating substance concentrations and minimum inhibitory concentrations. The Poisson-Bernoulli model for such assays is appropriate for count data but not for continuous measurements that are encountered in applications involving substance concentrations. This paper pre...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.1999.01215.x
update_date:1999-12-01 00:00:00
abstract::An individual's health condition can affect the frequency and intensity of episodes that can occur repeatedly and that may be related to an event time of interest. For example, bleeding episodes during pregnancy may indicate problems predictive of preterm delivery. Motivated by this application, we propose a joint mod...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2006.00720.x
update_date:2007-06-01 00:00:00
abstract::Given a large number of t-statistics, we consider the problem of approximating the distribution of noncentrality parameters (NCPs) by a continuous density. This problem is closely related to the control of false discovery rates (FDR) in massive hypothesis testing applications, e.g., microarray gene expression analysis...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2012.01764.x
update_date:2012-12-01 00:00:00
abstract::This article describes the theoretical and practical issues in experimental design for gene expression microarrays. Specifically, this article 1) discusses the basic principles of design (randomization, replication, and blocking) as they pertain to microarrays, and 2) provides some general guidelines for statisticians...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.2003.00096.x
update_date:2003-12-01 00:00:00
abstract::Reversible jump Markov chain Monte Carlo (RJMCMC) methods are used to fit Bayesian capture-recapture models incorporating heterogeneity in individuals and samples. Heterogeneity in capture probabilities comes from finite mixtures and/or fixed sample effects allowing for interactions. Estimation by RJMCMC allows automa...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2009.01289.x
update_date:2010-06-01 00:00:00
abstract::Andersen et al. (1985, Biometrics 41, 921-932) gave an estimator of the cumulative relative mortality comparing rates of death in an epidemiologic cohort to an external population as a function of time when covariate information is available on all cohort members. We present an analogous estimator when covariate infor...
journal_title:Biometrics
pub_type: Journal Article
doi:
update_date:1993-06-01 00:00:00
abstract::In this article, we study nonparametric estimation of the mean function of a counting process with panel observations. We introduce the gamma frailty variable to account for the intracorrelation between the panel counts of the counting process and construct a maximum pseudo-likelihood estimate with the frailty variabl...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.2003.00126.x
update_date:2003-12-01 00:00:00
abstract::In this article, we describe a Bayesian approach to the calibration of a stochastic computer model of chemical kinetics. As with many applications in the biological sciences, the data available to calibrate the model come from different sources. Furthermore, these data appear to provide somewhat conflicting informatio...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.1541-0420.2009.01245.x
update_date:2010-03-01 00:00:00
abstract::The variogram is a standard tool in the analysis of spatial data, and its shape provides useful information on the form of spatial correlation that may be present. However, it is also useful to be able to assess the evidence for the presence of any spatial correlation. A method of doing this, based on an assessment of...
journal_title:Biometrics
pub_type: Journal Article
doi:10.1111/j.0006-341x.2001.00211.x
update_date:2001-03-01 00:00:00