Selecting differentially expressed genes from microarray experiments.

Abstract:

High-throughput technologies, such as gene expression arrays and protein mass spectrometry, allow one to simultaneously evaluate thousands of potential biomarkers that could distinguish different tissue types. Of particular interest here is distinguishing between cancerous and normal organ tissues. We consider statistical methods to rank genes (or proteins) with regard to differential expression between tissues. Various statistical measures are considered, and we argue that two measures related to the Receiver Operating Characteristic Curve are particularly suitable for this purpose. We also propose that sampling variability in the gene rankings be quantified, and suggest using the "selection probability function," the probability distribution of rankings for each gene. This is estimated via the bootstrap. A real dataset, derived from gene expression arrays of 23 normal and 30 ovarian cancer tissues, is analyzed. Simulation studies are also used to assess the relative performance of different statistical gene ranking measures and our quantification of sampling variability. Our approach leads naturally to a procedure for sample-size calculations, appropriate for exploratory studies that seek to identify differentially expressed genes.
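The ranking-plus-bootstrap idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the empirical area under the ROC curve (AUC) as the ranking measure (the abstract does not name its two ROC-related measures), it estimates only the probability that a gene ranks in the top k, and the function names, the synthetic data, and the choices k = 5 and 200 bootstrap replicates are all hypothetical.

```python
import numpy as np

def auc(cases, controls):
    """Empirical AUC: P(case > control) + 0.5 * P(tie) over all case/control pairs."""
    diff = cases[:, None] - controls[None, :]
    return (diff > 0).mean() + 0.5 * (diff == 0).mean()

def rank_genes(x_cases, x_controls):
    """Rank genes (rows) by AUC; rank 1 is the gene with the largest AUC."""
    scores = np.array([auc(c, n) for c, n in zip(x_cases, x_controls)])
    order = np.argsort(-scores, kind="stable")
    ranks = np.empty_like(order)
    ranks[order] = np.arange(1, len(scores) + 1)
    return ranks

def selection_probability(x_cases, x_controls, k, n_boot=200, seed=0):
    """Bootstrap estimate of P(rank <= k) for each gene.

    Tissues (columns) are resampled with replacement, separately within
    the case group and the control group.
    """
    rng = np.random.default_rng(seed)
    n_case, n_ctrl = x_cases.shape[1], x_controls.shape[1]
    hits = np.zeros(x_cases.shape[0])
    for _ in range(n_boot):
        ci = rng.integers(0, n_case, n_case)
        ni = rng.integers(0, n_ctrl, n_ctrl)
        hits += rank_genes(x_cases[:, ci], x_controls[:, ni]) <= k
    return hits / n_boot

# Synthetic data (assumed, not the ovarian cancer dataset): 100 genes,
# 30 cancer and 23 normal tissues, with gene 0 overexpressed in cancer.
rng = np.random.default_rng(1)
n_genes = 100
x_controls = rng.normal(size=(n_genes, 23))
x_cases = rng.normal(size=(n_genes, 30))
x_cases[0] += 3.0

probs = selection_probability(x_cases, x_controls, k=5)
print(probs[0])  # estimated probability that gene 0 ranks in the top 5
```

Ranking by AUC alone favors genes that are overexpressed in the case group; a measure symmetric in direction would be needed to also pick up underexpressed genes.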

journal_name

Biometrics

journal_title

Biometrics

authors

Pepe MS,Longton G,Anderson GL,Schummer M

doi

10.1111/1541-0420.00016

pub_date

2003-03-01 00:00:00

pages

133-42

issue

1

eissn

1541-0420

issn

0006-341X

journal_volume

59

pub_type

Journal Article
  • On hierarchical Bayes procedures for predicting simple exponential survival.

    abstract::The situation considered is the prediction of a future observation from a simple exponential survival distribution in a hierarchical Bayes context. It is shown that when the hyperparameters need to be estimated from the data, a sample reuse approach is superior to maximum likelihood and method of moments estimation pr...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Geisser S

    update_date:1990-03-01 00:00:00

  • Bayesian approaches to joint cure-rate and longitudinal models with applications to cancer vaccine trials.

    abstract::Complex issues arise when investigating the association between longitudinal immunologic measures and time to an event, such as time to relapse, in cancer vaccine trials. Unlike many clinical trials, we may encounter patients who are cured and no longer susceptible to the time-to-event endpoint. If there are cured pat...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/1541-0420.00079

    authors: Brown ER,Ibrahim JG

    update_date:2003-09-01 00:00:00

  • Methods for multivariate recurrent event data with measurement error and informative censoring.

    abstract::In multivariate recurrent event data regression, observation of recurrent events is usually terminated by other events that are associated with the recurrent event processes, resulting in informative censoring. Additionally, some covariates could be measured with errors. In some applications, an instrumental variable ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12857

    authors: Yu H,Cheng YJ,Wang CY

    update_date:2018-09-01 00:00:00

  • A comparison of methods for estimating the causal effect of a treatment in randomized clinical trials subject to noncompliance.

    abstract::We consider the analysis of clinical trials that involve randomization to an active treatment (T = 1) or a control treatment (T = 0), when the active treatment is subject to all-or-nothing compliance. We compare three approaches to estimating treatment efficacy in this situation: as-treated analysis, per-protoc...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.01066.x

    authors: Little RJ,Long Q,Lin X

    update_date:2009-06-01 00:00:00

  • Matched case-control data analysis with selection bias.

    abstract::Case-control studies offer a rapid and efficient way to evaluate hypotheses. On the other hand, proper selection of the controls is challenging, and the potential for selection bias is a major weakness. Valid inferences about parameters of interest cannot be drawn if selection bias exists. Furthermore, the selection b...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.01106.x

    authors: Lin IF,Paik MC

    update_date:2001-12-01 00:00:00

  • Variable selection for logistic regression using a prediction-focused information criterion.

    abstract::In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables....

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00567.x

    authors: Claeskens G,Croux C,Van Kerckhoven J

    update_date:2006-12-01 00:00:00

  • Fitting nonlinear and constrained generalized estimating equations with optimization software.

    abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.01268.x

    authors: Contreras M,Ryan LM

    update_date:2000-12-01 00:00:00

  • A note on case-control sampling to estimate kappa coefficients.

    abstract::The feasibility and cost-effectiveness of estimation of kappa using a case-control method of sampling, proposed by Jannarone, Macera, and Garrison (1987, Biometrics 43, 433-437), is provided support. However, in this article unrealistic assumptions in their presentation are identified and more general results for more...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Kraemer HC,Bloch DA

    update_date:1990-03-01 00:00:00

  • Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.

    abstract::Genetic variants associated with disease outcomes can be used to develop personalized treatment. To reach this precision medicine goal, hundreds of large-scale genome-wide association studies (GWAS) have been conducted in the past decade to search for promising genetic variants associated with various traits. They hav...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13076

    authors: Masotti M,Guo B,Wu B

    update_date:2019-12-01 00:00:00

  • Two-stage method of estimation for general linear growth curve models.

    abstract::We extend the linear random-effects growth curve model (REGCM) (Laird and Ware, 1982, Biometrics 38, 963-974) to study the effects of population covariates on one or more characteristics of the growth curve when the characteristics are expressed as linear combinations of the growth curve parameters. This definition in...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Stukel TA,Demidenko E

    update_date:1997-06-01 00:00:00

  • Reader reaction: A note on the evaluation of group testing algorithms in the presence of misclassification.

    abstract::In the context of group testing screening, McMahan, Tebbs, and Bilder (2012, Biometrics 68, 287-296) proposed a two-stage procedure in a heterogenous population in the presence of misclassification. In earlier work published in Biometrics, Kim, Hudgens, Dreyfuss, Westreich, and Pilcher (2007, Biometrics 63, 1152-1162)...

    journal_title:Biometrics

    pub_type: Comment, Journal Article

    doi:10.1111/biom.12385

    authors: Malinovsky Y,Albert PS,Roy A

    update_date:2016-03-01 00:00:00

  • A simple method for the analysis of clustered binary data.

    abstract::A simple method for comparing independent groups of clustered binary data with group-specific covariates is proposed. It is based on the concepts of design effect and effective sample size widely used in sample surveys, and assumes no specific models for the intracluster correlations. It can be implemented using any s...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Rao JN,Scott AJ

    update_date:1992-06-01 00:00:00

  • Sample size determination for testing whether an identified treatment is best.

    abstract::Laska and Meisner (1989, Biometrics 45, 1139-1151) dealt with the problem of testing whether an identified treatment belonging to a set of k + 1 treatments is better than each of the other k treatments. They calculated sample size tables for k = 2 when using multiple t-tests or Wilcoxon-Mann-Whitney tests, both under ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.00879.x

    authors: Horn M,Vollandt R,Dunnett CW

    update_date:2000-09-01 00:00:00

  • Interval estimates for the ratio of the means of two normal populations with variances related to the means.

    abstract::A procedure is given for estimating the ratio of the means of two populations using the data from two independent random samples when the observations are normally distributed with population variances that are related to the population means. ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Cox CP

    update_date:1985-03-01 00:00:00

  • A Bayesian goodness of fit test and semiparametric generalization of logistic regression with measurement data.

    abstract::Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable a...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12007

    authors: Schörgendorfer A,Branscum AJ,Hanson TE

    update_date:2013-06-01 00:00:00

  • Likelihood ratio tests for a dose-response effect using multiple nonlinear regression models.

    abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12563

    authors: Gutjahr G,Bornkamp B

    update_date:2017-03-01 00:00:00

  • Confidence intervals for the generalized ROC criterion.

    abstract::Receiver operating characteristic (ROC) curves are frequently used to assess the usefulness of diagnostic markers. When several diagnostic markers are available, they can be combined by a best linear combination: that is, when the area under the ROC curve of this combination is maximized among all possible linear comb...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Reiser B,Faraggi D

    update_date:1997-06-01 00:00:00

  • Testing for cubic smoothing splines under dependent data.

    abstract::In most research on smoothing splines the focus has been on estimation, while inference, especially hypothesis testing, has received less attention. By defining design matrices for fixed and random effects and the structure of the covariance matrices of random errors in an appropriate way, the cubic smoothing spline a...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2010.01537.x

    authors: Nummi T,Pan J,Siren T,Liu K

    update_date:2011-09-01 00:00:00

  • A Bayesian approach to jointly modeling toxicity and biomarker expression in a phase I/II dose-finding trial.

    abstract::In this article, we propose a Bayesian approach to phase I/II dose-finding oncology trials by jointly modeling a binary toxicity outcome and a continuous biomarker expression outcome. We apply our method to a clinical trial of a new gene therapy for bladder cancer patients. In this trial, the biomarker expression indi...

    journal_title:Biometrics

    pub_type: Clinical Trial, Journal Article

    doi:10.1111/j.1541-0420.2005.00314.x

    authors: Bekele BN,Shen Y

    update_date:2005-06-01 00:00:00

  • Sample-size formula for the proportional-hazards regression model.

    abstract::A formula is derived for determining the number of observations necessary to test the equality of two survival distributions when concomitant information is incorporated. This formula should be useful in designing clinical trials with a heterogeneous patient population. Schoenfeld (1981, Biometrika 68, 316-319) derive...

    journal_title:Biometrics

    pub_type: Clinical Trial, Journal Article

    doi:

    authors: Schoenfeld DA

    update_date:1983-06-01 00:00:00

  • A note on permutation tests for variance components in multilevel generalized linear mixed models.

    abstract::In many applications of generalized linear mixed models to multilevel data, it is of interest to test whether a random effects variance component is zero. It is well known that the usual asymptotic chi-square distribution of the likelihood ratio and score statistics under the null does not necessarily hold. In this no...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2007.00775.x

    authors: Fitzmaurice GM,Lipsitz SR,Ibrahim JG

    update_date:2007-09-01 00:00:00

  • Statistical inference for serial dilution assay data.

    abstract::Serial dilution assays are widely employed for estimating substance concentrations and minimum inhibitory concentrations. The Poisson-Bernoulli model for such assays is appropriate for count data but not for continuous measurements that are encountered in applications involving substance concentrations. This paper pre...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.1999.01215.x

    authors: Lee ML,Whitmore GA

    update_date:1999-12-01 00:00:00

  • Bayesian modeling of multiple episode occurrence and severity with a terminating event.

    abstract::An individual's health condition can affect the frequency and intensity of episodes that can occur repeatedly and that may be related to an event time of interest. For example, bleeding episodes during pregnancy may indicate problems predictive of preterm delivery. Motivated by this application, we propose a joint mod...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00720.x

    authors: Herring AH,Yang J

    update_date:2007-06-01 00:00:00

  • Improved estimation of the noncentrality parameter distribution from a large number of t-statistics, with applications to false discovery rate estimation in microarray data analysis.

    abstract::Given a large number of t-statistics, we consider the problem of approximating the distribution of noncentrality parameters (NCPs) by a continuous density. This problem is closely related to the control of false discovery rates (FDR) in massive hypothesis testing applications, e.g., microarray gene expression analysis...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2012.01764.x

    authors: Qu L,Nettleton D,Dekkers JC

    update_date:2012-12-01 00:00:00

  • Design considerations for efficient and effective microarray studies.

    abstract::This article describes the theoretical and practical issues in experimental design for gene expression microarrays. Specifically, this article 1) discusses the basic principles of design (randomization, replication, and blocking) as they pertain to microarrays, and 2) provides some general guidelines for statisticians...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2003.00096.x

    authors: Kerr MK

    update_date:2003-12-01 00:00:00

  • Capture-recapture estimation using finite mixtures of arbitrary dimension.

    abstract::Reversible jump Markov chain Monte Carlo (RJMCMC) methods are used to fit Bayesian capture-recapture models incorporating heterogeneity in individuals and samples. Heterogeneity in capture probabilities comes from finite mixtures and/or fixed sample effects allowing for interactions. Estimation by RJMCMC allows automa...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01289.x

    authors: Arnold R,Hayakawa Y,Yip P

    update_date:2010-06-01 00:00:00

  • Nonparametric estimation of relative mortality from nested case-control studies.

    abstract::Andersen et al. (1985, Biometrics 41, 921-932) gave an estimator of the cumulative relative mortality comparing rates of death in an epidemiologic cohort to an external population as a function of time when covariate information is available on all cohort members. We present an analogous estimator when covariate infor...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Borgan O,Langholz B

    update_date:1993-06-01 00:00:00

  • The gamma-frailty Poisson model for the nonparametric estimation of panel count data.

    abstract::In this article, we study nonparametric estimation of the mean function of a counting process with panel observations. We introduce the gamma frailty variable to account for the intracorrelation between the panel counts of the counting process and construct a maximum pseudo-likelihood estimate with the frailty variabl...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2003.00126.x

    authors: Zhang Y,Jamshidian M

    update_date:2003-12-01 00:00:00

  • Bayesian calibration of a stochastic kinetic computer model using multiple data sources.

    abstract::In this article, we describe a Bayesian approach to the calibration of a stochastic computer model of chemical kinetics. As with many applications in the biological sciences, the data available to calibrate the model come from different sources. Furthermore, these data appear to provide somewhat conflicting informatio...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01245.x

    authors: Henderson DA,Boys RJ,Wilkinson DJ

    update_date:2010-03-01 00:00:00

  • On the use of the variogram in checking for independence in spatial data.

    abstract::The variogram is a standard tool in the analysis of spatial data, and its shape provides useful information on the form of spatial correlation that may be present. However, it is also useful to be able to assess the evidence for the presence of any spatial correlation. A method of doing this, based on an assessment of...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00211.x

    authors: Diblasi A,Bowman AW

    update_date:2001-03-01 00:00:00