Novel two-phase sampling designs for studying binary outcomes.

Abstract:

In biomedical cohort studies that assess the association between an outcome variable and a set of covariates, some covariates can usually be measured only on a subgroup of study subjects. An important design question is which subjects to select into the subgroup to increase statistical efficiency. When the outcome is binary, one may adopt a case-control sampling design or a balanced case-control design, in which cases and controls are further matched on a small number of complete discrete covariates. While the latter succeeds in estimating odds ratio (OR) parameters for the matching covariates, similar two-phase design options have not been explored for the remaining covariates, especially the incompletely collected ones. This gap matters in studies where the covariates of interest cannot be completely collected. To this end, assuming that an external model is available to relate the outcome and the complete covariates, we propose a novel sampling scheme that oversamples cases and controls with worse goodness-of-fit under the external model and further matches them on complete covariates, as in the balanced design. We develop a pseudolikelihood method for estimating the OR parameters. Through simulation studies and explorations in a real cohort study, we find that our design generally reduces the asymptotic variances of the OR estimates, and that the reduction for the matching covariates is comparable to that of the balanced design.
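
The selection idea described in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' algorithm, whose details the abstract does not give. It assumes each subject carries a binary outcome `y`, one discrete complete covariate `z`, and a predicted probability `p_ext` from the external model (all hypothetical field names). It also assumes that "worse goodness-of-fit" can be measured by the absolute residual |y - p_ext|, and that an equal number of subjects is drawn from each (outcome, covariate) cell. The function name `select_phase_two` is likewise hypothetical.

```python
import random
from collections import defaultdict


def select_phase_two(subjects, n_per_cell, seed=0):
    """Illustrative phase-two selection (assumed scheme, not the paper's).

    Draws up to n_per_cell subjects from every (y, z) cell, giving a
    higher selection chance to subjects the external model fits poorly.
    """
    rng = random.Random(seed)

    # Group subjects by outcome and matching covariate, as in a balanced design.
    cells = defaultdict(list)
    for s in subjects:
        cells[(s["y"], s["z"])].append(s)

    selected = []
    for cell in sorted(cells):
        keyed = []
        for s in cells[cell]:
            # Assumed fit measure: absolute residual, floored to stay positive.
            weight = max(abs(s["y"] - s["p_ext"]), 1e-12)
            # Efraimidis-Spirakis key: weighted sampling without replacement.
            keyed.append((rng.random() ** (1.0 / weight), s["id"]))
        keyed.sort(reverse=True)
        selected.extend(sid for _, sid in keyed[:n_per_cell])
    return selected
```

The sketch covers selection only. Any analysis of the selected subjects would still have to account for the unequal selection probabilities, which is the role the abstract assigns to the pseudolikelihood method.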

journal_name

Biometrics

journal_title

Biometrics

authors

Wang L,Williams ML,Chen Y,Chen J

doi

10.1111/biom.13140

pub_date

2020-03-01 00:00:00

pages

210-223

issue

1

eissn

1541-0420

issn

0006-341X

journal_volume

76

pub_type

Journal article
  • Variable selection for logistic regression using a prediction-focused information criterion.

    abstract::In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables....

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.1541-0420.2006.00567.x

    authors: Claeskens G,Croux C,Van Kerckhoven J

    update_date:2006-12-01 00:00:00

  • Simple test for the Hardy-Weinberg law for HLA data with no observed double blanks.

    abstract::Eguchi and Matsuura (1990, Biometrics 46, 415-426) noted that the generalized Stevens test statistic for the Hardy-Weinberg law for human leukocyte antigen (HLA) data yields an excessively large value when no double blanks are observed. In this paper, we investigated this aberrant case. The inflated value of the test ...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Nam J

    update_date:1995-03-01 00:00:00

  • Valid inference in random effects meta-analysis.

    abstract::The standard approach to inference for random effects meta-analysis relies on approximating the null distribution of a test statistic by a standard normal distribution. This approximation is asymptotic on k, the number of studies, and can be substantially in error in medical meta-analyses, which often have only a few ...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.0006-341x.1999.00732.x

    authors: Follmann DA,Proschan MA

    update_date:1999-09-01 00:00:00

  • A one-step-ahead pseudo-DIC for comparison of Bayesian state-space models.

    abstract::In the context of state-space modeling, conventional usage of the deviance information criterion (DIC) evaluates the ability of the model to predict an observation at time t given the underlying state at time t. Motivated by the failure of conventional DIC to clearly choose between competing multivariate nonlinear Bay...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12237

    authors: Millar RB,McKechnie S

    update_date:2014-12-01 00:00:00

  • Nonparametric estimation of relative mortality from nested case-control studies.

    abstract::Andersen et al. (1985, Biometrics 41, 921-932) gave an estimator of the cumulative relative mortality comparing rates of death in an epidemiologic cohort to an external population as a function of time when covariate information is available on all cohort members. We present an analogous estimator when covariate infor...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Borgan O,Langholz B

    update_date:1993-06-01 00:00:00

  • Fitting nonlinear and constrained generalized estimating equations with optimization software.

    abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.0006-341x.2000.01268.x

    authors: Contreras M,Ryan LM

    update_date:2000-12-01 00:00:00

  • Randomization model methods for evaluating treatment efficacy in multicenter clinical trials.

    abstract::This paper studies randomization model methods for analyzing data from a multicenter study comparing the effectiveness of two treatments. The Mantel-Haenszel mean score statistic, which can be used for continuous or ordered categorical response variables, is shown to be a useful nonparametric alternative to standard l...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Davis CS,Chung Y

    update_date:1995-09-01 00:00:00

  • Identification of differential aberrations in multiple-sample array CGH studies.

    abstract::Most existing methods for identifying aberrant regions with array CGH data are confined to a single target sample. Focusing on the comparison of multiple samples from two different groups, we develop a new penalized regression approach with a fused adaptive lasso penalty to accommodate the spatial dependence of the cl...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.1541-0420.2010.01457.x

    authors: Wang HJ,Hu J

    update_date:2011-06-01 00:00:00

  • UMPU and alternative tests for association in 2 x 2 tables.

    abstract::The use of the uniformly most powerful among the unbiased (UMPU) test was recently suggested for the study of gametic association between two polymorphic loci as an alternative to the Fisher's exact test (Zapata and Alvarez, 1997, Annals of Human Genetics 61, 71-77). However, the proposed test is not UMPU for two-side...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.0006-341x.2001.00535.x

    authors: Fuchs C

    update_date:2001-06-01 00:00:00

  • Combined maximum likelihood estimates for the equicorrelation coefficient.

    abstract::Combined maximum likelihood estimates for equicorrelation covariance matrices are considered. The case of a common equicorrelation rho and possibly different standard deviations sigma 1, ..., sigma k among k experimental groups is examined first, and the estimation of (rho, sigma 1, ..., sigma k) is discussed. Second,...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Viana MA

    update_date:1994-09-01 00:00:00

  • On logit confidence intervals for the odds ratio with small samples.

    abstract::Unless the true association is very strong, simple large-sample confidence intervals for the odds ratio based on the delta method perform well even for small samples. Such intervals include the Woolf logit interval and the related Gart interval based on adding .5 before computing the log odds ratio estimate and its st...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.0006-341x.1999.00597.x

    authors: Agresti A

    update_date:1999-06-01 00:00:00

  • Alternative hypotheses for the effects of drugs in small-scale clinical studies.

    abstract::New drugs that will be investigated in the future are expected to deal with chronic diseases, where the number of patients available for controlled clinical trials will be small and where the long-term sequelae that it is hoped will be ameliorated take a long time to occur. Thus, it would be useful to construct powerf...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Salsburg D

    update_date:1986-09-01 00:00:00

  • Catch estimation with restricted randomization in the effort survey.

    abstract::One common method for estimating total catch is to multiply an estimate for CPUE, the catch per unit effort, by an estimate of total effort obtained from an independent second survey. In general, estimating total effort requires that sample times are chosen at random over the full fishing period; however, in practice,...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.0006-341x.2001.00461.x

    authors: Dauk PC,Schwarz CJ

    update_date:2001-06-01 00:00:00

  • On modelling microbial infections.

    abstract::Several models for the course of microbial infections during the incubation period are examined. Each model fits sore throat incubation data very well. Together with the lack of precision of the resulting parameter estimates, this suggests that incubation data alone are insufficient for elucidating the finer details o...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Morgan BJ,Watts SA

    update_date:1980-06-01 00:00:00

  • A biological marker model for predicting disease transitions.

    abstract::For patients with chronic myelogenous leukemia (CML), the effect of elevated blood levels of adenosine deaminase (ADA) is studied as a marker for transitions from stable disease to blast crisis and then to death. Data in the form of snapshots over time, with day, state of disease, and ADA level, are analyzed for 55 pa...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Klein JP,Klotz JH,Grever MR

    update_date:1984-12-01 00:00:00

  • Incorporating Patient Preferences into Estimation of Optimal Individualized Treatment Rules.

    abstract::Precision medicine seeks to provide treatment only if, when, to whom, and at the dose it is needed. Thus, precision medicine is a vehicle by which healthcare can be made both more effective and efficient. Individualized treatment rules operationalize precision medicine as a map from current patient information to a re...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12743

    authors: Butler EL,Laber EB,Davis SM,Kosorok MR

    update_date:2018-03-01 00:00:00

  • Asymptotic confidence bands for generalized nonlinear regression models.

    abstract::Asymptotic confidence bands for generalized nonlinear regression models are developed. These are based on a combination of the S method of Scheffe, together with the delta method which is used to approximate the mean function by a linear combination of the parameters. The approach can be used in any situation where la...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Cox C,Ma G

    update_date:1995-03-01 00:00:00

  • Robust inference for the stepped wedge design.

    abstract::Stepped wedge designed trials are a type of cluster-randomized study in which the intervention is introduced to each cluster in a random order over time. This design is often used to assess the effect of a new intervention as it is rolled out across a series of clinics or communities. Based on a permutation argument, ...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.13106

    authors: Hughes JP,Heagerty PJ,Xia F,Ren Y

    update_date:2020-03-01 00:00:00

  • Multilist population estimation with incomplete and partial stratification.

    abstract::Multilist capture-recapture methods are commonly used to estimate the size of elusive populations. In many situations, lists are stratified by distinguishing features, such as age or sex. Stratification has often been used to reduce biases caused by heterogeneity in the probability of list membership among members of ...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.1541-0420.2007.00767.x

    authors: Sutherland JM,Schwarz CJ,Rivest LP

    update_date:2007-09-01 00:00:00

  • Semiparametric modelling and estimation of covariate-adjusted dependence between bivariate recurrent events.

    abstract::A time-dependent measure, termed the rate ratio, was proposed to assess the local dependence between two types of recurrent event processes in one-sample settings. However, the one-sample work does not consider modeling the dependence by covariates such as subject characteristics and treatments received. The focus of ...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.13229

    authors: Ning J,Cai C,Chen Y,Huang X,Wang MC

    update_date:2020-12-01 00:00:00

  • Modeling longitudinal data with nonparametric multiplicative random effects jointly with survival data.

    abstract::In clinical studies, longitudinal biomarkers are often used to monitor disease progression and failure time. Joint modeling of longitudinal and survival data has certain advantages and has emerged as an effective way to mutually enhance information. Typically, a parametric longitudinal model is assumed to facilitate t...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.1541-0420.2007.00896.x

    authors: Ding J,Wang JL

    update_date:2008-06-01 00:00:00

  • Quantifying and comparing dynamic predictive accuracy of joint models for longitudinal marker and time-to-event in presence of censoring and competing risks.

    abstract::Thanks to the growing interest in personalized medicine, joint modeling of longitudinal marker and time-to-event data has recently started to be used to derive dynamic individual risk predictions. Individual predictions are called dynamic because they are updated when information on the subject's health profile grows ...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12232

    authors: Blanche P,Proust-Lima C,Loubère L,Berr C,Dartigues JF,Jacqmin-Gadda H

    update_date:2015-03-01 00:00:00

  • Biased and unbiased estimation in longitudinal studies with informative visit processes.

    abstract::The availability of data in longitudinal studies is often driven by features of the characteristics being studied. For example, clinical databases are increasingly being used for research to address longitudinal questions. Because visit times in such data are often driven by patient characteristics that may be related...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12501

    authors: McCulloch CE,Neuhaus JM,Olin RL

    update_date:2016-12-01 00:00:00

  • Likelihood ratio tests for a dose-response effect using multiple nonlinear regression models.

    abstract::We consider the problem of testing for a dose-related effect based on a candidate set of (typically nonlinear) dose-response models using likelihood-ratio tests. For the considered models this reduces to assessing whether the slope parameter in these nonlinear regression models is zero or not. A technical problem is t...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12563

    authors: Gutjahr G,Bornkamp B

    update_date:2017-03-01 00:00:00

  • Mixture models for estimating the size of a closed population when capture rates vary among individuals.

    abstract::We develop a parameterization of the beta-binomial mixture that provides sensible inferences about the size of a closed population when probabilities of capture or detection vary among individuals. Three classes of mixture models (beta-binomial, logistic-normal, and latent-class) are fitted to recaptures of snowshoe h...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/1541-0420.00042

    authors: Dorazio RM,Royle JA

    update_date:2003-06-01 00:00:00

  • Alternative estimation procedures for Pr(X less than Y) in categorized data.

    abstract::Consider two independent random variables X and Y. The functional R = Pr(X less than Y) [or gamma = Pr(X less than Y) - Pr(Y less than X)] is of practical importance in many situations, including clinical trials, genetics, and reliability. In this paper several approaches to estimation of gamma when X and Y are presen...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Simonoff JS,Hochberg Y,Reiser B

    update_date:1986-12-01 00:00:00

  • Dynamic models for estimating the effect of HAART on CD4 in observational studies: Application to the Aquitaine Cohort and the Swiss HIV Cohort Study.

    abstract::Highly active antiretroviral therapy (HAART) has proved efficient in increasing CD4 counts in many randomized clinical trials. Because randomized trials have some limitations (e.g., short duration, highly selected subjects), it is interesting to assess the effect of treatments using observational studies. This is chal...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12564

    authors: Prague M,Commenges D,Gran JM,Ledergerber B,Young J,Furrer H,Thiébaut R

    update_date:2017-03-01 00:00:00

  • Multi-subgroup gene screening using semi-parametric hierarchical mixture models and the optimal discovery procedure: Application to a randomized clinical trial in multiple myeloma.

    abstract::This article proposes an efficient approach to screening genes associated with a phenotypic variable of interest in genomic studies with subgroups. In order to capture and detect various association profiles across subgroups, we flexibly estimate the underlying effect size distribution across subgroups using a semi-pa...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/biom.12716

    authors: Matsui S,Noma H,Qu P,Sakai Y,Matsui K,Heuck C,Crowley J

    update_date:2018-03-01 00:00:00

  • Statistical testing of genetic linkage under heterogeneity.

    abstract::Recent advances in human genetics have led to a renewed interest in statistical methods for the detection of linkage from family data--for example, between marker loci and disease traits. Statistical analysis of linkage between two loci is carried out almost exclusively by means of the lod (log-odds) score test, equiv...

    journal_title:Biometrics

    pub_type: Journal article

    doi:

    authors: Shoukri MM,Lathrop GM

    update_date:1993-03-01 00:00:00

  • Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.

    abstract::We consider a semiparametric regression model that relates a normal outcome to covariates and a genetic pathway, where the covariate effects are modeled parametrically and the pathway effect of multiple gene expressions is modeled parametrically or nonparametrically using least-squares kernel machines (LSKMs). This un...

    journal_title:Biometrics

    pub_type: Journal article

    doi:10.1111/j.1541-0420.2007.00799.x

    authors: Liu D,Lin X,Ghosh D

    update_date:2007-12-01 00:00:00