Statistical inferences for data from studies conducted with an aggregated multivariate outcome-dependent sample design.

Abstract:

:Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data, and the general ODS design for a continuous response. While substantial work has been carried out for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome-dependent sampling (multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the multivariate-ODS or the estimator from a simple random sample with the same sample size. The multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of polychlorinated biphenyl exposure to hearing loss in children born to the Collaborative Perinatal Study. Copyright © 2016 John Wiley & Sons, Ltd.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Lu TS,Longnecker MP,Zhou H

doi

10.1002/sim.7195

subject

Has Abstract

pub_date

2017-03-15 00:00:00

pages

985-997

issue

6

eissn

0277-6715

issn

1097-0258

journal_volume

36

pub_type

杂志文章
  • Exact test size and power of a Gaussian error linear model for an internal pilot study.

    abstract::Wittes and Brittain recommended using an 'internal pilot study' to adjust sample size. The approach involves five steps in testing a general linear hypothesis for a general linear univariate model, with Gaussian errors. First, specify the design, hypothesis, desired test size, power, a smallest 'clinically meaningful'...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990530)18:10<1199::aid-s

    authors: Coffey CS,Muller KE

    更新日期:1999-05-30 00:00:00

  • A multiple imputation strategy for incomplete longitudinal data.

    abstract::Longitudinal studies are commonly used to study processes of change. Because data are collected over time, missing data are pervasive in longitudinal studies, and complete ascertainment of all variables is rare. In this paper a new imputation strategy for completing longitudinal data sets is proposed. The proposed met...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.740

    authors: Landrum MB,Becker MP

    更新日期:2001-09-15 00:00:00

  • Explaining community-level variance in group randomized trials.

    abstract::Between-community variance or community-by-time variance is one of the key factors driving the cost of conducting group randomized trials, which are often very expensive. We investigated empirically whether between-community variance could be reduced by controlling individual- and/or community-level covariates and ide...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990315)18:5<539::aid-sim

    authors: Feng Z,Diehr P,Yasui Y,Evans B,Beresford S,Koepsell TD

    更新日期:1999-03-15 00:00:00

  • Variation in heart disease mortality across census tracts as a function of overdispersion and social class mixture.

    abstract::Variation in heart disease (HD) mortality rates across census tracts is greater than expected given binomial error and available explanatory variables. We extended an extra-binomial variation model for rates standardized by the direct method. The overdispersion parameter accounted for 36 per cent of the observed varia...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780091009

    authors: Jarjoura D,Logue E

    更新日期:1990-10-01 00:00:00

  • Re-use of case-control data for analysis of new outcome variables.

    abstract::Case-control studies are usually defined to investigate risk factors for a single disease of interest. However, subsequent to data collection, investigators may wish to examine as an 'outcome' a variable that was an exposure in the original study. A naive analysis that disregards the sampling strategy that gave rise t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2398

    authors: Reilly M,Torrång A,Klint A

    更新日期:2005-12-30 00:00:00

  • Predictive value of statistical models.

    abstract::A review is given of different ways of estimating the error rate of a prediction rule based on a statistical model. A distinction is drawn between apparent, optimum and actual error rates. Moreover it is shown how cross-validation can be used to obtain an adjusted predictor with smaller error rate. A detailed discussi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780091109

    authors: Van Houwelingen JC,Le Cessie S

    更新日期:1990-11-01 00:00:00

  • A spatial scan statistic for ordinal data.

    abstract::Spatial scan statistics are widely used for count data to detect geographical disease clusters of high or low incidence, mortality or prevalence and to evaluate their statistical significance. Some data are ordinal or continuous in nature, however, so that it is necessary to dichotomize the data to use a traditional s...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2607

    authors: Jung I,Kulldorff M,Klassen AC

    更新日期:2007-03-30 00:00:00

  • A mediation analysis for a nonrare dichotomous outcome with sequentially ordered multiple mediators.

    abstract::Mediation analyses can help us to understand the biological mechanism in which an exposure or treatment affects an outcome. Single mediator analyses have been used in various applications, but may not be appropriate for analyzing intricate mechanisms involving multiple mediators that affect each other. Thus, in this a...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8485

    authors: Lai EY,Shih S,Huang YT,Wang S

    更新日期:2020-05-15 00:00:00

  • Regression models for mixed Poisson and continuous longitudinal data.

    abstract::In this article we develop flexible regression models in two respects to evaluate the influence of the covariate variables on the mixed Poisson and continuous responses and to evaluate how the correlation between Poisson response and continuous response changes over time. A scenario for dealing with regression models ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2776

    authors: Yang Y,Kang J,Mao K,Zhang J

    更新日期:2007-09-10 00:00:00

  • Sequential designs for phase III clinical trials incorporating treatment selection.

    abstract::Most statistical methodology for phase III clinical trials focuses on the comparison of a single experimental treatment with a control. An increasing desire to reduce the time before regulatory approval of a new drug is sought has led to development of two-stage or sequential designs for trials that combine the defini...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1362

    authors: Stallard N,Todd S

    更新日期:2003-03-15 00:00:00

  • An analysis of disease surveillance data that uses the geographic locations of the reporting units.

    abstract::The primary purpose of a disease surveillance system is to provide data for the detection of changes in the incidence of the disease. Methods for the analysis of data from surveillance systems are reviewed. A new procedure is proposed for use when the system includes geographically dispersed reporting units, such as h...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780080306

    authors: Raubertas RF

    更新日期:1989-03-01 00:00:00

  • Causal inference in survival analysis using pseudo-observations.

    abstract::Causal inference for non-censored response variables, such as binary or quantitative outcomes, is often based on either (1) direct standardization ('G-formula') or (2) inverse probability of treatment assignment weights ('propensity score'). To do causal inference in survival analysis, one needs to address right-censo...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7297

    authors: Andersen PK,Syriopoulou E,Parner ET

    更新日期:2017-07-30 00:00:00

  • Modelling risk when binary outcomes are subject to error.

    abstract::We present methods for binomial regression when the outcome is determined using the results of a single diagnostic test with imperfect sensitivity and specificity. We present our model, illustrate it with the analysis of real data, and provide an example of WinBUGS program code for performing such an analysis. Conditi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1656

    authors: McInturff P,Johnson WO,Cowling D,Gardner IA

    更新日期:2004-04-15 00:00:00

  • Spatial clustering of the failure to geocode and its implications for the detection of disease clustering.

    abstract::Geocoding a study population as completely as possible is an important data assimilation component of many spatial epidemiologic studies. Unfortunately, complete geocoding is rare in practice. The failure of a substantial proportion of study subjects' addresses to geocode has consequences for spatial analyses, some of...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3288

    authors: Zimmerman DL,Fang X,Mazumdar S

    更新日期:2008-09-20 00:00:00

  • Behind closed doors: the data monitoring board in randomized clinical trials.

    abstract::Many randomized clinical trials include a data and safety monitoring board (DSMB) that is responsible for reviewing accruing data, monitoring performance of the trial, assuring safety of the participants in the trial, and assessing the efficacy of treatment. The DSMB often makes recommendations about continuation of t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780120504

    authors: Wittes J

    更新日期:1993-03-01 00:00:00

  • Application of kriging models for a drug combination experiment on lung cancer.

    abstract::Combinatorial drugs have been widely applied in disease treatment, especially chemotherapy for cancer, due to its improved efficacy and reduced toxicity compared with individual drugs. The study of combinatorial drugs requires efficient experimental designs and proper follow-up statistical modeling techniques. Linear ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7971

    authors: Xiao Q,Wang L,Xu H

    更新日期:2019-01-30 00:00:00

  • Goodman and Kruskal's lambda: a new look at an old measure of association.

    abstract::We examine Goodman and Kruskal's lambda using Efron's approach to regression and analysis of variance (ANOVA) for zero-one outcome data. For a binary response cross-classified by a single nominal predictor, we present a computationally simple ANOVA table in which lambda is analogous to Pearson's R-square. We character...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780080511

    authors: Makuch RW,Rosenberg PS,Scott G

    更新日期:1989-05-01 00:00:00

  • Comparative calibration without a gold standard.

    abstract::Comparative calibration is the broad statistical methodology used to assess the calibration of a set of p instruments, each designed to measure the same characteristic, on a common group of individuals. Different from the usual calibration problem, the true underlying quantity measured is unobservable. Many authors ha...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19970830)16:16<1889::aid-s

    authors: Lu Y,Ye K,Mathur AK,Hui S,Fuerst TP,Genant HK

    更新日期:1997-08-30 00:00:00

  • First steps in analysing NHS waiting times: avoiding the 'stationary and closed population' fallacy.

    abstract::The aim of this paper is to demonstrate the effect of excluding incomplete observations and competing events when calculating cross-sectional measures of NHS waiting times, and to obtain a more accurate estimate of the 'time-to-admission' of those listed on NHS waiting lists using life-table methods. The official 'tim...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/1097-0258(20000815)19:15<2037::aid-sim606>

    authors: Armstrong PW

    更新日期:2000-08-15 00:00:00

  • Comparison of hypertabastic survival model with other unimodal hazard rate functions using a goodness-of-fit test.

    abstract::We studied the problem of testing a hypothesized distribution in survival regression models when the data is right censored and survival times are influenced by covariates. A modified chi-squared type test, known as Nikulin-Rao-Robson statistic, is applied for the comparison of accelerated failure time models. This st...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7244

    authors: Tahir MR,Tran QX,Nikulin MS

    更新日期:2017-05-30 00:00:00

  • Statistical inferences of extended concentration indices for directly standardized rates.

    abstract::The relative concentration index (RCI) and the absolute concentration index (ACI) have been widely used for monitoring health disparities with ranked health determinants. The RCI has been extended to allow value judgments about inequality aversion by Pereira in 1998 and by Wagstaff in 2002. Previous studies of the ext...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7952

    authors: Yu M,Liu B,Li Y,Zou ZJ,Breen N

    更新日期:2019-01-15 00:00:00

  • Classification using ensemble learning under weighted misclassification loss.

    abstract::Binary classification rules based on covariates typically depend on simple loss functions such as zero-one misclassification. Some cases may require more complex loss functions. For example, individual-level monitoring of HIV-infected individuals on antiretroviral therapy requires periodic assessment of treatment fail...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8082

    authors: Xu Y,Liu T,Daniels MJ,Kantor R,Mwangi A,Hogan JW

    更新日期:2019-05-20 00:00:00

  • Prevalence-dependent diagnostic accuracy measures.

    abstract::We study prevalence-dependent diagnostic accuracy measures, specifically, positive and negative predictive values. These measures permit an assessment of the clinical utility of diagnostic tests across populations with different disease prevalences. In many cases, prevalence may not be known with certainty and the eva...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2812

    authors: Li J,Fine JP,Safdar N

    更新日期:2007-07-30 00:00:00

  • The effect of salvage therapy on survival in a longitudinal study with treatment by indication.

    abstract::We consider using observational data to estimate the effect of a treatment on disease recurrence, when the decision to initiate treatment is based on longitudinal factors associated with the risk of recurrence. The effect of salvage androgen deprivation therapy (SADT) on the risk of recurrence of prostate cancer is in...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4017

    authors: Kennedy EH,Taylor JM,Schaubel DE,Williams S

    更新日期:2010-11-10 00:00:00

  • Variable selection in covariate dependent random partition models: an application to urinary tract infection.

    abstract::Lower urinary tract symptoms can indicate the presence of urinary tract infection (UTI), a condition that if it becomes chronic requires expensive and time consuming care as well as leading to reduced quality of life. Detecting the presence and gravity of an infection from the earliest symptoms is then highly valuable...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6786

    authors: Barcella W,Iorio MD,Baio G,Malone-Lee J

    更新日期:2016-04-15 00:00:00

  • Randomization tests for multiarmed randomized clinical trials.

    abstract::We examine the use of randomization-based inference for analyzing multiarmed randomized clinical trials, including the application of conditional randomization tests to multiple comparisons. The view is taken that the linkage of the statistical test to the experimental design (randomization procedure) should be recogn...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8418

    authors: Wang Y,Rosenberger WF,Uschner D

    更新日期:2020-02-20 00:00:00

  • Bayesian predictive approach to interim monitoring in clinical trials.

    abstract::This paper reviews Bayesian strategies for monitoring clinical trial data. It focuses on a Bayesian stochastic curtailment method based on the predictive probability of observing a clinically significant outcome at the scheduled end of the study given the observed data. The proposed method is applied to derive efficac...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2204

    authors: Dmitrienko A,Wang MD

    更新日期:2006-07-15 00:00:00

  • Semiparametric transformation models for joint analysis of multivariate recurrent and terminal events.

    abstract::Recurrent event data occur in many clinical and observational studies, and in these situations, there may exist a terminal event such as death that is related to the recurrent event of interest. In addition, sometimes more than one type of recurrent events may occur, that is, one may encounter multivariate recurrent e...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4306

    authors: Zhu L,Sun J,Srivastava DK,Tong X,Leisenring W,Zhang H,Robison LL

    更新日期:2011-11-10 00:00:00

  • Infant growth modelling using a shape invariant model with random effects.

    abstract::Models for infant growth have usually been based on parametric forms, commonly an exponential or similar model, which have been shown to fit poorly especially during the first year of life. An alternative approach is to use a non-parametric model, based on a shape invariant model (SIM), where a single function is tran...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2718

    authors: Beath KJ

    更新日期:2007-05-30 00:00:00

  • Reflecting on "A Statistician in Medicine" in 2020.

    abstract::In this commentary, we revisit Sir Austin Bradford Hill's seminal Alfred Watson Memorial Lecture in 1962 through the eyes of two practicing biostatisticians of the current era. We summarize some eternal takeaway messages from Hill's lecture regarding observations and experiments translated through the modern lexicon o...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8830

    authors: Dempsey W,Mukherjee B

    更新日期:2021-01-15 00:00:00