Abstract:
:Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data, and the general ODS design for a continuous response. While substantial work has been carried out for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome-dependent sampling (multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the multivariate-ODS or the estimator from a simple random sample with the same sample size. The multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of polychlorinated biphenyl exposure to hearing loss in children born to the Collaborative Perinatal Study. Copyright © 2016 John Wiley & Sons, Ltd.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
Lu TS,Longnecker MP,Zhou Hdoi
10.1002/sim.7195subject
Has Abstractpub_date
2017-03-15 00:00:00pages
985-997issue
6eissn
0277-6715issn
1097-0258journal_volume
36pub_type
杂志文章abstract::Wittes and Brittain recommended using an 'internal pilot study' to adjust sample size. The approach involves five steps in testing a general linear hypothesis for a general linear univariate model, with Gaussian errors. First, specify the design, hypothesis, desired test size, power, a smallest 'clinically meaningful'...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990530)18:10<1199::aid-s
更新日期:1999-05-30 00:00:00
abstract::Longitudinal studies are commonly used to study processes of change. Because data are collected over time, missing data are pervasive in longitudinal studies, and complete ascertainment of all variables is rare. In this paper a new imputation strategy for completing longitudinal data sets is proposed. The proposed met...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.740
更新日期:2001-09-15 00:00:00
abstract::Between-community variance or community-by-time variance is one of the key factors driving the cost of conducting group randomized trials, which are often very expensive. We investigated empirically whether between-community variance could be reduced by controlling individual- and/or community-level covariates and ide...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990315)18:5<539::aid-sim
更新日期:1999-03-15 00:00:00
abstract::Variation in heart disease (HD) mortality rates across census tracts is greater than expected given binomial error and available explanatory variables. We extended an extra-binomial variation model for rates standardized by the direct method. The overdispersion parameter accounted for 36 per cent of the observed varia...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780091009
更新日期:1990-10-01 00:00:00
abstract::Case-control studies are usually defined to investigate risk factors for a single disease of interest. However, subsequent to data collection, investigators may wish to examine as an 'outcome' a variable that was an exposure in the original study. A naive analysis that disregards the sampling strategy that gave rise t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2398
更新日期:2005-12-30 00:00:00
abstract::A review is given of different ways of estimating the error rate of a prediction rule based on a statistical model. A distinction is drawn between apparent, optimum and actual error rates. Moreover it is shown how cross-validation can be used to obtain an adjusted predictor with smaller error rate. A detailed discussi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780091109
更新日期:1990-11-01 00:00:00
abstract::Spatial scan statistics are widely used for count data to detect geographical disease clusters of high or low incidence, mortality or prevalence and to evaluate their statistical significance. Some data are ordinal or continuous in nature, however, so that it is necessary to dichotomize the data to use a traditional s...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2607
更新日期:2007-03-30 00:00:00
abstract::Mediation analyses can help us to understand the biological mechanism in which an exposure or treatment affects an outcome. Single mediator analyses have been used in various applications, but may not be appropriate for analyzing intricate mechanisms involving multiple mediators that affect each other. Thus, in this a...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8485
更新日期:2020-05-15 00:00:00
abstract::In this article we develop flexible regression models in two respects to evaluate the influence of the covariate variables on the mixed Poisson and continuous responses and to evaluate how the correlation between Poisson response and continuous response changes over time. A scenario for dealing with regression models ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2776
更新日期:2007-09-10 00:00:00
abstract::Most statistical methodology for phase III clinical trials focuses on the comparison of a single experimental treatment with a control. An increasing desire to reduce the time before regulatory approval of a new drug is sought has led to development of two-stage or sequential designs for trials that combine the defini...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1362
更新日期:2003-03-15 00:00:00
abstract::The primary purpose of a disease surveillance system is to provide data for the detection of changes in the incidence of the disease. Methods for the analysis of data from surveillance systems are reviewed. A new procedure is proposed for use when the system includes geographically dispersed reporting units, such as h...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780080306
更新日期:1989-03-01 00:00:00
abstract::Causal inference for non-censored response variables, such as binary or quantitative outcomes, is often based on either (1) direct standardization ('G-formula') or (2) inverse probability of treatment assignment weights ('propensity score'). To do causal inference in survival analysis, one needs to address right-censo...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7297
更新日期:2017-07-30 00:00:00
abstract::We present methods for binomial regression when the outcome is determined using the results of a single diagnostic test with imperfect sensitivity and specificity. We present our model, illustrate it with the analysis of real data, and provide an example of WinBUGS program code for performing such an analysis. Conditi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1656
更新日期:2004-04-15 00:00:00
abstract::Geocoding a study population as completely as possible is an important data assimilation component of many spatial epidemiologic studies. Unfortunately, complete geocoding is rare in practice. The failure of a substantial proportion of study subjects' addresses to geocode has consequences for spatial analyses, some of...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3288
更新日期:2008-09-20 00:00:00
abstract::Many randomized clinical trials include a data and safety monitoring board (DSMB) that is responsible for reviewing accruing data, monitoring performance of the trial, assuring safety of the participants in the trial, and assessing the efficacy of treatment. The DSMB often makes recommendations about continuation of t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780120504
更新日期:1993-03-01 00:00:00
abstract::Combinatorial drugs have been widely applied in disease treatment, especially chemotherapy for cancer, due to its improved efficacy and reduced toxicity compared with individual drugs. The study of combinatorial drugs requires efficient experimental designs and proper follow-up statistical modeling techniques. Linear ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7971
更新日期:2019-01-30 00:00:00
abstract::We examine Goodman and Kruskal's lambda using Efron's approach to regression and analysis of variance (ANOVA) for zero-one outcome data. For a binary response cross-classified by a single nominal predictor, we present a computationally simple ANOVA table in which lambda is analogous to Pearson's R-square. We character...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780080511
更新日期:1989-05-01 00:00:00
abstract::Comparative calibration is the broad statistical methodology used to assess the calibration of a set of p instruments, each designed to measure the same characteristic, on a common group of individuals. Different from the usual calibration problem, the true underlying quantity measured is unobservable. Many authors ha...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19970830)16:16<1889::aid-s
更新日期:1997-08-30 00:00:00
abstract::The aim of this paper is to demonstrate the effect of excluding incomplete observations and competing events when calculating cross-sectional measures of NHS waiting times, and to obtain a more accurate estimate of the 'time-to-admission' of those listed on NHS waiting lists using life-table methods. The official 'tim...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/1097-0258(20000815)19:15<2037::aid-sim606>
更新日期:2000-08-15 00:00:00
abstract::We studied the problem of testing a hypothesized distribution in survival regression models when the data is right censored and survival times are influenced by covariates. A modified chi-squared type test, known as Nikulin-Rao-Robson statistic, is applied for the comparison of accelerated failure time models. This st...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7244
更新日期:2017-05-30 00:00:00
abstract::The relative concentration index (RCI) and the absolute concentration index (ACI) have been widely used for monitoring health disparities with ranked health determinants. The RCI has been extended to allow value judgments about inequality aversion by Pereira in 1998 and by Wagstaff in 2002. Previous studies of the ext...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7952
更新日期:2019-01-15 00:00:00
abstract::Binary classification rules based on covariates typically depend on simple loss functions such as zero-one misclassification. Some cases may require more complex loss functions. For example, individual-level monitoring of HIV-infected individuals on antiretroviral therapy requires periodic assessment of treatment fail...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8082
更新日期:2019-05-20 00:00:00
abstract::We study prevalence-dependent diagnostic accuracy measures, specifically, positive and negative predictive values. These measures permit an assessment of the clinical utility of diagnostic tests across populations with different disease prevalences. In many cases, prevalence may not be known with certainty and the eva...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2812
更新日期:2007-07-30 00:00:00
abstract::We consider using observational data to estimate the effect of a treatment on disease recurrence, when the decision to initiate treatment is based on longitudinal factors associated with the risk of recurrence. The effect of salvage androgen deprivation therapy (SADT) on the risk of recurrence of prostate cancer is in...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4017
更新日期:2010-11-10 00:00:00
abstract::Lower urinary tract symptoms can indicate the presence of urinary tract infection (UTI), a condition that if it becomes chronic requires expensive and time consuming care as well as leading to reduced quality of life. Detecting the presence and gravity of an infection from the earliest symptoms is then highly valuable...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6786
更新日期:2016-04-15 00:00:00
abstract::We examine the use of randomization-based inference for analyzing multiarmed randomized clinical trials, including the application of conditional randomization tests to multiple comparisons. The view is taken that the linkage of the statistical test to the experimental design (randomization procedure) should be recogn...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8418
更新日期:2020-02-20 00:00:00
abstract::This paper reviews Bayesian strategies for monitoring clinical trial data. It focuses on a Bayesian stochastic curtailment method based on the predictive probability of observing a clinically significant outcome at the scheduled end of the study given the observed data. The proposed method is applied to derive efficac...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2204
更新日期:2006-07-15 00:00:00
abstract::Recurrent event data occur in many clinical and observational studies, and in these situations, there may exist a terminal event such as death that is related to the recurrent event of interest. In addition, sometimes more than one type of recurrent events may occur, that is, one may encounter multivariate recurrent e...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4306
更新日期:2011-11-10 00:00:00
abstract::Models for infant growth have usually been based on parametric forms, commonly an exponential or similar model, which have been shown to fit poorly especially during the first year of life. An alternative approach is to use a non-parametric model, based on a shape invariant model (SIM), where a single function is tran...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2718
更新日期:2007-05-30 00:00:00
abstract::In this commentary, we revisit Sir Austin Bradford Hill's seminal Alfred Watson Memorial Lecture in 1962 through the eyes of two practicing biostatisticians of the current era. We summarize some eternal takeaway messages from Hill's lecture regarding observations and experiments translated through the modern lexicon o...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8830
更新日期:2021-01-15 00:00:00