Abstract:
:The usual methods for analyzing case-cohort studies rely on sometimes not fully efficient weighted estimators. Multiple imputation might be a good alternative because it uses all the data available and approximates the maximum partial likelihood estimator. This method is based on the generation of several plausible complete data sets, taking into account uncertainty about missing values. When the imputation model is correctly defined, the multiple imputation estimator is asymptotically unbiased and its variance is correctly estimated. We show that a correct imputation model must be estimated from the fully observed data (cases and controls), using the case status among the explanatory variable. To validate the approach, we analyzed case-cohort studies first with completely simulated data and then with case-cohort data sampled from two real cohorts. The analyses of simulated data showed that, when the imputation model was correct, the multiple imputation estimator was unbiased and efficient. The observed gain in precision ranged from 8 to 37 per cent for phase-1 variables and from 5 to 19 per cent for the phase-2 variable. When the imputation model was misspecified, the multiple imputation estimator was still more efficient than the weighted estimators but it was also slightly biased. The analyses of case-cohort data sampled from complete cohorts showed that even when no strong predictor of the phase-2 variable was available, the multiple imputation was unbiased, as precised as the weighted estimator for the phase-2 variable and slightly more precise than the weighted estimators for the phase-1 variables. However, the multiple imputation estimator was found to be biased when, because of interaction terms, some coefficients of the imputation model had to be estimated from small samples. Multiple imputation is an efficient technique for analyzing case-cohort data. Practically, we suggest building the analysis model using only the case-cohort data and weighted estimators. Multiple imputation can eventually be used to reanalyze the data using the selected model in order to improve the precision of the results.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
Marti H,Chavance Mdoi
10.1002/sim.4130subject
Has Abstractpub_date
2011-06-15 00:00:00pages
1595-607issue
13eissn
0277-6715issn
1097-0258journal_volume
30pub_type
杂志文章abstract::In medical research, risk difference (RD) and number needed to treat (NNT) measures for survival times have been mainly proposed without consideration of covariates. In this paper, we develop adjusted RD and NNT measures for use in observational studies with survival time outcomes within the framework of the Cox propo...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3793
更新日期:2010-03-30 00:00:00
abstract::We consider the use of the assurance method in clinical trial planning. In the assurance method, which is an alternative to a power calculation, we calculate the probability of a clinical trial resulting in a successful outcome, via eliciting a prior probability distribution about the relevant treatment effect. This i...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5916
更新日期:2014-01-15 00:00:00
abstract::We present graphical and numerical methods for assessing the adequacy of the logistic regression model for stratified case-control data. The proposed methods are derived from the cumulative sum of residuals over the covariate or linear predictor. Under the assumed model, the cumulative residual process converges weakl...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1932
更新日期:2005-01-30 00:00:00
abstract::In vitro fertilization (IVF) is an increasingly common method of assisted reproductive technology. Because of the careful observation and follow-up required as part of the procedure, IVF studies provide an ideal opportunity to identify and assess clinical and demographic factors along with environmental exposures that...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6050
更新日期:2014-05-10 00:00:00
abstract::Although transportation safety has greatly improved over the past 2 decades, motor vehicle crash injuries remain a leading cause of morbidity and mortality, particularly among young drivers. Driver errors and behaviors such as speeding and distraction contribute disproportionately to crashes among inexperienced novice...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7404
更新日期:2017-10-30 00:00:00
abstract::Novel therapies are challenging the standards of drug development. Agents with specific biologic targets, unknown dose-efficacy curves, and limited toxicity mandate novel designs to identify biologically optimal doses. We review two model-based designs that utilize either a proportional odds model or a continuation ra...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3706
更新日期:2010-05-10 00:00:00
abstract::In a cluster randomized cross-over trial, all participating clusters receive both intervention and control treatments consecutively, in separate time periods. Patients recruited by each cluster within the same time period receive the same intervention, and randomization determines order of treatment within a cluster. ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2537
更新日期:2007-01-30 00:00:00
abstract::It is often of interest to use observational data to estimate the causal effect of a target exposure or treatment on an outcome. When estimating the treatment effect, it is essential to appropriately adjust for selection bias due to observed confounders using, for example, propensity score weighting. Selection bias du...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8549
更新日期:2020-08-15 00:00:00
abstract::The generalized estimating equations (GEE) approach is commonly used to model incomplete longitudinal binary data. When drop-outs are missing at random through dependence on observed responses (MAR), GEE may give biased parameter estimates in the model for the marginal means. A weighted estimating equations approach g...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1241
更新日期:2002-10-30 00:00:00
abstract::Health authorities are often alerted to suspected cancer clusters near the vicinity of potential point sources by members of the public. A surveillance system, where administrative regions around the potential point sources are regularly monitored for high disease rates, would allow for responses which are easier to o...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19960415)15:7/9<727::aid-s
更新日期:1996-04-15 00:00:00
abstract::This paper investigates the impact on area life tables of the specification of unobserved frailty. Frailty specification may affect both the regression effects of area and individual level covariates, and lead to changes in the value of summary mortality parameters, such as life expectancy. The paper also investigates...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780141703
更新日期:1995-09-15 00:00:00
abstract::In observational cohort studies we may wish to examine the associations between fixed patient characteristics and the longitudinal changes from baseline in a repeated outcome measure. Many biological and other outcome measures are known to be subject to measurement error and biological variation. In an initial analysi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3725
更新日期:2009-11-20 00:00:00
abstract::Confidence intervals for a standardized effect are derived after stabilizing the variance of the Welch t-statistic. Simulation studies demonstrate the viability of the resulting intervals for a wide range of parameter values and sample sizes as small as five. The methodology is extended to the combination of results f...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2751
更新日期:2007-06-30 00:00:00
abstract::Conditional power based on summary statistic by comparing outcomes (such as the sample mean) directly between 2 groups is a convenient tool for decision making in randomized controlled trial studies. In this paper, we extend the traditional summary statistic-based conditional power with a general model-based assessmen...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7454
更新日期:2017-12-30 00:00:00
abstract::The U.S. Food and Drug Administration (FDA) has proposed new regulations that address the 'prescribability' and 'switchability' of new formulations of already-approved drugs. These new criteria are known, respectively, as population and individual bioequivalence. Two methods have been proposed in the bioequivalence li...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1346
更新日期:2003-01-15 00:00:00
abstract::In modern observational studies using electronic health records or other routinely collected data, both the outcome and covariates of interest can be error-prone and their errors often correlated. A cost-effective solution is the two-phase design, under which the error-prone outcome and covariates are observed for all...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8799
更新日期:2021-02-10 00:00:00
abstract::For time-to-event outcomes, a rich literature exists on the bias introduced by covariate measurement error in regression models, such as the Cox model, and methods of analysis to address this bias. By comparison, less attention has been given to understanding the impact or addressing errors in the failure time outcome...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7554
更新日期:2018-04-15 00:00:00
abstract::Researchers in clinical science and bioinformatics frequently aim to learn which of a set of candidate biomarkers is important in determining a given outcome, and to rank the contributions of the candidates accordingly. This article introduces a new approach to research questions of this type, based on targeted maximu...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3414
更新日期:2009-01-15 00:00:00
abstract::The correct identification of change-points during ongoing outbreak investigations of infectious diseases is a matter of paramount importance in epidemiology, with major implications for the management of health care resources, public health and, as the COVID-19 pandemic has shown, social live. Onsets, peaks, and infl...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8807
更新日期:2021-02-20 00:00:00
abstract::Phase II trials often test the null hypothesis H(0): p
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2653
更新日期:2007-03-30 00:00:00
abstract::Immigration-death models are proposed to analyse the infection dynamics in longitudinal studies of panels of heavily parasitized human hosts where parasites have been typed at regular intervals by PCR. Immigration refers to the acquisition of a new parasitic genotype, occurring at rate lambda, and death refers to the ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2189
更新日期:2005-11-15 00:00:00
abstract::A study of long term survival of 1487 patients given an allogeneic bone marrow transplant for acute myelogenous leukaemia and 729 patients given a transplant for severe aplastic anaemia was conducted by the International Bone Marrow Transplant Registry. One aim of this study is to determine if the mortality rates of t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990630)18:12<1529::aid-s
更新日期:1999-06-30 00:00:00
abstract::Previous work on the consequences of regression to the mean for the interpretation of responses to treatment is extended to the situation where the response measured is the proportional change in some variable. Methods for correcting for the bias are discussed. ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780060203
更新日期:1987-03-01 00:00:00
abstract::Population heterogeneity is frequently observed among patients' treatment responses in clinical trials because of various factors such as clinical background, environmental, and genetic factors. Different subpopulations defined by those baseline factors can lead to differences in the benefit or safety profile of a the...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7925
更新日期:2018-12-20 00:00:00
abstract::The Box-Cox power exponential (BCPE) distribution, developed in this paper, provides a model for a dependent variable Y exhibiting both skewness and kurtosis (leptokurtosis or platykurtosis). The distribution is defined by a power transformation Y(nu) having a shifted and scaled (truncated) standard power exponential ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1861
更新日期:2004-10-15 00:00:00
abstract::Identification of the latency period for the effect of a time-varying exposure is key when assessing many environmental, nutritional, and behavioral risk factors. A pre-specified exposure metric involving an unknown latency parameter is often used in the statistical model for the exposure-disease relationship. Likelih...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8038
更新日期:2019-03-30 00:00:00
abstract::This paper reviews several emerging and recurrent issues relating to the drug development process. These emerging issues include changes to the FDA regulatory environment, internationalization of drug development, advances in computer technology and visualization tools, and efforts to incorporate meta-analysis methodo...
journal_title:Statistics in medicine
pub_type: 杂志文章,评审
doi:10.1002/(sici)1097-0258(19990915/30)18:17/18<2301:
更新日期:1999-09-15 00:00:00
abstract::The focus of this paper is the development of a range of cluster detection diagnostics that can be used to assess the degree to which a clustering method recovers the true clustering behaviour of small area data. The diagnostics proposed range from individual region specific diagnostics to neighbourhood diagnostics, a...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2401
更新日期:2006-03-15 00:00:00
abstract::Random forest is a supervised learning method that combines many classification or regression trees for prediction. Here we describe an extension of the random forest method for building event risk prediction models in survival analysis with competing risks. In case of right-censored data, the event status at the pred...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5775
更新日期:2013-08-15 00:00:00
abstract::Estimates of relative efficacy between alternative treatments are crucial for decision making in health care. Bayesian mixed treatment comparison models provide a powerful methodology to obtain such estimates when head-to-head evidence is not available or insufficient. In recent years, this methodology has become wide...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5764
更新日期:2013-07-30 00:00:00