Sampling-based estimation for massive survival data with additive hazards model.

Abstract:

For massive survival data, we propose a subsampling algorithm to efficiently approximate the estimates of regression parameters in the additive hazards model. We establish consistency and asymptotic normality of the subsample-based estimator given the full data. The optimal subsampling probabilities are obtained via minimizing asymptotic variance of the resulting estimator. The subsample-based procedure can largely reduce the computational cost compared with the full data method. In numerical simulations, our method has low bias and satisfactory coverage probabilities. We provide an illustrative example on the survival analysis of patients with lymphoma cancer from the Surveillance, Epidemiology, and End Results Program.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Zuo L,Zhang H,Wang H,Liu L

doi

10.1002/sim.8783

subject

Has Abstract

pub_date

2021-01-30 00:00:00

pages

441-450

issue

2

eissn

1097-0258

issn

0277-6715

journal_volume

40

pub_type

Journal Article
  • Comparison of predictive values of two diagnostic tests from the same sample of subjects using weighted least squares.

    abstract::Screening and diagnostic tests are important in disease prevention or control. The predictive values of positive and negative (PPV and NPV) test results are two of four operational characteristics of a screening test. We review an existing method based on the generalized estimating equation (GEE) methodology for compa...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2332

    authors: Wang W,Davis CS,Soong SJ

    update_date:2006-07-15 00:00:00

  • Testing the equality of two survival functions with right truncated data.

    abstract::To compare the survival functions based on right-truncated data, Lagakos et al. proposed a weighted logrank test based on a reverse time scale. This is in contrast to Bilker and Wang, who suggested a semi-parametric version of the Mann-Whitney test by assuming that the distribution of truncation times is known or can ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2556

    authors: Chi Y,Tsai WY,Chiang CL

    update_date:2007-02-20 00:00:00

  • Linear regression for bivariate censored data via multiple imputation.

    abstract::Bivariate survival data arise, for example, in twin studies and studies of both eyes or ears of the same individual. Often it is of interest to regress the survival times on a set of predictors. In this paper we extend Wei and Tanner's multiple imputation approach for linear regression with univariate censored data to...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19991130)18:22<3111::aid-s

    authors: Pan W,Kooperberg C

    update_date:1999-11-30 00:00:00

  • Utility-based optimization of phase II/III programs.

    abstract::Phase II and phase III trials play a crucial role in drug development programs. They are costly and time consuming and, because of high failure rates in late development stages, at the same time risky investments. Commonly, sample size calculation of phase III is based on the treatment effect observed in phase II. The...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6624

    authors: Kirchner M,Kieser M,Götte H,Schüler A

    update_date:2016-01-30 00:00:00

  • Bounding the bias of unmeasured factors with confounding and effect-modifying potentials.

    abstract::Confounding is a major concern in observational studies. To adjust for confounding bias, the potential confounder(s) for a study must first be identified and measured. But this is not always possible. The unmeasured factors may also exhibit effect modification, and this further complicates the situation. In this paper...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4151

    authors: Lee WC

    update_date:2011-04-30 00:00:00

  • Quantifying the impact of between-study heterogeneity in multivariate meta-analyses.

    abstract::Measures that quantify the impact of heterogeneity in univariate meta-analysis, including the very popular I(2) statistic, are now well established. Multivariate meta-analysis, where studies provide multiple outcomes that are pooled in a single analysis, is also becoming more commonly used. The question of how to quan...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5453

    authors: Jackson D,White IR,Riley RD

    update_date:2012-12-20 00:00:00

  • Variable selection for proportional odds model.

    abstract::In this paper we study the problem of variable selection for the proportional odds model, which is a useful alternative to the proportional hazards model and might be appropriate when the proportional hazards assumption is not satisfied. We propose to fit the proportional odds model by maximizing the marginal likeliho...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2833

    authors: Lu W,Zhang HH

    update_date:2007-09-10 00:00:00

  • Estimating kappa from binocular data and comparing marginal probabilities.

    abstract::Suppose that two graders classify all eyes in a sample of patients for the presence or absence of a specified abnormality. In the statistical analysis of the data, possible correlation between the observations in the right and left eyes should be taken into account. Recently, general methods have been developed to ana...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780122306

    authors: Schouten HJ

    update_date:1993-12-15 00:00:00

  • Use of max and min scores for trend tests for association when the genetic model is unknown.

    abstract::In case-control studies, the Cochran-Armitage (CA) trend test is powerful for detection of an association between a risk allele and a marker. To apply this test, a score should be assigned to the genotypes based on the genetic model. When the underlying genetic model is unknown, the trend test statistic is a function ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1474

    authors: Zheng G

    update_date:2003-08-30 00:00:00

  • Promoting interactions with basic scientists and clinicians: the NIA Alzheimer's Disease Data Coordinating Center.

    abstract::To benefit Alzheimer's disease research, a central data co-ordinating centre (CDCC) is planned that will systematically collect data from 27 Alzheimer's disease centres (ADCs) located nationwide. This CDCC will combine, analyse and disseminate epidemiologic, demographic, clinical and neuropathological data to research...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(20000615/30)19:11/12<1453:

    authors: Cronin-Stubbs D,DeKosky ST,Morris JC,Evans DA

    update_date:2000-06-15 00:00:00

  • Discriminant analysis when all variables are ordered.

    abstract::Determination of the equation that relates an ordered dependent variable to ordered independent variables is sought. One solution, non-parametric discriminant analysis (NPD), involves obtaining the best monotonic step function by means of a computer search procedure. Although one can use alternative selection criteria...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780110804

    authors: Johnston B,Seshia SS

    update_date:1992-06-15 00:00:00

  • A non-parametric approach to the design and analysis of two-dimensional dose-finding trials.

    abstract::This paper investigates the design and analysis of dose-finding trials with two agents. The set of doses for each agent is fixed in advance. The goal of the trial is to find the set of dose combinations with probability of toxicity closest to a pre-specified value. For each of the two agents we assume that the probabi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1796

    authors: Ivanova A,Wang K

    update_date:2004-06-30 00:00:00

  • Non-inferiority trials: the 'at least as good as' criterion with dichotomous data.

    abstract::The 'at least as good as' criterion, introduced by Laster and Johnson for a continuous response variate, is developed here for applications with dichotomous data. This approach is adaptive in nature, as the margin of non-inferiority is not taken as a fixed difference; it varies as a function of the positive control re...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2476

    authors: Laster LL,Johnson MF,Kotler ML

    update_date:2006-04-15 00:00:00

  • A new and improved confidence interval for the Mantel-Haenszel risk difference.

    abstract::Writing the variance of the Mantel-Haenszel estimator under the null of homogeneity and inverting the corresponding test, we arrive at an improved confidence interval for the common risk difference in stratified 2 × 2 tables. This interval outperforms a variety of other intervals currently recommended in the literatur...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6122

    authors: Klingenberg B

    update_date:2014-07-30 00:00:00

  • A general approach to evaluating the bias of 2-stage instrumental variable estimators.

    abstract::Unmeasured confounding is a common concern when researchers attempt to estimate a treatment effect using observational data or randomized studies with nonperfect compliance. To address this concern, instrumental variable methods, such as 2-stage predictor substitution (2SPS) and 2-stage residual inclusion (2SRI), have...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7636

    authors: Wan F,Small D,Mitra N

    update_date:2018-05-30 00:00:00

  • Modelling the association between patient characteristics and the change over time in a disease measure using observational cohort data.

    abstract::In observational cohort studies we may wish to examine the associations between fixed patient characteristics and the longitudinal changes from baseline in a repeated outcome measure. Many biological and other outcome measures are known to be subject to measurement error and biological variation. In an initial analysi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3725

    authors: Harrison L,Dunn DT,Green H,Copas AJ

    update_date:2009-11-20 00:00:00

  • Comparison of tests for categorical data from a stratified cluster randomized trial.

    abstract::Two features commonly exhibited by randomized trials of health promotion interventions are cluster randomization and stratification. Ignoring correlations between individuals within clusters can lead to an inflated type I error rate and hence a P-value which overstates the significance of the result. This paper compar...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1256

    authors: Dobbins TA,Simpson JM

    update_date:2002-12-30 00:00:00

  • Local influence measure of zero-inflated generalized Poisson mixture regression models.

    abstract::In many practical applications, count data often exhibit greater or less variability than allowed by the equality of mean and variance, referred to as overdispersion/underdispersion, and there are several reasons that may lead to the overdispersion/underdispersion such as zero inflation and mixture. Moreover, if the c...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5560

    authors: Chen XD,Fu YZ,Wang XR

    update_date:2013-04-15 00:00:00

  • Evaluation of surrogate endpoints in randomized experiments with mixed discrete and continuous outcomes.

    abstract::A statistical definition of surrogate endpoints as well as validation criteria was first presented by Prentice. Freedman et al. supplemented these criteria with the so-called proportion explained. Buyse and Molenberghs pointed to inadequacies of these criteria and suggested a new definition of surrogacy based on (i) t...

    journal_title:Statistics in medicine

    pub_type: Comment, Journal Article

    doi:10.1002/sim.923

    authors: Molenberghs G,Geys H,Buyse M

    update_date:2001-10-30 00:00:00

  • Second-stage least squares versus penalized quasi-likelihood for fitting hierarchical models in epidemiologic analyses.

    abstract::Hierarchical regression analysis holds much promise for epidemiologic analysis, but has as yet seen limited application because of lack of easily used software and the relatively lengthy run times of preferred fitting methods (such as true maximum likelihood and Bayesian approaches). This paper compares three relative...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19970315)16:5<515::aid-sim

    authors: Greenland S

    update_date:1997-03-15 00:00:00

  • Construction of group sequential designs in clinical trials on the basis of detectable treatment differences.

    abstract::The treatment effect sizes that can be detected with sufficient power up to the different interim analyses constitute a clinically meaningful criterion for the selection of a group sequential test for a clinical trial. For any pre-specified sequence of effect sizes, it is possible to construct group sequential boundar...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1751

    authors: Schäfer H,Müller HH

    update_date:2004-05-15 00:00:00

  • Emerging and recurrent issues in drug development.

    abstract::This paper reviews several emerging and recurrent issues relating to the drug development process. These emerging issues include changes to the FDA regulatory environment, internationalization of drug development, advances in computer technology and visualization tools, and efforts to incorporate meta-analysis methodo...

    journal_title:Statistics in medicine

    pub_type: Journal Article, Review

    doi:10.1002/(sici)1097-0258(19990915/30)18:17/18<2301:

    authors: Anello C

    update_date:1999-09-15 00:00:00

  • Logistic regression with incompletely observed categorical covariates--investigating the sensitivity against violation of the missing at random assumption.

    abstract::Missing values in the covariates are a widespread complication in the statistical inference of regression models. The maximum likelihood principle requires specification of the distribution of the covariates, at least in part. For categorical covariates, log-linear models can be used. Additionally, the missing at rand...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780141205

    authors: Vach W,Blettner M

    update_date:1995-06-30 00:00:00

  • On the estimation of total variability in assay validation.

    abstract::In the pharmaceutical industry, an assay method is considered validated if the accuracy and precision for an assay meet some acceptable limits. This paper discusses the assessment of assay precision in terms of the estimation of total variability of an assay from a one-way random effects model which is often considere...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780101006

    authors: Chow SC,Tse SK

    update_date:1991-10-01 00:00:00

  • Drug treatment of mild hypertension to reduce the risk of CHD: is it worth-while?

    abstract::Although hypertension is regarded as a causal factor for coronary heart disease (CHD) a reduction in the risk of CHD as a result of lowering blood pressure in mild hypertension could not be demonstrated. This conclusion is based on an overview analysis of all published randomized trials in mild hypertension, including...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780071104

    authors: Holme I

    update_date:1988-11-01 00:00:00

  • A community trial strategy for evaluating treatment for symptomatic conditions.

    abstract::A method has been developed for simultaneously comparing the usefulness of many treatments of established value for symptomatic medical conditions. Medical assessment of outcome is not employed. Instead patients are required to assess treatments prescribed during the course of ordinary general practice rather than und...

    journal_title:Statistics in medicine

    pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial

    doi:10.1002/sim.4780040104

    authors: Charlton JR,D'Souza MF,Tooley M,Silver R

    update_date:1985-01-01 00:00:00

  • Historical and methodological developments in clinical trials at the National Cancer Institute.

    abstract::The first randomized clinical trial at the National Cancer Institute (NCI), planned in 1954, commenced in 1955 for the treatment of patients with acute leukaemia. The programme in clinical trials at NCI had strong influence from the clinician and administrator, C. Gordon Zubrod, who introduced the randomized clinical ...

    journal_title:Statistics in medicine

    pub_type: Clinical Trial, Historical Article, Journal Article, Randomized Controlled Trial

    doi:10.1002/sim.4780090803

    authors: Gehan EA,Schneiderman MA

    update_date:1990-08-01 00:00:00

  • A comparison of group sequential methods for binary longitudinal data.

    abstract::Interim analyses are conducted to allow for early termination of the trial, for ethical as well as economical reasons. Here we consider interim analyses in repeated measurements studies where the measurements are binary. Two methods for analysing this kind of data are compared according to their operating characterist...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1361

    authors: Spiessens B,Lesaffre E,Verbeke G

    update_date:2003-02-28 00:00:00

  • Designs for phase I trials in ordered groups.

    abstract::We propose a new design for dose finding for cytotoxic agents in two ordered groups of patients. By ordered groups, we mean that prior to the study there is clinical information that would indicate that for a given dose one group would be more susceptible to toxicities than patients in the other group. The designs are...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7133

    authors: Conaway MR,Wages NA

    update_date:2017-01-30 00:00:00

  • Model-based multiplicity estimation of population size.

    abstract::A survey is conducted at w of K selection units or lists, e.g. health care institutions or weeks in a year, to estimate N, the total number of individuals with particular characteristics. Our estimator utilizes two items determined for each survey participant: the number, u, among the w lists in S and the number, j, a...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3614

    authors: Laska EM,Meisner M,Wanderling J

    update_date:2009-07-30 00:00:00