Efficient semiparametric inference for two-phase studies with outcome and covariate measurement errors.

Abstract:

In modern observational studies using electronic health records or other routinely collected data, both the outcome and covariates of interest can be error-prone and their errors often correlated. A cost-effective solution is the two-phase design, under which the error-prone outcome and covariates are observed for all subjects during the first phase and that information is used to select a validation subsample for accurate measurements of these variables in the second phase. Previous research on two-phase measurement error problems largely focused on scenarios where there are errors in covariates only or the validation sample is a simple random sample of study subjects. Herein, we propose a semiparametric approach to general two-phase measurement error problems with a quantitative outcome, allowing for correlated errors in the outcome and covariates and arbitrary second-phase selection. We devise a computationally efficient and numerically stable expectation-maximization algorithm to maximize the nonparametric likelihood function. The resulting estimators possess desired statistical properties. We demonstrate the superiority of the proposed methods over existing approaches through extensive simulation studies, and we illustrate their use in an observational HIV study.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Tao R,Lotspeich SC,Amorim G,Shaw PA,Shepherd BE

doi

10.1002/sim.8799

subject

Has Abstract

pub_date

2021-02-10 00:00:00

pages

725-738

issue

3

eissn

1097-0258

issn

0277-6715

journal_volume

40

pub_type

Journal Article
  • Analytical, practical and regulatory issues in prevention studies.

    abstract::Prevention studies, as distinguished from studies investigating treatments for established disease, present some distinct challenges. Perhaps the most extensive experience with preventive agents is in the area of infectious diseases; vaccines have been extremely effective in preventing many such diseases. Vaccines hav...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1717

    authors: Ellenberg SS

    update_date:2004-01-30 00:00:00

  • Nowcasting influenza epidemics using non-homogeneous hidden Markov models.

    abstract::Timeliness of a public health surveillance system is one of its most important characteristics. The process of predicting the present situation using available incomplete information from surveillance systems has received the term nowcasting and has high public health interest. Generally in Europe, general practitione...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5670

    authors: Nunes B,Natário I,Lucília Carvalho M

    update_date:2013-07-10 00:00:00

  • The use of an extended baseline period in the evaluation of treatment in a longitudinal Duchenne muscular dystrophy trial.

    abstract::A trial of Duchenne muscular dystrophy involved tracking boys of all ages through a one-year baseline period, followed by a one-year trial of leucine versus placebo treatment. In this paper we develop a model for a total-muscle-strength score that uses the data of the extended baseline period in the evaluation of the ...

    journal_title:Statistics in medicine

    pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial

    doi:10.1002/sim.4780050304

    authors: Madsen KS,Miller JP,Province MA

    update_date:1986-05-01 00:00:00

  • Nonparametric sequential evaluation of diagnostic biomarkers.

    abstract::We consider evaluation and comparison of the diagnostic accuracy of biomarkers with continuous test outcomes, possibly correlated due to repeated measurements. We develop nonparametric group sequential testing procedures to evaluate and compare the area of biomarkers under their receiver operating characteristic curve...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3203

    authors: Liu A,Wu C,Schisterman EF

    update_date:2008-05-10 00:00:00

  • The power of focused tests to detect disease clustering.

    abstract::Statistical tests have been proposed for determining whether incident cases of adverse health effects are 'clustered' together. Several procedures, termed 'focused', specifically analyse disease surveillance data around pre-specified putative sources of environmental hazard. Little has been done to compare the perform...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780142103

    authors: Waller LA,Lawson AB

    update_date:1995-11-15 00:00:00

  • Ordinal invariant measures for individual and group changes in ordered categorical data.

    abstract::Subjective judgements of complex variables are commonly recorded as ordered categorical data. The rank-invariant properties of such data are well known, and there are various statistical approaches to the analysis and modelling of ordinal data. This paper focuses on the non-additive property of ordered categorical dat...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19981230)17:24<2923::aid-s

    authors: Svensson E

    update_date:1998-12-30 00:00:00

  • Designs for phase I trials in ordered groups.

    abstract::We propose a new design for dose finding for cytotoxic agents in two ordered groups of patients. By ordered groups, we mean that prior to the study there is clinical information that would indicate that for a given dose one group would be more susceptible to toxicities than patients in the other group. The designs are...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7133

    authors: Conaway MR,Wages NA

    update_date:2017-01-30 00:00:00

  • Monitoring potential adverse event rate differences using data from blinded trials: the canary in the coal mine.

    abstract::The development of drugs and biologicals whose mechanisms of action may extend beyond their target indications has led to a need to identify unexpected potential toxicities promptly even while blinded clinical trials are under way. One component of recently issued FDA rules regarding safety reporting requirements rais...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7129

    authors: Gould AL,Wang WB

    update_date:2017-01-15 00:00:00

  • Comparison of operational characteristics for binary tests with clustered data.

    abstract::Although statistical methodology is well-developed for comparing diagnostic tests in terms of their sensitivity and specificity, comparative inference about predictive values is not. In this paper, we consider the analysis of studies comparing operating characteristics of two diagnostic tests that are measured on all ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6485

    authors: Kwak M,Um SW,Jung SH

    update_date:2015-07-10 00:00:00

  • A practical approach for the assessment of bioequivalence under selected higher-order cross-over designs.

    abstract::The two-period cross-over design with two sequences of drug administration is a standard experimental design when bioequivalence of one test formulation is to be assessed in comparison with a reference formulation. Previously, an approach based on Fieller's confidence interval has been presented for the assessment of ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19971015)16:19<2229::aid-s

    authors: Vuorinen J

    update_date:1997-10-15 00:00:00

  • Disease clusters, exact distributions of maxima, and P-values.

    abstract::This paper presents combinatorial (exact) methods that are useful in the analysis of disease cluster data obtained from small environments, such as buildings and neighbourhoods. Maxwell-Boltzmann and Fermi-Dirac occupancy models are compared in terms of appropriateness of representation of disease incidence patterns (...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780121906

    authors: Grimson RC

    update_date:1993-10-01 00:00:00

  • Determining the size of a cross-sectional sample to estimate the age-specific incidence of an irreversible disease.

    abstract::The design of a cross-sectional survey to estimate the age-specific incidence of an irreversible disease is considered, where the incidence rate is not changing over time and the risk of mortality is not affected by the onset of disease. The sample is assumed to give information on the current age and disease status o...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780132208

    authors: Marschner IC

    update_date:1994-11-30 00:00:00

  • Analysis of the ratio of marginal probabilities in a matched-pair setting.

    abstract::Statistical methods for testing and interval estimation of the ratio of marginal probabilities in the matched-pair setting are considered in this paper. We are especially interested in the situation where the null value is not one, as in one-sided equivalence trials. We propose a Fieller-type statistic based on constr...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1017

    authors: Nam JM,Blackwelder WC

    update_date:2002-03-15 00:00:00

  • Violations of the independent increment assumption when using generalized estimating equation in longitudinal group sequential trials.

    abstract::In phase 3 clinical trials, ethical and financial concerns motivate sequential analyses in which the data are analyzed prior to completion of the entire planned study. Existing group sequential software accounts for the effects of these interim analyses on the sampling density by assuming that the contribution of subs...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6306

    authors: Shoben AB,Emerson SS

    update_date:2014-12-20 00:00:00

  • Statistical inferences of extended concentration indices for directly standardized rates.

    abstract::The relative concentration index (RCI) and the absolute concentration index (ACI) have been widely used for monitoring health disparities with ranked health determinants. The RCI has been extended to allow value judgments about inequality aversion by Pereira in 1998 and by Wagstaff in 2002. Previous studies of the ext...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7952

    authors: Yu M,Liu B,Li Y,Zou ZJ,Breen N

    update_date:2019-01-15 00:00:00

  • Random-effects meta-analysis of the clinical utility of tests and prediction models.

    abstract::The use of data from multiple studies or centers for the validation of a clinical test or a multivariable prediction model allows researchers to investigate the test's/model's performance in multiple settings and populations. Recently, meta-analytic techniques have been proposed to summarize discrimination and calibra...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7653

    authors: Wynants L,Riley RD,Timmerman D,Van Calster B

    update_date:2018-05-30 00:00:00

  • A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates.

    abstract::In behavioral, biomedical, and social-psychological sciences, it is common to encounter latent variables and heterogeneous data. Mixture structural equation models (SEMs) are very useful methods to analyze these kinds of data. Moreover, the presence of missing data, including both missing responses and missing covaria...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3915

    authors: Cai JH,Song XY,Hser YI

    update_date:2010-08-15 00:00:00

  • Discovering early diabetic neuropathy from epidermal nerve fiber patterns.

    abstract::Epidermal nerve fibre (ENF) density and morphology are used to study small fibre involvement in diabetic, HIV, chemotherapy induced and other neuropathies. ENF density and summed length of ENFs per epidermal surface area are reduced, and ENFs may appear more clustered within the epidermis in subjects with small fibre ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7009

    authors: Andersson C,Guttorp P,Särkkä A

    update_date:2016-10-30 00:00:00

  • Constructing time-specific reference ranges.

    abstract::Reference ranges which take time (such as age) into account are often required in medicine, but simple, systematic and efficient statistical methods for constructing them are lacking. A method is described which is based on low order polynomial curves (linear, quadratic or occasionally cubic), together with guidelines...

    journal_title:Statistics in medicine

    pub_type: Clinical Trial, Journal Article, Multicenter Study

    doi:10.1002/sim.4780100502

    authors: Royston P

    update_date:1991-05-01 00:00:00

  • Flexible multistate models for interval-censored data: Specification, estimation, and an application to ageing research.

    abstract::Continuous-time multistate survival models can be used to describe health-related processes over time. In the presence of interval-censored times for transitions between the living states, the likelihood is constructed using transition probabilities. Models can be specified using parametric or semiparametric shapes fo...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7604

    authors: Machado RJM,van den Hout A

    update_date:2018-05-10 00:00:00

  • Sample sizes for constructing confidence intervals and testing hypotheses.

    abstract::Although estimation and confidence intervals have become popular alternatives to hypothesis testing and p-values, statisticians usually determine sample sizes for randomized clinical trials by controlling the power of a statistical test at an appropriate alternative, even those statisticians who recommend the use of c...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780080705

    authors: Bristol DR

    update_date:1989-07-01 00:00:00

  • Seasonal and other short-term influences on United States AIDS incidence.

    abstract::This paper models monthly AIDS diagnosis counts in terms of smooth secular trend, calendar month effects, and the number of workdays per month. A parameterization of month effects allows separation of true seasonal effects from a linear trend over the calendar year and an arbitrary June effect. There is strong evidenc...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780131905

    authors: Bacchetti P

    update_date:1994-10-15 00:00:00

  • A semi-parametric regression analysis of CD4 cell counts.

    abstract::This paper considers the regression analysis of CD4 cell counts, a commonly used indicator and prognostic factor of AIDS progression. For this purpose, a number of methods have been proposed and most of them are based on random effects models. We present an alternative that is based on a mean function regression model...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.945

    authors: Sun J

    update_date:2001-11-15 00:00:00

  • Modelling the 1985 influenza epidemic in France.

    abstract::The Rvachev-Baroyan-Longini model is a space-time predictive model of the spread of influenza epidemics. It has been applied to 128 cities of the USSR, and more recently, to forecasting the spread of the pandemic of 1968-1969 throughout 52 large cities. It is a deterministic, mass-action, space and time continuous mod...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780071107

    authors: Flahault A,Letrait S,Blin P,Hazout S,Ménarés J,Valleron AJ

    update_date:1988-11-01 00:00:00

  • Bayesian random effects meta-analysis of trials with binary outcomes: methods for the absolute risk difference and relative risk scales.

    abstract::In a recent Statistics in Medicine paper, Warn, Thompson and Spiegelhalter (WTS) made a comparison between the Bayesian approach to the meta-analysis of binary outcomes and a popular Classical approach that uses summary (two-stage) techniques. They included approximate summary (two-stage) Bayesian techniques in their ...

    journal_title:Statistics in medicine

    pub_type: Comment, Letter

    doi:10.1002/sim.2115

    authors: O'Rourke K,Altman DG

    update_date:2005-09-15 00:00:00

  • Measurement error in continuous endpoints in randomised trials: Problems and solutions.

    abstract::In randomised trials, continuous endpoints are often measured with some degree of error. This study explores the impact of ignoring measurement error and proposes methods to improve statistical inference in the presence of measurement error. Three main types of measurement error in continuous endpoints are considered:...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8359

    authors: Nab L,Groenwold RHH,Welsing PMJ,van Smeden M

    update_date:2019-11-30 00:00:00

  • Likelihood-based analysis of outcome-dependent sampling designs with longitudinal data.

    abstract::The use of outcome-dependent sampling with longitudinal data analysis has previously been shown to improve efficiency in the estimation of regression parameters. The motivating scenario is when outcome data exist for all cohort members but key exposure variables will be gathered only on a subset. Inference with outcom...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7633

    authors: Zelnick LR,Schildcrout JS,Heagerty PJ

    update_date:2018-06-15 00:00:00

  • A joint modeling and estimation method for multivariate longitudinal data with mixed types of responses to analyze physical activity data generated by accelerometers.

    abstract::A mixed effect model is proposed to jointly analyze multivariate longitudinal data with continuous, proportion, count, and binary responses. The association of the variables is modeled through the correlation of random effects. We use a quasi-likelihood type approximation for nonlinear variables and transform the prop...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7401

    authors: Li H,Zhang Y,Carroll RJ,Keadle SK,Sampson JN,Matthews CE

    update_date:2017-11-10 00:00:00

  • Identifying representative trees from ensembles.

    abstract::Tree-based methods have become popular for analyzing complex data structures where the primary goal is risk stratification of patients. Ensemble techniques improve the accuracy in prediction and address the instability in a single tree by growing an ensemble of trees and aggregating. However, in the process, individua...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4492

    authors: Banerjee M,Ding Y,Noone AM

    update_date:2012-07-10 00:00:00

  • Multi-state models for colon cancer recurrence and death with a cured fraction.

    abstract::In cancer clinical trials, patients often experience a recurrence of disease prior to the outcome of interest, overall survival. Additionally, for many cancers, there is a cured fraction of the population who will never experience a recurrence. There is often interest in how different covariates affect the probability...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6056

    authors: Conlon AS,Taylor JM,Sargent DJ

    update_date:2014-05-10 00:00:00