Abstract:
:Misclassification introduces errors in categorical variables. This paper presents a review of methods for misclassified categorical data in epidemiology. Different sampling schemes for a 2 x 2 x 2 table and methods of analyses will be discussed first. A misclassification matrix is defined, and the usual misclassification models will be shown to be a subclass of log-linear models. Well-known results on a 2 x 2 table with misclassification and recent results on a 2 x 2 x 2 table are then reviewed. Finally two methods of adjusting for misclassification will be given. The first method assumes a known misclassification matrix, and the second method uses subsampling to estimate the misclassification matrix. The analysis is based on a recursive system of log-linear models: first determine a misclassification model, then select a model for the correctly classified variables. The methods are illustrated by data from traffic safety research on the effectiveness of seatbelt use in reducing injuries.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
Chen TTdoi
10.1002/sim.4780080908subject
Has Abstractpub_date
1989-09-01 00:00:00pages
1095-106; discussion 1107-8issue
9eissn
0277-6715issn
1097-0258journal_volume
8pub_type
杂志文章,评审abstract::We propose a new weighted hurdle regression method for modeling count data, with particular interest in modeling cardiovascular events in patients on dialysis. Cardiovascular disease remains one of the leading causes of hospitalization and death in this population. Our aim is to jointly model the relationship/associat...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6232
更新日期:2014-11-10 00:00:00
abstract::The case-control study is a simple and an useful method to characterize the effect of a gene, the effect of an exposure, as well as the interaction between the two. The control-free case-only study is yet an even simpler design, if interest is centered on gene-environment interaction only. It requires the sometimes pl...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4028
更新日期:2010-10-30 00:00:00
abstract::In medical and health studies, heterogeneities in clustered count data have been traditionally modeled by positive random effects in Poisson mixed models; however, excessive zeros often occur in clustered medical and health count data. In this paper, we consider a three-level random effects zero-inflated Poisson model...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3619
更新日期:2009-08-15 00:00:00
abstract::The analysis and recognition of disease clustering in space and its representation on a map is an important problem in epidemiology. An approach using mixture models to identify spatial heterogeneity in disease risk and map construction within an empirical Bayes framework is described. Once heterogeneity is detected, ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19960415)15:7/9<919::aid-s
更新日期:1996-04-15 00:00:00
abstract::Methods for multiple informants help to estimate the marginal effect of each multiple source predictor and formally compare the strength of their association with an outcome. We extend multiple informant methods to the case of hierarchical data structures to account for within cluster correlation. We apply the propose...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5967
更新日期:2014-02-20 00:00:00
abstract::Generalized linear models are often assumed to fit propensity scores, which are used to compute inverse probability weighted (IPW) estimators. To derive the asymptotic properties of IPW estimators, the propensity score is supposed to be bounded away from zero. This condition is known in the literature as strict positi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7827
更新日期:2018-10-30 00:00:00
abstract::When estimating the probability of natural conception from observational data on couples with an unfulfilled child wish, the start of assisted reproductive therapy (ART) is a competing event that cannot be assumed to be independent of natural conception. In clinical practice, interest lies in the probability of natura...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6280
更新日期:2014-11-20 00:00:00
abstract::Although there are many models which are used to calculate the health benefits (and thus the cost-effectiveness) of vaccination programmes, they can be divided into two groups: those which assume a constant force of infection, that is a constant per-susceptible rate of infection; and those which assume that the force ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19991215)18:23<3263::aid-s
更新日期:1999-12-15 00:00:00
abstract::In conventional survival analysis there is an underlying assumption that all study subjects are susceptible to the event. In general, this assumption does not adequately hold when investigating the time to an event other than death. Owing to genetic and/or environmental etiology, study subjects may not be susceptible ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5845
更新日期:2013-10-30 00:00:00
abstract::In biomedical studies and clinical trials, repeated measures are often subject to some upper and/or lower limits of detection. Hence, the responses are either left or right censored. A complication arises when more than one series of responses is repeatedly collected on each subject at irregular intervals over a perio...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8017
更新日期:2019-03-15 00:00:00
abstract::In a longitudinal study of workers in seven Norwegian aluminium plants, the time to development of asthmatic symptoms could only be determined to lie in the interval between two consecutive health examinations. In a previous paper we analysed the data by survival techniques for interval censored data. In the present p...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780131708
更新日期:1994-09-15 00:00:00
abstract::This paper examines several methods for deriving standardized morbidity ratios (SMR) and attributable fraction (attributable risk percentage) estimates. We show that some of the proposed methods will, in general, produce biased estimators, although the low variance of certain estimators sometimes compensates for their...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780030206
更新日期:1984-04-01 00:00:00
abstract::The problem of evaluating an averaged functional magnetic resonance imaging (fMRI) response for repeated block design experiments was considered within a semiparametric regression model with autocorrelated residuals. We applied functional data analysis (FDA) techniques that use a least-squares fitting of B-spline expa...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2981
更新日期:2007-09-20 00:00:00
abstract::The general linear mixed model provides a useful approach for analysing a wide variety of data structures which practising statisticians often encounter. Two such data structures which can be problematic to analyse are unbalanced repeated measures data and longitudinal data. Owing to recent advances in methods and sof...
journal_title:Statistics in medicine
pub_type: 杂志文章,评审
doi:10.1002/(sici)1097-0258(19971030)16:20<2349::aid-s
更新日期:1997-10-30 00:00:00
abstract::Existing methods for power analysis for longitudinal study designs are limited in that they do not adequately address random missing data patterns. Although the pattern of missing data can be assessed during data analysis, it is unknown during the design phase of a study. The random nature of the missing data pattern ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2773
更新日期:2007-07-10 00:00:00
abstract::In a medical study we are often interested in graphically displaying the relationship between continuous variables and clinical events indicating disease progression. Often, it is reasonable to make the minimal assumption that the risk of progression is an arbitrary monotone function of the continuous variable. Someti...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1561
更新日期:2003-10-30 00:00:00
abstract::In clinical research, we are often interested in assessing how a biomarker changes with time, and whether it could be used as a surrogate marker when evaluating the efficacy of a new drug. However, when the longitudinal marker is correlated with survival, linear mixed models for longitudinal data may be inappropriate....
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3142
更新日期:2007-12-30 00:00:00
abstract::Although the frequentist paradigm has been the predominant approach to clinical trial design since the 1940s, it has several notable limitations. Advancements in computational algorithms and computer hardware have greatly enhanced the alternative Bayesian paradigm. Compared with its frequentist counterpart, the Bayesi...
journal_title:Statistics in medicine
pub_type: 杂志文章,评审
doi:10.1002/sim.5404
更新日期:2012-11-10 00:00:00
abstract::Longitudinal studies are commonly used to study processes of change. Because data are collected over time, missing data are pervasive in longitudinal studies, and complete ascertainment of all variables is rare. In this paper a new imputation strategy for completing longitudinal data sets is proposed. The proposed met...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.740
更新日期:2001-09-15 00:00:00
abstract::Shared random effects models have been increasingly common in the joint analyses of repeated measures (e.g. CD4 counts, hemoglobin levels) and a correlated failure time such as death. In this paper we study several shared random effects models in the multi-level repeated measures data setting with dependent failure ti...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3392
更新日期:2008-11-29 00:00:00
abstract::The publication of Fisher's correspondence on statistics has shed new light on his views on randomization. Quotations from this correspondence and from other works of Fisher are used to illustrate the role of randomization in clinical trials. It is concluded that Fisher's views not only are coherent but, despite havin...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780130305
更新日期:1994-02-15 00:00:00
abstract::We present some practical extensions and applications of a strategy proposed by Thall, Simon and Estey for designing and monitoring single-arm clinical trials with multiple outcomes. We show by application how the strategy may be applied to construct designs for phase IIA activity trials and phase II equivalence trial...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19980730)17:14<1563::aid-s
更新日期:1998-07-30 00:00:00
abstract::With considerable current interest in longitudinal epidemiologic studies, little is available regarding sample size requirements. This paper considers a method for analysis of longitudinal data, where one compares the mean rates of change for two or more groups, and proposes a statistic for use in determining sample s...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780090414
更新日期:1990-04-01 00:00:00
abstract::Estimation of genetic and environmental contributions to cancers falls in the framework of generalized linear mixed modelling with several random effect components. Computational challenges remain, however, in dealing with binary or survival phenotypes. In this paper, we consider the analysis of melanoma onset in a po...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2266
更新日期:2006-09-30 00:00:00
abstract::Multi-state models are useful for modelling disease progression where the state space of the process is used to represent the discrete disease status of subjects. Often, the disease process is only observed at clinical visits, and the schedule of these visits can depend on the disease status of patients. In such situa...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6582
更新日期:2015-12-20 00:00:00
abstract::This paper describes compliance with the completion of a quality of life questionnaire in the Breast Cancer Prevention Trial, a large multi-centre randomized trial that is studying the efficacy of Tamoxifen in preventing breast cancer. In the first 4875 women enrolled in the control arm of the study, there was a very ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19980315/15)17:5/7<613::ai
更新日期:1998-03-15 00:00:00
abstract::Much has been published on various aspects of data analysis and reporting from clinical trials within the biopharmaceutical environment. This ranges from regulatory guidelines on the format and content of registration dossiers to recommendations on data presentation and the statistical methodologies that are appropria...
journal_title:Statistics in medicine
pub_type: 杂志文章,评审
doi:10.1002/(sici)1097-0258(19980815/30)17:15/16<1829:
更新日期:1998-08-15 00:00:00
abstract::Although prostate cancer and benign prostatic hyperplasia are major health problems in U.S. men, little is known about the early stages of the natural history of prostate disease. A molecular biomarker called prostate specific antigen (PSA), together with a unique longitudinal bank of frozen serum, now allows a histor...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780130520
更新日期:1994-03-15 00:00:00
abstract::The use of outcome-dependent sampling with longitudinal data analysis has previously been shown to improve efficiency in the estimation of regression parameters. The motivating scenario is when outcome data exist for all cohort members but key exposure variables will be gathered only on a subset. Inference with outcom...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7633
更新日期:2018-06-15 00:00:00
abstract::We describe how trends in the vaccination coverage at 19 months of age vary by race/ethnicity; explore the extent to which data required to evaluate a child's up-to-date vaccination status is missing as a result of the scattering of vaccination records among many vaccination providers; evaluate how the prevalence of t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3223
更新日期:2008-09-10 00:00:00