Abstract:
In many practical applications, count data exhibit more or less variability than the equality of mean and variance allows, a phenomenon known as overdispersion or underdispersion. Several mechanisms can produce it, including zero inflation and mixture. It also arises when the counts follow a generalized Poisson or negative binomial distribution, which accommodates extra variation not explained by a simple Poisson or binomial model. In this paper, we consider a class of two-component zero-inflated generalized Poisson mixture regression models for such data and propose a local influence procedure for model comparison and statistical diagnostics. We first develop a general model framework that unifies zero inflation, mixture, and overdispersion/underdispersion. We then investigate two types of perturbation schemes, global and individual, for perturbing various model assumptions and detecting influential observations, and we obtain the corresponding local influence measures. Our method is novel for count data analysis and can be used to explore essential issues such as zero inflation, mixture, and dispersion in zero-inflated generalized Poisson mixture models. Based on the model comparison results, the sensitivity analysis of perturbations and hypothesis tests can then be conducted more accurately. Finally, we use a simulation study and a real example to illustrate the proposed local influence measures.
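As a rough illustration of how zero inflation, mixture, and dispersion combine in one count model, the sketch below evaluates a two-component zero-inflated generalized Poisson mixture in Python. The abstract does not state the paper's parameterization, so the Consul-Jain form of the generalized Poisson, the restriction 0 <= lam < 1 (equidispersion or overdispersion only), and all function names and parameter values are assumptions made here. Regression links for the parameters and the local influence measures themselves are not covered.

```python
import math


def gp_pmf(y, theta, lam):
    """Generalized Poisson pmf in the Consul-Jain form (an assumed parameterization).

    Requires theta > 0 and 0 <= lam < 1; lam = 0 reduces to the ordinary Poisson.
    Mean is theta / (1 - lam) and variance is theta / (1 - lam) ** 3.
    """
    log_p = (math.log(theta) + (y - 1) * math.log(theta + lam * y)
             - theta - lam * y - math.lgamma(y + 1))
    return math.exp(log_p)


def zigp_pmf(y, phi, theta, lam):
    """Zero-inflated generalized Poisson: extra point mass phi at zero."""
    base = (1.0 - phi) * gp_pmf(y, theta, lam)
    return phi + base if y == 0 else base


def zigp_mixture_pmf(y, weight, comp1, comp2):
    """Two-component mixture; each component is a (phi, theta, lam) tuple."""
    return (weight * zigp_pmf(y, *comp1)
            + (1.0 - weight) * zigp_pmf(y, *comp2))


def moments(pmf, upper=400):
    """Total mass, mean, and variance from summing the pmf over 0..upper-1."""
    probs = [pmf(y) for y in range(upper)]
    total = sum(probs)
    mean = sum(y * p for y, p in enumerate(probs))
    var = sum((y - mean) ** 2 * p for y, p in enumerate(probs))
    return total, mean, var


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only for illustration.
    comp1 = (0.3, 1.5, 0.2)   # (phi, theta, lam)
    comp2 = (0.1, 6.0, 0.4)
    total, mean, var = moments(lambda y: zigp_mixture_pmf(y, 0.6, comp1, comp2))
    print(f"mass={total:.6f} mean={mean:.4f} variance={var:.4f}")
```

With these assumed settings, the variance exceeds the mean, which is the overdispersion the abstract attributes to zero inflation, mixture, and the generalized Poisson component acting together.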
journal_name: Stat Med
journal_title: Statistics in medicine
authors: Chen XD, Fu YZ, Wang XR
doi: 10.1002/sim.5560
subject: Has Abstract
pub_date: 2013-04-15 00:00:00
pages: 1294-312
issue: 8
eissn: 0277-6715
issn: 1097-0258
journal_volume: 32
pub_type: Journal Article
abstract::Correlated data are obtained in longitudinal epidemiological studies, where repeated measurements are taken on individuals or groups over time. Such longitudinal data are ideally analyzed using multilevel modeling approaches, which appropriately account for the correlations in repeated responses in the same individual...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.5880
update_date:2013-12-10 00:00:00
abstract::Data monitoring committees (DMCs) have become an increasingly common component of randomized clinical trials in recent years. As experience has accumulated, and more individuals and organizations have become involved in such activities, a variety of approaches to the operation of such committees has inevitably arisen....
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.730
update_date:2001-09-15 00:00:00
abstract::The concept of broad sense agreement (BSA) has recently been proposed for studying the relationship between a continuous measurement and an ordinal measurement. They developed a nonparametric procedure for estimating the BSA index, which is only applicable to completely observed data. In this work, we consider the pro...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.8523
update_date:2020-06-30 00:00:00
abstract::Diseases can be interconnected. In the recent years, there has been a surge of multidisease studies. Among them, HDN (human disease network) analysis takes a system perspective, examines the interconnections among diseases along with their individual properties, and has demonstrated great potential. Most of the existi...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.8472
update_date:2020-04-30 00:00:00
abstract::Two correction methods are considered for multiple logistic regression models with some covariates measured with error. Both methods are based on approximating the complicated regression model between the response and the observed covariates with simpler models. The first model is the logistic approximation proposed b...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780131105
update_date:1994-06-15 00:00:00
abstract::We have developed a method to longitudinally classify subjects into two or more prognostic groups using longitudinally observed values of markers related to the prognosis. We assume the availability of a training data set where the subjects' allocation into the prognostic group is known. The proposed method proceeds i...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.3849
update_date:2010-12-30 00:00:00
abstract::In longitudinal studies, missing observations occur commonly. It has been well known that biased results could be produced if missingness is not properly handled in the analysis. Authors have developed many methods with the focus on either incomplete response or missing covariate observations, but rarely on both. The ...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.5536
update_date:2013-02-28 00:00:00
abstract::Some clinical trials aim to demonstrate therapeutic equivalence on multiple primary endpoints. For example, therapeutic equivalence studies of agents for the treatment of osteoarthritis use several primary endpoints including investigator's global assessment of disease activity, patient's global assessment of response...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.985
update_date:2001-11-15 00:00:00
abstract::Many extensions of survival models based on the Cox proportional hazards approach have been proposed to handle clustered or multiple event data. Of particular note are five Cox-based models for recurrent event data: Andersen and Gill (AG); Wei, Lin and Weissfeld (WLW); Prentice, Williams and Peterson, total time (PWP-...
journal_title:Statistics in medicine
pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial
doi:10.1002/(sici)1097-0258(20000115)19:1<13::aid-sim2
update_date:2000-01-15 00:00:00
abstract::The setting of a quarantine time for an emerging infectious disease will depend on current knowledge concerning incubation times. Methods for the analysis of information on incubation times are investigated with a particular focus on inference regarding a possible maximum incubation time, after which an exposed indivi...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.2206
update_date:2005-11-30 00:00:00
abstract::Estimates of transmission probabilities for sexually transmitted diseases historically come from studies of uninfected individuals exposed to those with a high disease prevalence (for example, prostitutes). However, changes in sexual behaviour, much of which relates to concerns about AIDS, have made identification of p...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780110502
update_date:1992-03-01 00:00:00
abstract::The analysis of multivariate time-to-event (TTE) data can become complicated due to the presence of clustering, leading to dependence between multiple event times. For a long time, (conditional) frailty models and (marginal) copula models have been used to analyze clustered TTE data. In this article, we propose a gene...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.8506
update_date:2020-05-30 00:00:00
abstract::Mediation analyses can help us to understand the biological mechanism in which an exposure or treatment affects an outcome. Single mediator analyses have been used in various applications, but may not be appropriate for analyzing intricate mechanisms involving multiple mediators that affect each other. Thus, in this a...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.8485
update_date:2020-05-15 00:00:00
abstract::Recent studies have indicated variation in the infectivity beta of HIV among heterosexual couples. We represent this heterogeneity by modelling beta as a random variable. Using data on the number of contacts and seroconversion of couples, we fit the model by maximum-likelihood estimation with a beta distribution and a...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780080110
update_date:1989-01-01 00:00:00
abstract::Binary classification rules based on covariates typically depend on simple loss functions such as zero-one misclassification. Some cases may require more complex loss functions. For example, individual-level monitoring of HIV-infected individuals on antiretroviral therapy requires periodic assessment of treatment fail...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.8082
update_date:2019-05-20 00:00:00
abstract::New combination regimens evaluated in phase II cancer clinical trials often show promising results compared to the standard therapy for a disease system. Selection of patients with a better prognosis can be a prominent factor for this optimism. For most disease systems, prognostic variables that are related to the out...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.706
update_date:2001-03-30 00:00:00
abstract::Evaluating treatment effects within different subsets of patients is a common practice in the analysis of individual randomized clinical trials. Such analyses are limited, however, by the number of patients available. Overviews, by providing evidence based on large numbers of patients, can be useful for overcoming the...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780060331
update_date:1987-04-01 00:00:00
abstract::Laboratory determinations on children aged 6 to 10 years obtained over a 5-year period are analysed by a method described in detail for differentiating between children from exposed and control areas of Seveso, Italy. In the analysis, stratification is employed to distinguish the separate days of laboratory measuremen...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780020221
update_date:1983-04-01 00:00:00
abstract::This paper presents combinatorial (exact) methods that are useful in the analysis of disease cluster data obtained from small environments, such as buildings and neighbourhoods. Maxwell-Boltzmann and Fermi-Dirac occupancy models are compared in terms of appropriateness of representation of disease incidence patterns (...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780121906
update_date:1993-10-01 00:00:00
abstract::Flexible survival models are in need when modelling data from long term follow-up studies. In many cases, the assumption of proportionality imposed by a Cox model will not be valid. Instead, a model that can identify time varying effects of fixed covariates can be used. Although there are several approaches that deal ...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.7088
update_date:2016-12-10 00:00:00
abstract::With the increasing availability of detailed clinical information, there is optimism that treatment choices can be selectively directed to those individuals most likely to benefit. While standard clinical trials can establish whether a treatment appears to be effective on average, subsequent work is needed to determin...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.6138
update_date:2014-07-30 00:00:00
abstract::We review two recent trends: the emergence of evidence-based medicine and the growing use of Bayesian statistics in medical applications. Evidence-based medicine requires an integrated assessment of the available evidence, and associated uncertainty, but there is also an emphasis on decision-making, for individual pat...
journal_title:Statistics in medicine
pub_type: Journal Article, Review
doi:10.1002/1097-0258(20001215)19:23<3291::aid-sim627>
update_date:2000-12-15 00:00:00
abstract::Population heterogeneity is frequently observed among patients' treatment responses in clinical trials because of various factors such as clinical background, environmental, and genetic factors. Different subpopulations defined by those baseline factors can lead to differences in the benefit or safety profile of a the...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.7925
update_date:2018-12-20 00:00:00
abstract::The problem of testing symmetry about zero has a long and rich history in the statistical literature. We introduce a new test that sequentially discards observations whose absolute value is below increasing thresholds defined by the data. McNemar's statistic is obtained at each threshold and the largest is used as the...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.5384
update_date:2012-11-20 00:00:00
abstract::Clinical research often involves continuous outcome measures, such as blood cholesterol, that are amenable to statistical techniques of analysis based on the mean, such as the t-test or multiple linear regression. Clinical interest, however, frequently focuses on the proportion of subjects who fall below or above a cl...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780140303
update_date:1995-02-15 00:00:00
abstract::Cancer patients, chronic kidney disease patients, and subjects infected with HIV are routinely monitored over time using biomarkers that represent key health status indicators. Furthermore, biomarkers are frequently used to guide initiation of new treatments or to inform changes in intervention strategies. Since key m...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.7675
update_date:2018-08-15 00:00:00
abstract::Meta-analyses of data from epidemiological studies are often based on odds ratios (ORs) or relative risks (RRs) and their 95 per cent confidence intervals (CIs) as reported by the authors. Where possible ORs, RRs and CIs should be checked against the source data. Some simple methods are presented for checking the vali...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/(sici)1097-0258(19990815)18:15<1973::aid-s
update_date:1999-08-15 00:00:00
abstract::In this paper, we consider fitting a flexible and interpretable additive regression model in a data-rich setting. We wish to avoid pre-specifying the functional form of the conditional association between each covariate and the response, while still retaining interpretability of the fitted functions. A number of recen...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.7859
update_date:2019-02-20 00:00:00
abstract::Current methods for statistical analysis of twin studies focus on continuous and dichotomous data, while only limited methodology exists for analysing multinomial data. As a consequence, investigators are often tempted to collapse multinomial data into two categories simply to facilitate the analysis. We address this ...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/1097-0258(20010130)20:2<249::aid-sim641>3.
update_date:2001-01-30 00:00:00
abstract::We compared eight nominal 95 per cent confidence intervals for the ratio of two Poisson parameters, both assumed small, on their true coverage (the probability that the interval includes the ratio of Poisson parameters) and median width. The commonly used log-linear interval, justified by asymptotic considerations, pr...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.3234
update_date:2008-09-10 00:00:00