Sensitivity of Fisher's exact test to minor perturbations in 2 x 2 contingency tables.

Abstract:

:The two tailed Fisher's exact P value is extremely sensitive to small perturbations in 2 x 2 contingency tables. An example indicates that a 1 per cent increase in the denominator of one treatment group results in a 32 per cent drop in the exact P value, but a mere 0.1 per cent decrease in the treatment success rate. This is equivalent to the increase in significance obtained by a 20 per cent increase in the sample size of both treatments without changing the observed success rates. This drop results from small changes in the probabilities of unobserved events. A systematic evaluation of 920 pairs of similar contingency tables shows that these fluctuations occur frequently over a wide range of sample sizes and significance levels. Doubling the one tailed exact P value provides a more consistent measure of inferential strength. We discuss various chi-squared continuity corrections.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Dupont WD

doi

10.1002/sim.4780050610

subject

Has Abstract

pub_date

1986-11-01 00:00:00

pages

629-35

issue

6

eissn

0277-6715

issn

1097-0258

journal_volume

5

pub_type

杂志文章
  • An extension of the continual reassessment methods using a preliminary up-and-down design in a dose finding study in cancer patients, in order to investigate a greater range of doses.

    abstract::In a phase I clinical trial in cancer patients, the drug involved had one known main adverse effect, which also occurs spontaneously in cancer patients with a fairly high frequency. Experiments in rats have shown marked effects of the drug on tumour growth in high doses, but also dose-dependent toxicity. Consequently,...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780140909

    authors: Møller S

    更新日期:1995-05-15 00:00:00

  • Recommended tests for association in 2 x 2 tables.

    abstract::The asymptotic Pearson's chi-squared test and Fisher's exact test have long been the most used for testing association in 2x2 tables. Unconditional tests preserve the significance level and generally are more powerful than Fisher's exact test for moderate to small samples, but previously were disadvantaged by being co...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3531

    authors: Lydersen S,Fagerland MW,Laake P

    更新日期:2009-03-30 00:00:00

  • Sample size calculations for comparative studies of medical tests for detecting presence of disease.

    abstract::Technologic advances give rise to new tests for detecting disease in many fields, including cancer and sexually transmitted disease. Before a new disease screening test is approved for public use, its accuracy should be shown to be better than or at least not inferior to an existing test. Standards do not yet exist fo...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1058

    authors: Alonzo TA,Pepe MS,Moskowitz CS

    更新日期:2002-03-30 00:00:00

  • Assessing the goodness-of-fit of the Laird and Ware model--an example: the Jimma Infant Survival Differential Longitudinal Study.

    abstract::The Jimma Infant Survival Differential Longitudinal Study is an Ethiopian study, set up to establish risk factors affecting infant survival and to investigate socio-economic, maternal and infant-rearing factors that contribute most to the child's early survival. Here, a subgroup of about 1500 children born in Jimma to...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990415)18:7<835::aid-sim

    authors: Lesaffre E,Asefa M,Verbeke G

    更新日期:1999-04-15 00:00:00

  • Bounding the bias of unmeasured factors with confounding and effect-modifying potentials.

    abstract::Confounding is a major concern in observational studies. To adjust for confounding bias, the potential confounder(s) for a study must first be identified and measured. But this is not always possible. The unmeasured factors may also exhibit effect modification, and this further complicates the situation. In this paper...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4151

    authors: Lee WC

    更新日期:2011-04-30 00:00:00

  • On the use of discrete choice models for causal inference.

    abstract::Methodology for causal inference based on propensity scores has been developed and popularized in the last two decades. However, the majority of the methodology has concentrated on binary treatments. Only recently have these methods been extended to settings with multi-valued treatments. We propose a number of discret...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2095

    authors: Tchernis R,Horvitz-Lennon M,Normand SL

    更新日期:2005-07-30 00:00:00

  • Sampling-based estimation for massive survival data with additive hazards model.

    abstract::For massive survival data, we propose a subsampling algorithm to efficiently approximate the estimates of regression parameters in the additive hazards model. We establish consistency and asymptotic normality of the subsample-based estimator given the full data. The optimal subsampling probabilities are obtained via m...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8783

    authors: Zuo L,Zhang H,Wang H,Liu L

    更新日期:2021-01-30 00:00:00

  • A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data.

    abstract::Predicting an individual's risk of experiencing a future clinical outcome is a statistical task with important consequences for both practicing clinicians and public health experts. Modern observational databases such as electronic health records provide an alternative to the longitudinal cohort studies traditionally ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6526

    authors: Wolfson J,Bandyopadhyay S,Elidrisi M,Vazquez-Benitez G,Vock DM,Musgrove D,Adomavicius G,Johnson PE,O'Connor PJ

    更新日期:2015-09-20 00:00:00

  • Estimation of the population effectiveness of vaccination.

    abstract::This paper presents a simple method for estimation of population vaccination effectiveness, which is the fraction of disease cases prevented by a vaccination programme. The method is based on the susceptible-infectious-recovered (SIR) model for the spread of an epidemic in a heterogeneous population under non-homogene...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19970330)16:6<601::aid-sim

    authors: Haber M

    更新日期:1997-03-30 00:00:00

  • Joint modeling of repeated multivariate cognitive measures and competing risks of dementia and death: a latent process and latent class approach.

    abstract::Joint models initially dedicated to a single longitudinal marker and a single time-to-event need to be extended to account for the rich longitudinal data of cohort studies. Multiple causes of clinical progression are indeed usually observed, and multiple longitudinal markers are collected when the true latent trait of...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6731

    authors: Proust-Lima C,Dartigues JF,Jacqmin-Gadda H

    更新日期:2016-02-10 00:00:00

  • Last observation carry-forward and last observation analysis.

    abstract::Drop-out often occurs in clinical trials with multiple visits and drop-out is often informative in the sense that the population of patients who dropped out is different from the population of patients who completed the study. To handle data with informative drop-out, an intention-to-treat analysis, which evaluates tr...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1519

    authors: Shao J,Zhong B

    更新日期:2003-08-15 00:00:00

  • Partitioned GMM logistic regression models for longitudinal data.

    abstract::Correlation is inherent in longitudinal studies due to the repeated measurements on subjects, as well as due to time-dependent covariates in the study. In the National Longitudinal Study of Adolescent to Adult Health (Add Health), data were repeatedly collected on children in grades 7-12 across four waves. Thus, obser...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8099

    authors: Irimata KM,Broatch J,Wilson JR

    更新日期:2019-05-30 00:00:00

  • An evaluation of phase I clinical trial designs in the continuous dose-response setting.

    abstract::Both traditional phase I designs and the increasingly popular continual reassessment method (CRM) designs select an estimate of maximum tolerable dose (MTD) from among a set of prespecified dose levels. Although CRM designs use an implied dose-response model to select the next dose level, in general it is neither assu...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.903

    authors: Storer BE

    更新日期:2001-08-30 00:00:00

  • Using mark-recapture methodology to estimate the size of a population at risk for sexually transmitted diseases.

    abstract::To study the spread of sexually transmitted diseases (STDs) using social/sexual mixing models, one must have quantitative information about sexual mixing. An unavoidable complication in gathering such information by survey is that members of the surveyed population will almost certainly have sexual contacts outside th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780111202

    authors: Rubin G,Umbach D,Shyu SF,Castillo-Chavez C

    更新日期:1992-09-15 00:00:00

  • Nonparametric sequential evaluation of diagnostic biomarkers.

    abstract::We consider evaluation and comparison of the diagnostic accuracy of biomarkers with continuous test outcomes, possibly correlated due to repeated measurements. We develop nonparametric group sequential testing procedures to evaluate and compare the area of biomarkers under their receiver operating characteristic curve...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3203

    authors: Liu A,Wu C,Schisterman EF

    更新日期:2008-05-10 00:00:00

  • Modelling frailty in area mortality.

    abstract::This paper investigates the impact on area life tables of the specification of unobserved frailty. Frailty specification may affect both the regression effects of area and individual level covariates, and lead to changes in the value of summary mortality parameters, such as life expectancy. The paper also investigates...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780141703

    authors: Congdon P

    更新日期:1995-09-15 00:00:00

  • Assessment of equivalence on multiple endpoints.

    abstract::Some clinical trials aim to demonstrate therapeutic equivalence on multiple primary endpoints. For example, therapeutic equivalence studies of agents for the treatment of osteoarthritis use several primary endpoints including investigator's global assessment of disease activity, patient's global assessment of response...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.985

    authors: Quan H,Bolognese J,Yuan W

    更新日期:2001-11-15 00:00:00

  • Development and applications of a city-level alcohol availability and alcohol problems database.

    abstract::Data on alcohol availability and problems in all cities in Los Angeles County were collected from several different sources and linked together to form a Local Alcohol Availability Database (LAAD). The two major purposes of the project are to provide a city-level alcohol availability and alcohol-related problems datab...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780140517

    authors: MacKinnon DP,Scribner R,Taft KA

    更新日期:1995-03-15 00:00:00

  • Estimating probit models with self-selected treatments.

    abstract::Outcomes research often requires estimating the impact of a binary treatment on a binary outcome in a non-randomized setting, such as the effect of taking a drug on mortality. The data often come from self-selected samples, leading to a spurious correlation between the treatment and outcome when standard binary depend...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2226

    authors: Bhattacharya J,Goldman D,McCaffrey D

    更新日期:2006-02-15 00:00:00

  • Practical modifications to the time-to-event continual reassessment method for phase I cancer trials with fast patient accrual and late-onset toxicities.

    abstract::The goal of phase I cancer trials is to determine the highest dose of a treatment regimen with an acceptable toxicity rate. Traditional designs for phase I trials, such as the Continual Reassessment Method (CRM) and the 3 + 3 design, require each patient or a cohort of patients to be fully evaluated for the dose-limit...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4255

    authors: Polley MY

    更新日期:2011-07-30 00:00:00

  • A missing composite covariate in survival analysis: a case study of the Chinese Longitudinal Health and Longevity Survey.

    abstract::We estimate a Cox proportional hazards model where one of the covariates measures the level of a subject's cognitive functioning by grading the total score obtained by the subject on the items of a questionnaire. A case study is presented where the sample includes partial respondents, who did not answer some questionn...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3773

    authors: Lagona F,Zhang Z

    更新日期:2010-01-30 00:00:00

  • Semiparametric additive rates model for recurrent events data with intermittent gaps.

    abstract::Statistical methods for analyzing recurrent events have attracted significant attention. The majority of existing works consider situations in which subjects are observed over time periods and events of interest that occurred during the course of follow-up are recorded. In some applications, a subject may leave the st...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8042

    authors: Su PF,Zhong J,Ou HT

    更新日期:2019-04-15 00:00:00

  • Empirical evaluation of statistical models for counts or rates.

    abstract::We consider methods for selecting the joint specification of the mean and variance functions in statistical models for rates or counts. Based on analyses of diagnosis-specific hospital discharge rates in Michigan, we show that a Poisson model with an extra variance component for the systematic variation is superior to...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780100908

    authors: Wolfe RA,Petroni GR,McLaughlin CG,McMahon LF Jr

    更新日期:1991-09-01 00:00:00

  • Interval estimation for rank correlation coefficients based on the probit transformation with extension to measurement error correction of correlated ranked data.

    abstract::The Spearman (rho(s)) and Kendall (tau) rank correlation coefficient are routinely used as measures of association between non-normally distributed random variables. However, confidence limits for rho(s) are only available under the assumption of bivariate normality and for tau under the assumption of asymptotic norma...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2547

    authors: Rosner B,Glynn RJ

    更新日期:2007-02-10 00:00:00

  • Proportional hazards models and age-period-cohort analysis of cancer rates.

    abstract::Age-period-cohort (APC) analysis is widely used in cancer epidemiology to model trends in cancer rates. We develop methods for comparative APC analysis of two independent cause-specific hazard rates assuming that an APC model holds for each one. We construct linear hypothesis tests to determine whether the two hazards...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3865

    authors: Rosenberg PS,Anderson WF

    更新日期:2010-05-20 00:00:00

  • Chronic disease prevention: public health potential and research needs.

    abstract::This paper, arising out of an event to honour the statistical and scientific contributions of Professor Peter Armitage, is concerned with research strategies and needs for chronic disease prevention. A few highlights from recent intervention trials for the prevention of cancer, cardiovascular disease, fractures and di...

    journal_title:Statistics in medicine

    pub_type:

    doi:10.1002/sim.2045

    authors: Prentice RL

    更新日期:2004-11-30 00:00:00

  • Properties of R(2) statistics for logistic regression.

    abstract::Various R(2) statistics have been proposed for logistic regression to quantify the extent to which the binary response can be predicted by a given logistic regression model and covariates. We study the asymptotic properties of three popular variance-based R(2) statistics. We find that two variance-based R(2) statistic...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2300

    authors: Hu B,Palta M,Shao J

    更新日期:2006-04-30 00:00:00

  • Multistate models and lifetime risk estimation: Application to Alzheimer's disease.

    abstract::The lifetime risk of a clinical condition is the probability of onset of the condition during one's lifespan. Recent advances in Alzheimer's disease (AD) research have identified screening tests for biomarkers that can identify persons who are in the earliest stages of the AD process but who do not yet have any clinic...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8056

    authors: Brookmeyer R,Abdalla N

    更新日期:2019-04-30 00:00:00

  • Tests for individual and population bioequivalence based on generalized p-values.

    abstract::The U.S. Food and Drug Administration (FDA) has proposed new regulations that address the 'prescribability' and 'switchability' of new formulations of already-approved drugs. These new criteria are known, respectively, as population and individual bioequivalence. Two methods have been proposed in the bioequivalence li...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1346

    authors: McNally RJ,Iyer H,Mathew T

    更新日期:2003-01-15 00:00:00

  • Flexible longitudinal linear mixed models for multiple censored responses data.

    abstract::In biomedical studies and clinical trials, repeated measures are often subject to some upper and/or lower limits of detection. Hence, the responses are either left or right censored. A complication arises when more than one series of responses is repeatedly collected on each subject at irregular intervals over a perio...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8017

    authors: Lachos VH,A Matos L,Castro LM,Chen MH

    更新日期:2019-03-15 00:00:00