The Wilcoxon-Mann-Whitney test under scrutiny.

Abstract:

The Wilcoxon-Mann-Whitney (WMW) test is often used to compare the means or medians of two independent, possibly nonnormal distributions. For this problem, the true significance level of the large-sample approximate version of the WMW test is known to be sensitive to differences in the shapes of the distributions. Based on a wide-ranging simulation study, our paper shows that the lack of robustness of this test is more serious than is generally thought. In particular, small differences in variances and moderate degrees of skewness can produce large deviations from the nominal type I error rate. This is further exacerbated when the two distributions have different degrees of skewness. Other rank-based methods like the Fligner-Policello (FP) test and the Brunner-Munzel (BM) test perform similarly, although the BM test is generally better. By considering the WMW test as a two-sample t test on ranks, we explain the results by noting some undesirable properties of the rank transformation. In practice, the ranked samples should be examined and found to sufficiently satisfy reasonable symmetry and variance homogeneity before the test results are interpreted.
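The kind of simulation the abstract describes can be sketched as follows. This is a minimal illustration, not the authors' code: the function names `wmw_pvalue` and `rejection_rate`, the normal distributions, the sample sizes, and the standard deviations are all assumptions chosen for the example. Both samples are drawn with the same mean and median, so every rejection is a type I error for a comparison of location, and the observed rejection rate can be compared with the nominal level.

```python
import math
import random


def wmw_pvalue(x, y):
    """Two-sided large-sample WMW p-value.

    Assumes continuous data (no ties); no continuity correction is applied.
    """
    n1, n2 = len(x), len(y)
    # Pool the samples, remembering which group each value came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    # Rank sum of the first sample (ranks start at 1).
    r1 = sum(rank for rank, (_, g) in enumerate(pooled, start=1) if g == 0)
    u = r1 - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean_u) / sd_u
    # 2 * (1 - Phi(|z|)) written with the complementary error function.
    return math.erfc(abs(z) / math.sqrt(2))


def rejection_rate(n1, sd1, n2, sd2, reps=2000, alpha=0.05, seed=1):
    """Fraction of simulated data sets in which the WMW test rejects.

    Both samples are normal with mean 0, so means and medians are equal
    and only the variances (and sample sizes) differ.
    """
    rng = random.Random(seed)
    rejections = 0
    for _ in range(reps):
        x = [rng.gauss(0.0, sd1) for _ in range(n1)]
        y = [rng.gauss(0.0, sd2) for _ in range(n2)]
        if wmw_pvalue(x, y) < alpha:
            rejections += 1
    return rejections / reps


if __name__ == "__main__":
    # Hypothetical settings: equal variances versus a smaller, more variable sample.
    print("equal variances  :", rejection_rate(25, 1.0, 75, 1.0))
    print("unequal variances:", rejection_rate(25, 4.0, 75, 1.0))
```

Comparing the two printed rates against the nominal 0.05 gives a rough feel for the sensitivity to variance differences that the abstract reports; skewed distributions could be substituted for `rng.gauss` to probe the other scenarios mentioned.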

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Fagerland MW,Sandvik L

doi

10.1002/sim.3561

subject

Has Abstract

pub_date

2009-05-01 00:00:00

pages

1487-97

issue

10

eissn

1097-0258

issn

0277-6715

journal_volume

28

pub_type

Journal Article
  • Sample size calculations for comparative studies of medical tests for detecting presence of disease.

    abstract::Technologic advances give rise to new tests for detecting disease in many fields, including cancer and sexually transmitted disease. Before a new disease screening test is approved for public use, its accuracy should be shown to be better than or at least not inferior to an existing test. Standards do not yet exist fo...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1058

    authors: Alonzo TA,Pepe MS,Moskowitz CS

    update_date:2002-03-30 00:00:00

  • The relative importance of prognostic factors in studies of survival.

    abstract::The relative importance of prognostic factors in regression can be measured either by standardized regression coefficients or by percentages of explained variation in a dependent variable. One advantage of using explained variation is the direct comparability of qualitative prognostic factors with others, or of groups...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780122413

    authors: Schemper M

    update_date:1993-12-30 00:00:00

  • A semi-parametric Bayesian approach to average bioequivalence.

    abstract::Bioequivalence assessment is an issue of great interest. Development of statistical methods for assessing bioequivalence is an important area of research for statisticians. Bioequivalence is usually determined based on the normal distribution. We relax this assumption and develop a semi-parametric mixed model for bioe...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2620

    authors: Ghosh P,Rosner GL

    update_date:2007-03-15 00:00:00

  • Estimating age-related trends in cross-sectional studies using S-distributions.

    abstract::Growth trends in children are often based on cross-sectional studies, in which a sample of the population is investigated at one given point in time. Estimating age-related percentiles in such studies involves fitting data distributions, each of which is specific for one age group, and a subsequent smoothing of the pe...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(20000315)19:5<697::aid-sim

    authors: Sorribas A,March J,Voit EO

    update_date:2000-03-15 00:00:00

  • Nonparametric collective spectral density estimation with an application to clustering the brain signals.

    abstract::In this paper, we develop a method for the simultaneous estimation of spectral density functions (SDFs) for a collection of stationary time series that share some common features. Due to the similarities among the SDFs, the log-SDF can be represented using a common set of basis functions. The basis shared by the colle...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7972

    authors: Maadooliat M,Sun Y,Chen T

    update_date:2018-12-30 00:00:00

  • Confidence intervals for an exposure adjusted incidence rate difference with applications to clinical trials.

    abstract::To summarize safety data such as clinical adverse experiences in clinical trials with a moderate to long-term follow-up, we may use a measurement which accounts for the potential differences in the follow-up duration between treatment groups. The incidence rate, which uses the total person-time follow-up in a treatmen...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2335

    authors: Liu GF,Wang J,Liu K,Snavely DB

    update_date:2006-04-30 00:00:00

  • Comparing and combining data across multiple sources via integration of paired-sample data to correct for measurement error.

    abstract::In biomedical research such as the development of vaccines for infectious diseases or cancer, study outcomes measured by an assay or device are often collected from multiple sources or laboratories. Measurement error that may vary between laboratories needs to be adjusted for when combining samples across data sources...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5446

    authors: Huang Y,Huang Y,Moodie Z,Li S,Self S

    update_date:2012-12-10 00:00:00

  • Tests for individual and population bioequivalence based on generalized p-values.

    abstract::The U.S. Food and Drug Administration (FDA) has proposed new regulations that address the 'prescribability' and 'switchability' of new formulations of already-approved drugs. These new criteria are known, respectively, as population and individual bioequivalence. Two methods have been proposed in the bioequivalence li...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1346

    authors: McNally RJ,Iyer H,Mathew T

    update_date:2003-01-15 00:00:00

  • Sample size calculation for clinical trials in which entry criteria and outcomes are counts of events. ACIP Investigators. Asymptomatic Cardiac Ischemia Pilot.

    abstract::In many chronic diseases, therapy aims to prevent or reduce the frequency of episodes of a disease manifestation, for example cardiac ischaemic episodes or epileptic seizures. Entry criteria for clinical trials typically include a minimum number of episodes within a baseline period, and regression to the mean should b...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780130806

    authors: McMahon RP,Proschan M,Geller NL,Stone PH,Sopko G

    update_date:1994-04-30 00:00:00

  • Boosting for detection of gene-environment interactions.

    abstract::In genetic association studies, it is typically thought that genetic variants and environmental variables jointly will explain more of the inheritance of a phenotype than either of these two components separately. Traditional methods to identify gene-environment interactions typically consider only one measured enviro...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5444

    authors: Pashova H,LeBlanc M,Kooperberg C

    update_date:2013-01-30 00:00:00

  • Efficient adaptive designs with mid-course sample size adjustment in clinical trials.

    abstract::Adaptive designs have been proposed for clinical trials in which the nuisance parameters or alternative of interest are unknown or likely to be misspecified before the trial. Although most previous works on adaptive designs and mid-course sample size re-estimation have focused on two-stage or group-sequential designs ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3201

    authors: Bartroff J,Lai TL

    update_date:2008-05-10 00:00:00

  • An easy-to-implement approach for analyzing case-control and case-only studies assuming gene-environment independence and Hardy-Weinberg equilibrium.

    abstract::The case-control study is a simple and useful method to characterize the effect of a gene, the effect of an exposure, as well as the interaction between the two. The control-free case-only study is yet an even simpler design, if interest is centered on gene-environment interaction only. It requires the sometimes pl...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4028

    authors: Lee WC,Wang LY,Cheng KF

    update_date:2010-10-30 00:00:00

  • What do we mean by validating a prognostic model?

    abstract::Prognostic models are used in medicine for investigating patient outcome in relation to patient and disease characteristics. Such models do not always work well in practice, so it is widely recommended that they need to be validated. The idea of validating a prognostic model is generally taken to mean establishing tha...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(20000229)19:4<453::aid-sim

    authors: Altman DG,Royston P

    update_date:2000-02-29 00:00:00

  • STRengthening analytical thinking for observational studies: the STRATOS initiative.

    abstract::The validity and practical utility of observational medical research depends critically on good study design, excellent data quality, appropriate statistical methods and accurate interpretation of results. Statistical methodology has seen substantial development in recent times. Unfortunately, many of these methodolog...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6265

    authors: Sauerbrei W,Abrahamowicz M,Altman DG,le Cessie S,Carpenter J,STRATOS initiative.

    update_date:2014-12-30 00:00:00

  • Assessing the robustness of sisVIVE in a Mendelian randomization study to estimate the causal effect of body mass index on income using multiple SNPs from understanding society.

    abstract::The "some invalid, some valid instrumental variable estimator" (sisVIVE) is a lasso-based method for instrumental variables (IVs) regression of outcome on an exposure. In principle, sisVIVE is robust to some of the IVs in the analysis being invalid, in the sense of being related to the outcome variable through pathway...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8066

    authors: Bao Y,Clarke PS,Smart M,Kumari M

    update_date:2019-04-30 00:00:00

  • Analysis of additive risk model with high-dimensional covariates using partial least squares.

    abstract::In this paper, we construct a partial additive regression (PAR) model to predict the survival times of cancer patients based on microarray gene expression data with right censoring. The area under time-dependent receiver operating characteristic curve is used as a model evaluation criterion. We conduct a simulation st...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3412

    authors: Zhao Y,Zhou Y,Zhao M

    update_date:2009-01-30 00:00:00

  • Random-effects meta-analysis of the clinical utility of tests and prediction models.

    abstract::The use of data from multiple studies or centers for the validation of a clinical test or a multivariable prediction model allows researchers to investigate the test's/model's performance in multiple settings and populations. Recently, meta-analytic techniques have been proposed to summarize discrimination and calibra...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7653

    authors: Wynants L,Riley RD,Timmerman D,Van Calster B

    update_date:2018-05-30 00:00:00

  • Model averaging for robust assessment of QT prolongation by concentration-response analysis.

    abstract::Assessing the QT prolongation potential of a drug is typically done based on pivotal safety studies called thorough QT studies. Model-based estimation of the drug-induced QT prolongation at the estimated mean maximum drug concentration could increase efficiency over the currently used intersection-union test. However,...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7395

    authors: Dosne AG,Bergstrand M,Karlsson MO,Renard D,Heimann G

    update_date:2017-10-30 00:00:00

  • Quantifying the impact of between-study heterogeneity in multivariate meta-analyses.

    abstract::Measures that quantify the impact of heterogeneity in univariate meta-analysis, including the very popular I(2) statistic, are now well established. Multivariate meta-analysis, where studies provide multiple outcomes that are pooled in a single analysis, is also becoming more commonly used. The question of how to quan...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5453

    authors: Jackson D,White IR,Riley RD

    update_date:2012-12-20 00:00:00

  • Testing the equality of two Poisson means using the rate ratio.

    abstract::In this article, we investigate procedures for comparing two independent Poisson variates that are observed over unequal sampling frames (i.e. time intervals, populations, areas or any combination thereof). We consider two statistics (with and without the logarithmic transformation) for testing the equality of two Poi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1949

    authors: Ng HK,Tang ML

    update_date:2005-03-30 00:00:00

  • Drug treatment of mild hypertension to reduce the risk of CHD: is it worth-while?

    abstract::Although hypertension is regarded as a causal factor for coronary heart disease (CHD) a reduction in the risk of CHD as a result of lowering blood pressure in mild hypertension could not be demonstrated. This conclusion is based on an overview analysis of all published randomized trials in mild hypertension, including...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780071104

    authors: Holme I

    update_date:1988-11-01 00:00:00

  • A robust goodness-of-fit test statistic with application to ordinal regression models.

    abstract::We propose a goodness-of-fit test statistic for linear regression with heterogeneous variance, which is asymptotically chi-square if the given model is correct. The test statistic is computed as a quadratic form of observed minus predicted responses. We apply the method to a linear regression for an ordinal categorica...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780130205

    authors: Lipsitz SR,Buoncristiani JF

    update_date:1994-01-30 00:00:00

  • Internal pilot studies I: type I error rate of the naive t-test.

    abstract::When sample size is recalculated using unblinded interim data, use of the usual t-test at the end of a study may lead to an elevated type I error rate. This paper describes a numerical quadrature investigation to calculate the true probability of rejection as a function of the time of the recalculation, the magnitude ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19991230)18:24<3481::aid-s

    authors: Wittes J,Schabenberger O,Zucker D,Brittain E,Proschan M

    update_date:1999-12-30 00:00:00

  • Identifying representative trees from ensembles.

    abstract::Tree-based methods have become popular for analyzing complex data structures where the primary goal is risk stratification of patients. Ensemble techniques improve the accuracy in prediction and address the instability in a single tree by growing an ensemble of trees and aggregating. However, in the process, individua...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4492

    authors: Banerjee M,Ding Y,Noone AM

    update_date:2012-07-10 00:00:00

  • Modelling the 1985 influenza epidemic in France.

    abstract::The Rvachev-Baroyan-Longini model is a space-time predictive model of the spread of influenza epidemics. It has been applied to 128 cities of the USSR, and more recently, to forecasting the spread of the pandemic of 1968-1969 throughout 52 large cities. It is a deterministic, mass-action, space and time continuous mod...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780071107

    authors: Flahault A,Letrait S,Blin P,Hazout S,Ménarés J,Valleron AJ

    update_date:1988-11-01 00:00:00

  • Empirical Bayes versus fully Bayesian analysis of geographical variation in disease risk.

    abstract::This paper reviews methods for mapping geographical variation in disease incidence and mortality. Recent results in Bayesian hierarchical modelling of relative risk are discussed. Two approaches to relative risk estimation, along with the related computational procedures, are described and compared. The first is an em...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780110802

    authors: Bernardinelli L,Montomoli C

    update_date:1992-06-15 00:00:00

  • Targeted maximum likelihood estimation for a binary treatment: A tutorial.

    abstract::When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G-formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In con...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7628

    authors: Luque-Fernandez MA,Schomaker M,Rachet B,Schnitzer ME

    update_date:2018-07-20 00:00:00

  • A spatial scan statistic for ordinal data.

    abstract::Spatial scan statistics are widely used for count data to detect geographical disease clusters of high or low incidence, mortality or prevalence and to evaluate their statistical significance. Some data are ordinal or continuous in nature, however, so that it is necessary to dichotomize the data to use a traditional s...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2607

    authors: Jung I,Kulldorff M,Klassen AC

    update_date:2007-03-30 00:00:00

  • Viral load detectability profiles for HIV infection.

    abstract::The introduction of potent antiretroviral therapies for treatment of HIV infection typically results in a dramatic reduction in plasma HIV RNA concentration, often to levels undetectable by current measurement practices. However, although a high proportion of patients achieve 'undetectability', many then experience a ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1325

    authors: McKinnon EJ,James IR,John M,Mallal SA

    update_date:2003-02-15 00:00:00

  • Using orthogonal polynomial scores in summarizing and evaluating longitudinal data collected in phase I and II clinical pharmacology studies.

    abstract::Orthogonal polynomial scores (OPS) is a simple, biologically meaningful approach to characterize longitudinal data in phase I and II clinical pharmacology trials. It describes average, linear, quadratic and higher order polynomial characteristics of each subject's response over time with use of composite scores comput...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780120703

    authors: Bradstreet TE

    update_date:1993-04-15 00:00:00