Inferential tools in penalized logistic regression for small and sparse data: A comparative study.

Abstract:

:This paper focuses on inferential tools in the logistic regression model fitted by the Firth penalized likelihood. In this context, the Likelihood Ratio statistic is often reported to be the preferred choice as compared to the 'traditional' Wald statistic. In this work, we consider and discuss a wider range of test statistics, including the robust Wald, the Score, and the recently proposed Gradient statistic. We compare all these asymptotically equivalent statistics in terms of interval estimation and hypothesis testing via simulation experiments and analyses of two real datasets. We find out that the Likelihood Ratio statistic does not appear the best inferential device in the Firth penalized logistic regression.

journal_name

Stat Methods Med Res

authors

Siino M,Fasola S,Muggeo VM

doi

10.1177/0962280216661213

subject

Has Abstract

pub_date

2018-05-01 00:00:00

pages

1365-1375

issue

5

eissn

0962-2802

issn

1477-0334

pii

0962280216661213

journal_volume

27

pub_type

杂志文章
  • Forensic inference from genetic markers.

    abstract::This review provides an overview of forensic inference from genetic markers. Because the judge and jurors are charged with decision-making, the forensic expert's job is to provide a useful summary of the evidence to the court. Hence, this review focuses on the likelihood ratio as a means of summarizing the genetic dat...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/096228029300200304

    authors: Devlin B

    更新日期:1993-01-01 00:00:00

  • Evaluation of software for multiple imputation of semi-continuous data.

    abstract::It is now widely accepted that multiple imputation (MI) methods properly handle the uncertainty of missing data over single imputation methods. Several standard statistical software packages, such as SAS, R and STATA, have standard procedures or user-written programs to perform MI. The performance of these packages is...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280206074464

    authors: Yu LM,Burton A,Rivero-Arias O

    更新日期:2007-06-01 00:00:00

  • Prediction intervals with random forests.

    abstract::The classical and most commonly used approach to building prediction intervals is the parametric approach. However, its main drawback is that its validity and performance highly depend on the assumed functional link between the covariates and the response. This research investigates new methods that improve the perfor...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219829885

    authors: Roy MH,Larocque D

    更新日期:2020-01-01 00:00:00

  • A monotone data augmentation algorithm for longitudinal data analysis via multivariate skew-t, skew-normal or t distributions.

    abstract::The mixed effects model for repeated measures has been widely used for the analysis of longitudinal clinical data collected at a number of fixed time points. We propose a robust extension of the mixed effects model for repeated measures for skewed and heavy-tailed data on basis of the multivariate skew-t distribution,...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219865579

    authors: Tang Y

    更新日期:2020-06-01 00:00:00

  • Projections of cancer mortality risks using spatio-temporal P-spline models.

    abstract::Cancer mortality risk estimates are essential for planning resource allocation and designing and evaluating cancer prevention and management strategies. However, mortality figures generally become available after a few years, making necessary to develop reliable procedures to provide current and near future mortality ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212446366

    authors: Ugarte MD,Goicoa T,Etxeberria J,Militino AF

    更新日期:2012-10-01 00:00:00

  • Adjustment for treatment changes in epilepsy trials: A comparison of causal methods for time-to-event outcomes.

    abstract:BACKGROUND:When trials are subject to departures from randomised treatment, simple statistical methods that aim to estimate treatment efficacy, such as per protocol or as treated analyses, typically introduce selection bias. More appropriate methods to adjust for departure from randomised treatment are rarely employed,...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217735560

    authors: Dodd S,Williamson P,White IR

    更新日期:2019-03-01 00:00:00

  • Relative efficiency of unequal cluster sizes for variance component estimation in cluster randomized and multicentre trials.

    abstract::Cluster randomized and multicentre trials evaluate the effect of a treatment on persons nested within clusters, for instance patients within clinics or pupils within schools. Although equal sample sizes per cluster are generally optimal for parameter estimation, they are rarely feasible. This paper addresses the relat...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280206079018

    authors: van Breukelen GJ,Candel MJ,Berger MP

    更新日期:2008-08-01 00:00:00

  • Correcting for dependent censoring in routine outcome monitoring data by applying the inverse probability censoring weighted estimator.

    abstract::Censored data make survival analysis more complicated because exact event times are not observed. Statistical methodology developed to account for censored observations assumes that patients' withdrawal from a study is independent of the event of interest. However, in practice, some covariates might be associated to b...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280216628900

    authors: Willems S,Schat A,van Noorden MS,Fiocco M

    更新日期:2018-02-01 00:00:00

  • Regression towards the mean, historically considered.

    abstract::The simple yet subtle concept of regression towards the mean is reviewed historically. Verbal, geometric, and mathematical expressions of the concept date to the discoverer of the concept, Francis Galton. That discovery and subsequent understanding (and misunderstanding) of the concept are surveyed. ...

    journal_title:Statistical methods in medical research

    pub_type: 传,历史文章,杂志文章,评审

    doi:10.1177/096228029700600202

    authors: Stigler SM

    更新日期:1997-06-01 00:00:00

  • Designs in partially controlled studies: messages from a review.

    abstract::The ability to evaluate effects of factors on outcomes is increasingly important for studies that control some but not all of the factors. Although important advances have been made in methods of analysis for such partially controlled studies, work on designs has been limited. To help understand why, we review the mai...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1191/0962280205sm405oa

    authors: Li F,Frangakis CE

    更新日期:2005-08-01 00:00:00

  • The application of methods to quantify attributable risk in medical practice.

    abstract::Several epidemiological parameters have been introduced for quantifying the population impact of a certain exposure on morbidity on a population level, termed 'attributable risk' (AR). Of these definitions, the AR as suggested by Levin in 1953 or some algebraic transformations of it are most commonly used. A structure...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/096228020101000305

    authors: Uter W,Pfahlberg A

    更新日期:2001-06-01 00:00:00

  • Efficient Monte Carlo evaluation of resampling-based hypothesis tests with applications to genetic epidemiology.

    abstract::Monte Carlo evaluation of resampling-based tests is often conducted in statistical analysis. However, this procedure is generally computationally intensive. The pooling resampling-based method has been developed to reduce the computational burden but the validity of the method has not been studied before. In this arti...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280216661876

    authors: Fung WK,Yu K,Yang Y,Zhou JY

    更新日期:2018-05-01 00:00:00

  • Random-effects models for multivariate repeated measures.

    abstract::Mixed models are widely used for the analysis of one repeatedly measured outcome. If more than one outcome is present, a mixed model can be used for each one. These separate models can be tied together into a multivariate mixed model by specifying a joint distribution for their random effects. This strategy has been u...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280206075305

    authors: Fieuws S,Verbeke G,Molenberghs G

    更新日期:2007-10-01 00:00:00

  • A comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies.

    abstract:BACKGROUND:Recent literature on the comparison of machine learning methods has raised questions about the neutrality, unbiasedness and utility of many comparative studies. Reporting of results on favourable datasets and sampling error in the estimated performance measures based on single samples are thought to be the m...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280213502437

    authors: Khondoker M,Dobson R,Skirrow C,Simmons A,Stahl D

    更新日期:2016-10-01 00:00:00

  • Comparing cluster-level dynamic treatment regimens using sequential, multiple assignment, randomized trials: Regression estimation and sample size considerations.

    abstract::Cluster-level dynamic treatment regimens can be used to guide sequential treatment decision-making at the cluster level in order to improve outcomes at the individual or patient-level. In a cluster-level dynamic treatment regimen, the treatment is potentially adapted and re-adapted over time based on changes in the cl...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217708654

    authors: NeCamp T,Kilbourne A,Almirall D

    更新日期:2017-08-01 00:00:00

  • Joint modelling for organ transplantation outcomes for patients with diabetes and the end-stage renal disease.

    abstract::This article is motivated by jointly modelling longitudinal and time-to-event clinical data of patients with diabetes and end-stage renal disease. All patients are on the waiting list for the pancreas transplant after kidney transplant, and some of them have a pancreas transplant before kidney transplant failure or de...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218786980

    authors: Dong JJ,Wang S,Wang L,Gill J,Cao J

    更新日期:2019-09-01 00:00:00

  • Testing for association in case-control genome-wide association studies with shared controls.

    abstract::The statistical analysis of genome-wide association studies (GWASs) with multiple diseases and shared controls (SCs) is discussed. The usual method for analyzing data from these studies is to compare each individual disease with either the SCs or the pooled controls which include other diseases. We observed that apply...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212474061

    authors: Chen Z,Huang H,Ng HK

    更新日期:2016-04-01 00:00:00

  • Measurement error correction using validation data: a review of methods and their applicability in case-control studies.

    abstract::Measurement error is a serious problem in the analysis of epidemiological data. In the past 20 years, a large number of methods for the correction of measurement error have been developed. While at the beginning mostly methods for cohort studies were considered, recently more attention has been paid to case-control st...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/096228020000900504

    authors: Thürigen D,Spiegelman D,Blettner M,Heuer C,Brenner H

    更新日期:2000-10-01 00:00:00

  • Pseudo-observations in survival analysis.

    abstract::We review recent work on the application of pseudo-observations in survival and event history analysis. This includes regression models for parameters like the survival function in a single point, the restricted mean survival time and transition or state occupation probabilities in multi-state models, e.g. the competi...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/0962280209105020

    authors: Andersen PK,Perme MP

    更新日期:2010-02-01 00:00:00

  • Prospective analysis of infectious disease surveillance data using syndromic information.

    abstract::In this paper, we describe a Bayesian hierarchical Poisson model for the prospective analysis of data for infectious diseases. The proposed model consists of two components. The first component describes the behavior of disease during nonepidemic periods and the second component represents the increase in disease coun...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214527385

    authors: Corberán-Vallet A,Lawson AB

    更新日期:2014-12-01 00:00:00

  • The EM algorithm in medical imaging.

    abstract::This article outlines the statistical developments that have taken place in the use of the EM algorithm in emission and transmission tomography during the past decade or so. We discuss the statistical aspects of the modelling of the projection data for both the emission and transmission cases and define the relevant p...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/096228029700600105

    authors: Kay J

    更新日期:1997-03-01 00:00:00

  • Estimating marginal and incremental effects in the analysis of medical expenditure panel data using marginalized two-part random-effects generalized Gamma models: Evidence from China healthcare cost data.

    abstract::Conditional two-part random-effects models have been proposed for the analysis of healthcare cost panel data that contain both zero costs from the non-users of healthcare facilities and positive costs from the users. These models have been extended to accommodate more flexible data structures when using the generalize...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217690770

    authors: Zhang B,Liu W,Hu Y

    更新日期:2018-10-01 00:00:00

  • Estimating the personal cure rate of cancer patients using population-based grouped cancer survival data.

    abstract::Cancer patients are subject to multiple competing risks of death and may die from causes other than the cancer diagnosed. The probability of not dying from the cancer diagnosed, which is one of the patients' main concerns, is sometimes called the 'personal cure' rate. Two approaches of modelling competing-risk surviva...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280209347046

    authors: Binbing Yu,Tiwari RC,Feuer EJ

    更新日期:2011-06-01 00:00:00

  • Joint modeling of longitudinal zero-inflated count and time-to-event data: A Bayesian perspective.

    abstract::Longitudinal zero-inflated count data are encountered frequently in substance-use research when assessing the effects of covariates and risk factors on outcomes. Often, both the time to a terminal event such as death or dropout and repeated measure count responses are collected for each subject. In this setting, the l...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280216659312

    authors: Zhu H,DeSantis SM,Luo S

    更新日期:2018-04-01 00:00:00

  • Propensity scores: from naive enthusiasm to intuitive understanding.

    abstract::Estimation of the effect of a binary exposure on an outcome in the presence of confounding is often carried out via outcome regression modelling. An alternative approach is to use propensity score methodology. The propensity score is the conditional probability of receiving the exposure given the observed covariates a...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280210394483

    authors: Williamson E,Morley R,Lucas A,Carpenter J

    更新日期:2012-06-01 00:00:00

  • Interpolation between spatial frameworks: an application of process convolution to estimating neighbourhood disease prevalence.

    abstract::Health data may be collected across one spatial framework (e.g. health provider agencies), but contrasts in health over another spatial framework (neighbourhoods) may be of policy interest. In the UK, population prevalence totals for chronic diseases are provided for populations served by general practitioner practice...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212447150

    authors: Congdon P

    更新日期:2014-04-01 00:00:00

  • Accurate quantification of uncertainty in epidemic parameter estimates and predictions using stochastic compartmental models.

    abstract::Stochastic transmission dynamic models are needed to quantify the uncertainty in estimates and predictions during outbreaks of infectious diseases. We previously developed a calibration method for stochastic epidemic compartmental models, called Multiple Shooting for Stochastic Systems (MSS), and demonstrated its comp...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218805780

    authors: Zimmer C,Leuba SI,Cohen T,Yaesoubi R

    更新日期:2019-12-01 00:00:00

  • Shared parameter models for joint analysis of longitudinal and survival data with left truncation due to delayed entry - Applications to cystic fibrosis.

    abstract::Many longitudinal studies observe time to occurrence of a clinical event such as death, while also collecting serial measurements of one or more biomarkers that are predictive of the event, or are surrogate outcomes of interest. Joint modeling can be used to examine the relationship between the biomarker and the event...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218764193

    authors: Schluchter MD,Piccorelli AV

    更新日期:2019-05-01 00:00:00

  • A proof-of-concept-to-confirmatory multiple adaptation design in the development of an anti-viral treatment.

    abstract::In the clinical development of some new infectious disease drugs, early clinical pharmacology trials may predict with high confidence that the efficacious doses are well below the range of the safety margin. In this case, a dose-ranging study may be unnecessary after a proof-of-concept (PoC) study testing the highest ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218807950

    authors: Fan XF,Gallo P,Su G,Menton R,Segal F

    更新日期:2019-12-01 00:00:00

  • Multiplicity adjustments in trials with two correlated comparisons of interest.

    abstract::Clinical trials investigating the efficacy of two or more doses of an experimental treatment compared to a single reference arm are not uncommon. In such situations, if each dose is compared to the reference arm using an un-adjusted significance level, consideration of the Type I familywise error is likely to be requi...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280210378943

    authors: Fernandes N,Stone A

    更新日期:2011-12-01 00:00:00