Abstract:
Colorectal cancer is the second leading cause of death from cancer in the United States. To improve the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interaction terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regression, indicating superior discriminative performance.
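The penalty described in the abstract can be illustrated with a minimal sketch of the objective function. This is not the authors' implementation. It assumes that the elided norm is the L1 (absolute value) norm and that differences are taken between adjacent categories of the same variable; the function name `penalized_neg_log_lik`, its parameters, and the tuning constants `lam1` and `lam2` are hypothetical names introduced for illustration only.

```python
import numpy as np

def penalized_neg_log_lik(beta0, beta, X, y, groups, lam1, lam2):
    """Logistic negative log-likelihood plus a sparsity and a fusion penalty.

    ASSUMPTIONS (not stated in the abstract): the penalty uses absolute
    values (L1), and differences are taken between adjacent categories.

    X      : (n, p) matrix of dummy-coded categories
    y      : (n,) vector of 0/1 outcomes
    groups : list of column-index lists, one per categorical variable,
             ordered so that neighbouring indices are neighbouring categories
    """
    beta = np.asarray(beta, dtype=float)
    eta = beta0 + X @ beta
    # log(1 + exp(eta)) evaluated stably via logaddexp
    nll = np.sum(np.logaddexp(0.0, eta) - y * eta)
    # Penalty on the coefficients: pushes whole categories/variables to zero
    sparsity = lam1 * np.sum(np.abs(beta))
    # Penalty on the differences: pushes neighbouring categories to share one value
    fusion = lam2 * sum(np.sum(np.abs(np.diff(beta[idx]))) for idx in groups)
    return nll + sparsity + fusion

# Toy data: columns 0 and 1 are two categories of one variable, column 2 is another variable.
X = np.array([[1, 0, 0], [0, 1, 1], [0, 0, 1], [1, 0, 1]], dtype=float)
y = np.array([1, 0, 0, 1], dtype=float)
groups = [[0, 1], [2]]

fused = penalized_neg_log_lik(0.0, [0.5, 0.5, 0.0], X, y, groups, lam1=0.0, lam2=1.0)
split = penalized_neg_log_lik(0.0, [0.5, 0.2, 0.0], X, y, groups, lam1=0.0, lam2=1.0)
print(fused, split)
```

When two categories of the same variable share a coefficient, the fusion term contributes nothing, which is how a penalty of this kind can group categories; minimizing the objective would require a solver suited to non-smooth penalties, which is outside this sketch.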
journal_name: Stat Methods Med Res
journal_title: Statistical methods in medical research
authors: Lin Y, Yu M, Wang S, Chappell R, Imperiale TF
doi: 10.1177/0962280213497432
subject: Has Abstract
pub_date: 2016-08-01 00:00:00
pages: 1677-91
issue: 4
eissn: 0962-2802
issn: 1477-0334
pii: 0962280213497432
journal_volume: 25
pub_type: Journal Article
abstract::Estimating the long-term health impact of air pollution using an ecological spatio-temporal study design is a challenging task, due to the presence of residual spatio-temporal autocorrelation in the health counts after adjusting for the covariate effects. This autocorrelation is commonly modelled by a set of random ef...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280214527384
update_date:2014-12-01 00:00:00
abstract::This article outlines the statistical developments that have taken place in the use of the EM algorithm in emission and transmission tomography during the past decade or so. We discuss the statistical aspects of the modelling of the projection data for both the emission and transmission cases and define the relevant p...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/096228029700600105
update_date:1997-03-01 00:00:00
abstract::Cluster randomized and multicentre trials evaluate the effect of a treatment on persons nested within clusters, for instance patients within clinics or pupils within schools. Although equal sample sizes per cluster are generally optimal for parameter estimation, they are rarely feasible. This paper addresses the relat...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280206079018
update_date:2008-08-01 00:00:00
abstract::The classical and most commonly used approach to building prediction intervals is the parametric approach. However, its main drawback is that its validity and performance highly depend on the assumed functional link between the covariates and the response. This research investigates new methods that improve the perfor...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280219829885
update_date:2020-01-01 00:00:00
abstract::Many longitudinal studies observe time to occurrence of a clinical event such as death, while also collecting serial measurements of one or more biomarkers that are predictive of the event, or are surrogate outcomes of interest. Joint modeling can be used to examine the relationship between the biomarker and the event...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280218764193
update_date:2019-05-01 00:00:00
abstract::In recent years, there has been a prominent discussion in the literature about the potential for overestimation of the treatment effect when a clinical trial stops at an interim analysis due to the experimental treatment showing a benefit over the control. However, there has been much less attention paid to the conver...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280218795320
update_date:2019-10-01 00:00:00
abstract::Sensitive questions are often involved in healthcare or medical survey research. Much empirical evidence has shown that the randomized response technique is useful for the collection of truthful responses. However, few studies have discussed methods to estimate the dependence of sensitive responses of multiple types. ...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280219847492
update_date:2020-03-01 00:00:00
abstract::Cure rate models have been widely adopted for characterizing survival data that have long-term survivors. Under a mixture cure rate model where the population is a mixture of cured and susceptible subjects, a primary goal is to study covariate effects on the cure probability and survival function of the susceptible su...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280217708684
update_date:2017-10-01 00:00:00
abstract::This work presents a brief overview of Markov models in cancer screening evaluation and focuses on two specific models. A three-state model was first proposed to estimate jointly the sensitivity of the screening procedure and the average duration in the preclinical phase, i.e. the period when the cancer is asymptomati...
journal_title:Statistical methods in medical research
pub_type: Journal Article, Review
doi:10.1177/0962280209359848
update_date:2010-10-01 00:00:00
abstract::Pattern-mixture model (PMM)-based controlled imputations have become a popular tool to assess the sensitivity of primary analysis inference to different post-dropout assumptions or to estimate treatment effectiveness. The methodology is well established for continuous responses but less well established for binary res...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280220941880
update_date:2020-12-01 00:00:00
abstract::It is common practice to analyze longitudinal data, which frequently arise in medical studies, using various mixed-effects models from the literature. However, the following issues may stand out in longitudinal data analysis: (i) In clinical practice, the profile of each subject's response from a longitudinal study may follow...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280214544207
update_date:2017-02-01 00:00:00
abstract::This paper focuses on inferential tools in the logistic regression model fitted by the Firth penalized likelihood. In this context, the Likelihood Ratio statistic is often reported to be the preferred choice as compared to the 'traditional' Wald statistic. In this work, we consider and discuss a wider range of test st...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280216661213
update_date:2018-05-01 00:00:00
abstract::We propose a hierarchical Bayesian methodology to model spatially or spatio-temporal clustered survival data with possibility of cure. A flexible continuous transformation class of survival curves indexed by a single parameter is used. This transformation model is a larger class of models containing two special cases ...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280212445658
update_date:2016-02-01 00:00:00
abstract::Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference +/- 1.96 standard deviation of the differences, provide an interval within which 95% of differenc...
journal_title:Statistical methods in medical research
pub_type: Journal Article, Review
doi:10.1177/096228029900800204
update_date:1999-06-01 00:00:00
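The 95% limits of agreement mentioned in the preceding abstract (mean difference +/- 1.96 standard deviation of the differences) can be computed with a short sketch. The function name `limits_of_agreement` and the sample values are hypothetical, and the use of the sample standard deviation (`ddof=1`) is an assumption, since the abstract does not say which estimator is meant.

```python
import numpy as np

def limits_of_agreement(method_a, method_b):
    """Return (lower, upper) 95% limits of agreement for paired measurements."""
    d = np.asarray(method_a, dtype=float) - np.asarray(method_b, dtype=float)
    mean_d = d.mean()
    # ASSUMPTION: sample standard deviation (n - 1 in the denominator)
    sd_d = d.std(ddof=1)
    return mean_d - 1.96 * sd_d, mean_d + 1.96 * sd_d

# Hypothetical paired readings from two measurement methods on the same subjects
lower, upper = limits_of_agreement([10, 12, 14, 16], [9, 12, 13, 17])
print(lower, upper)
```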
abstract::Trials run in either rare diseases, such as rare cancers, or rare sub-populations of common diseases are challenging in terms of identifying, recruiting and treating sufficient patients in a sensible period. Treatments for rare diseases are often designed for other disease areas and then later proposed as possible tre...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280216662070
update_date:2018-05-01 00:00:00
abstract::In this study, we discuss a decision theoretic or fully Bayesian approach to the sample size question in clinical trials with binary responses. Data are assumed to come from two binomial distributions. A Dirichlet distribution is assumed to describe prior knowledge of the two success probabilities p1 and p2. The param...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280211399562
update_date:2013-12-01 00:00:00
abstract::This paper illustrates the use of multidimensional scaling methods (MDS) to examine space-time patterns in epidemic data. The paper begins by outlining the principles of MDS. The model is then formally specified and illustrated by application to two data sets. The first is partly a tutorial example. It uses monthly re...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/096228029500400202
update_date:1995-06-01 00:00:00
abstract::This article aims to develop a probability-based model involving the use of direct likelihood formulation and generalised linear modelling (GLM) approaches useful in estimating important disease parameters from longitudinal or repeated measurement data. The current application is based on infection with respiratory sy...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280210385749
update_date:2011-10-01 00:00:00
abstract::Sample selection arises when the outcome of interest is partially observed in a study. Although sophisticated statistical methods in the parametric and non-parametric framework have been proposed to solve this problem, it is yet unclear how to deal with selectively missing covariate data using simple multiple imputati...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280217715663
update_date:2019-01-01 00:00:00
abstract::Tree-based methods are very powerful and popular tools for analysing survival data with right-censoring. The existing methods assume that the true time-to-event and the censoring times are independent given the covariates. We propose different ways to build survival forests when dependent censoring is suspected, by us...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280217727314
update_date:2019-02-01 00:00:00
abstract::Variable selection in semiparametric mixed models for longitudinal data remains a challenge, especially in the presence of multiple correlated outcomes. In this paper, we propose a model selection procedure that simultaneously selects fixed and random effects using a maximum penalized likelihood method with the adapti...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280217690769
update_date:2018-10-01 00:00:00
abstract::In this paper, a new allocation rule for treatment assignments in sequential clinical trials is proposed. The stratified and randomized play-the-winner rule (SRPWR) is an extension of the randomized play-the-winner rule to more than two treatments. It is applicable to cases where the probabilities of success of a trea...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280207081606
update_date:2008-12-01 00:00:00
abstract::Several epidemiological parameters have been introduced for quantifying the population impact of a certain exposure on morbidity on a population level, termed 'attributable risk' (AR). Of these definitions, the AR as suggested by Levin in 1953 or some algebraic transformations of it are most commonly used. A structure...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/096228020101000305
update_date:2001-06-01 00:00:00
abstract::The estimation of population parameters using complex survey data requires careful statistical modelling to account for the design features. This is further complicated by unit and item nonresponse for which a number of methods have been developed in order to reduce estimation bias. In this paper, we address some issu...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280213484401
update_date:2016-08-01 00:00:00
abstract::Regression models are frequently used to model the functional relationship between an outcome parameter of interest and one or more potentially relevant explanatory variables. The objective can be, for example, to set up a prognostic model or an estimation model for a certain parameter of interest. Determining half-li...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280213502403
update_date:2016-10-01 00:00:00
abstract::Early phase trials of complex interventions currently focus on assessing the feasibility of a large randomised control trial and on conducting pilot work. Assessing the efficacy of the proposed intervention is generally discouraged, due to concerns of underpowered hypothesis testing. In contrast, early assessment of e...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280215589507
update_date:2016-06-01 00:00:00
abstract::This review provides an overview of forensic inference from genetic markers. Because the judge and jurors are charged with decision-making, the forensic expert's job is to provide a useful summary of the evidence to the court. Hence, this review focuses on the likelihood ratio as a means of summarizing the genetic dat...
journal_title:Statistical methods in medical research
pub_type: Journal Article, Review
doi:10.1177/096228029300200304
update_date:1993-01-01 00:00:00
abstract::Propensity score methods are common for estimating a binary treatment effect when treatment assignment is not randomized. When exposure is measured on an ordinal scale (i.e. low-medium-high), however, propensity score inference requires extensions which have received limited attention. Estimands of possible interest w...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280214560046
update_date:2017-04-01 00:00:00
abstract::Improving the quality of care that patients receive is a major focus of clinical research, particularly in the setting of cardiovascular hospitalization. Quality improvement studies seek to estimate and visualize the degree of variability in dichotomous treatment patterns and outcomes across different providers, where...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280217754230
update_date:2019-04-01 00:00:00
abstract::Appropriate handling of aggregate missing outcome data is necessary to minimise bias in the conclusions of systematic reviews. The two-stage pattern-mixture model has been already proposed to address aggregate missing continuous outcome data. While this approach is more proper compared with the exclusion of missing co...
journal_title:Statistical methods in medical research
pub_type: Journal Article
doi:10.1177/0962280220983544
update_date:2021-01-06 00:00:00