Abstract:
:In applications such as medical statistics and genetics, we encounter situations where a large number of highly correlated predictors explain a response. For example, the response may be a disease indicator and the predictors may be treatment indicators or single nucleotide polymorphisms (SNPs). Constructing a good predictive model in such cases is well studied. Less well understood is how to recover the 'true sparsity pattern', that is finding which predictors have direct effects on the response, and indicating the statistical significance of the results. Restricting attention to binary predictors and response, we study the recovery of the true sparsity pattern using a two-stage method that separates establishing the presence of effects from inferring their exact relationship with the predictors. Simulations and a real data application demonstrate that the method discriminates well between associations and direct effects. Comparisons with lasso-based methods demonstrate favourable performance of the proposed method.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
Sperrin M,Jaki Tdoi
10.1002/sim.4014subject
Has Abstractpub_date
2010-10-30 00:00:00pages
2544-56issue
24eissn
0277-6715issn
1097-0258journal_volume
29pub_type
杂志文章abstract::We propose a semiparametric method for estimating ROC surfaces for continuous diagnostic tests based on two test measurements. Such a three-class diagnostic problem based on two test measurements arises naturally from some DNA amplification-related diagnostic scenarios. Simulation results show that our proposed semipa...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3625
更新日期:2009-08-15 00:00:00
abstract::The present study investigates the performance of several statistical tests to detect publication bias in diagnostic meta-analysis by means of simulation. While bivariate models should be used to pool data from primary studies in diagnostic meta-analysis, univariate measures of diagnostic accuracy are preferable for t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6177
更新日期:2014-08-15 00:00:00
abstract::The National Children's Study is a national household probability sample designed to identify 100,000 children at birth and follow the sampled children for 21 years. Data from the study will support examining numerous hypotheses concerning genetic and environmental effects on the health and development of children. Th...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3891
更新日期:2010-06-15 00:00:00
abstract::Biological drug products are therapeutic moieties manufactured by a living system or organisms. These are important life-saving drug products for patients with unmet medical needs. Because of expensive cost, only a few patients have access to life-saving biological products. Most of the early biological products will ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5565
更新日期:2013-02-10 00:00:00
abstract::It is not uncommon for a continuous outcome variable Y to be dichotomized and analysed using logistic regression. Moser and Coombs (Statist. Med. 2004; 23:1843-1860) provide a method for converting the output from a standard linear regression analysis using the original continuous outcome Y to give much more efficient...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3474
更新日期:2009-01-30 00:00:00
abstract::The availability of large data sets together with the growth in power and storage capabilities of computers have made the analysis of the spatial distribution of disease rates an increasingly important tool in public health research. Use of existing geographic divisions or groupings tends to result either in unstable ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780121916
更新日期:1993-10-01 00:00:00
abstract::In some randomized controlled trials, subjects with a better prognosis may be diverted into the treatment group. This subverting of randomization results in an unobserved non-compliance with the originally intended treatment assignment. Consequently, the estimate of treatment effect from these trials may be biased. Th...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.715
更新日期:2001-02-28 00:00:00
abstract::Omitted variable bias is discussed in the context of linear models. It is shown that the effect of omitted variables can be controlled in linear models for metric dependent variables by using data from follow-up studies. Two different models for analysing such data are proposed. In the first model the omitted variable...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780110906
更新日期:1992-06-30 00:00:00
abstract::Observational studies provide a rich source of information for assessing effectiveness of treatment interventions in many situations where it is not ethical or practical to perform randomized controlled trials. However, such studies are prone to bias from hidden (unmeasured) confounding. A promising approach to identi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7051
更新日期:2016-12-10 00:00:00
abstract::Many quantitative assay measurements of metabolites of environmental toxicants in clinical investigations are subject to left censoring due to values falling below assay detection limits. Moreover, when observations occur in both unexposed individuals and exposed individuals who reflect a mixture of two distributions ...
journal_title:Statistics in medicine
pub_type: 临床试验,杂志文章,随机对照试验
doi:10.1002/sim.2079
更新日期:2005-07-15 00:00:00
abstract::In many epidemiological studies it is common to resort to regression models relating incidence of a disease and its risk factors. The main goal of this paper is to consider inference on such models with error-prone observations and variances of the measurement errors changing across observations. We suppose that the o...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3343
更新日期:2008-11-10 00:00:00
abstract::The relationship between association and surrogacy has been the focus of much debate in the surrogate marker literature. Recently, the individual causal association (ICA) has been introduced as a metric of surrogacy in the causal inference framework, when both the surrogate and the true endpoint are normally distribut...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8698
更新日期:2020-11-20 00:00:00
abstract::Determination of the equation that relates an ordered dependent variable to ordered independent variables is sought. One solution, non-parametric discriminant analysis (NPD), involves obtaining the best monotonic step function by means of a computer search procedure. Although one can use alternative selection criteria...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780110804
更新日期:1992-06-15 00:00:00
abstract::The construction, validation and updating of a prognostic model for kidney graft survival is reported using data from the Eurotransplant database. First, a model is constructed for data from transplantations in the period 1984 to 1987. The model is later updated for the 1988 1990 data. The first data set was randomly ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780141806
更新日期:1995-09-30 00:00:00
abstract::In a medical study we are often interested in graphically displaying the relationship between continuous variables and clinical events indicating disease progression. Often, it is reasonable to make the minimal assumption that the risk of progression is an arbitrary monotone function of the continuous variable. Someti...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1561
更新日期:2003-10-30 00:00:00
abstract::Correlation is inherent in longitudinal studies due to the repeated measurements on subjects, as well as due to time-dependent covariates in the study. In the National Longitudinal Study of Adolescent to Adult Health (Add Health), data were repeatedly collected on children in grades 7-12 across four waves. Thus, obser...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8099
更新日期:2019-05-30 00:00:00
abstract::The matched case-control designs are commonly used to control for potential confounding factors in genetic epidemiology studies especially epigenetic studies with DNA methylation. Compared with unmatched case-control studies with high-dimensional genomic or epigenetic data, there have been few variable selection metho...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5694
更新日期:2013-05-30 00:00:00
abstract::A mediator acts as a third variable in the causal pathway between a risk factor and an outcome. In this paper, we consider the estimation of the mediation effect when the mediator is a binary variable. We give a precise definition of the mediation effect and examine asymptotic properties of five different estimators o...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2730
更新日期:2007-08-15 00:00:00
abstract::In time-to-event analysis, the traditional summary measures have been based on the hazard function, survival function, quantile event time, restricted mean event time, and residual lifetime. Under competing risks, furthermore, typical summary measures have been the cause-specific hazard function and cumulative inciden...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8871
更新日期:2021-01-06 00:00:00
abstract::A relationship between baseline risk and treatment effect is increasingly investigated as a possible explanation of between-study heterogeneity in clinical trial meta-analysis. An approach that is still often applied in the medical literature is to plot the estimated treatment effects against the estimated measures of...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/1097-0258(20001230)19:24<3497::aid-sim830>
更新日期:2000-12-30 00:00:00
abstract::Public health interventions are often designed to target communities defined either geographically (e.g. cities, counties) or socially (e.g. schools or workplaces). The group randomized trial (GRT) is regarded as the gold standard for evaluating these interventions. However, community leaders may object to randomizati...
journal_title:Statistics in medicine
pub_type: 杂志文章,随机对照试验
doi:10.1002/sim.4237
更新日期:2011-07-10 00:00:00
abstract::This paper introduces a new spatial scan statistic designed to adjust cluster detection for longitudinal confounding factors indexed in space. The functional-model-adjusted statistic was developed using generalized functional linear models in which longitudinal confounding factors were considered to be functional cova...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8459
更新日期:2020-04-15 00:00:00
abstract::Two correction methods are considered for multiple logistic regression models with some covariates measured with error. Both methods are based on approximating the complicated regression model between the response and the observed covariates with simpler models. The first model is the logistic approximation proposed b...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780131105
更新日期:1994-06-15 00:00:00
abstract::Causal inference for non-censored response variables, such as binary or quantitative outcomes, is often based on either (1) direct standardization ('G-formula') or (2) inverse probability of treatment assignment weights ('propensity score'). To do causal inference in survival analysis, one needs to address right-censo...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7297
更新日期:2017-07-30 00:00:00
abstract::We examine the use of randomization-based inference for analyzing multiarmed randomized clinical trials, including the application of conditional randomization tests to multiple comparisons. The view is taken that the linkage of the statistical test to the experimental design (randomization procedure) should be recogn...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8418
更新日期:2020-02-20 00:00:00
abstract::This article introduces a global hypothesis test intended for studies with multiple endpoints. Our test makes use of a priori predictions about the direction of the result of each endpoint and we weight these predictions using the sample correlation matrix. The global alternative hypothesis concerns a parameter, ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8724
更新日期:2020-12-10 00:00:00
abstract::We propose a goodness-of-fit test statistic for linear regression with heterogeneous variance, which is asymptotically chi-square if the given model is correct. The test statistic is computed as a quadratic form of observed minus predicted responses. We apply the method to a linear regression for an ordinal categorica...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780130205
更新日期:1994-01-30 00:00:00
abstract::Clinical research often involves continuous outcome measures, such as blood cholesterol, that are amenable to statistical techniques of analysis based on the mean, such as the t-test or multiple linear regression. Clinical interest, however, frequently focuses on the proportion of subjects who fall below or above a cl...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780140303
更新日期:1995-02-15 00:00:00
abstract::In this paper, we define a modified version τ(b) of Kendall's tau to measure the association in a pair (X,Y) of random variables subject to fixed left censoring due to known lower detection limits. We provide a nonparametric estimator of τ(b) and investigate its asymptotic properties. We then assume an Archimedean cop...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4319
更新日期:2011-11-20 00:00:00
abstract::Genetic markers can be used as instrumental variables, in an analogous way to randomization in a clinical trial, to estimate the causal relationship between a phenotype and an outcome variable. Our purpose is to extend the existing methods for such Mendelian randomization studies to the context of multiple genetic mar...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3843
更新日期:2010-05-30 00:00:00