Abstract:
:Drawing inferences for high-dimensional models is challenging as regular asymptotic theories are not applicable. This article proposes a new framework of simultaneous estimation and inferences for high-dimensional linear models. By smoothing over partial regression estimates based on a given variable selection scheme, we reduce the problem to low-dimensional least squares estimations. The procedure, termed as Selection-assisted Partial Regression and Smoothing (SPARES), utilizes data splitting along with variable selection and partial regression. We show that the SPARES estimator is asymptotically unbiased and normal, and derive its variance via a nonparametric delta method. The utility of the procedure is evaluated under various simulation scenarios and via comparisons with the de-biased LASSO estimators, a major competitor. We apply the method to analyze two genomic datasets and obtain biologically meaningful results.
journal_name
Biometricsjournal_title
Biometricsauthors
Fei Z,Zhu J,Banerjee M,Li Ydoi
10.1111/biom.13013subject
Has Abstractpub_date
2019-06-01 00:00:00pages
551-561issue
2eissn
0006-341Xissn
1541-0420journal_volume
75pub_type
杂志文章相关文献
BIOMETRICS文献大全abstract::Prentice and Sheppard (1995, Biometrika 82, 113-125) proposed a method for estimating relative risks associated with poorly measured exposures using disease rates from multiple populations and exposure and confounding factor data from sample surveys of persons in each population. The method involved an assumption of i...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1998-12-01 00:00:00
abstract::The estimation of maternal genetic variances by a multivariate maximum likelihood method is discussed. As an illustration the method is applied to data on Tribolium using a model based on partitioning the maternal genetic effect into additive and dominance components. An alternative model due to Falconer (1965) is als...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1976-12-01 00:00:00
abstract::We propose methods for Bayesian inference for a new class of semiparametric survival models with a cure fraction. Specifically, we propose a semiparametric cure rate model with a smoothing parameter that controls the degree of parametricity in the right tail of the survival distribution. We show that such a parameter ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2001.00383.x
更新日期:2001-06-01 00:00:00
abstract::Genetic variants associated with disease outcomes can be used to develop personalized treatment. To reach this precision medicine goal, hundreds of large-scale genome-wide association studies (GWAS) have been conducted in the past decade to search for promising genetic variants associated with various traits. They hav...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13076
更新日期:2019-12-01 00:00:00
abstract::Left truncation commonly occurs in many areas, and many methods have been proposed in the literature for the analysis of various types of left-truncated failure time data. For the situation, a common approach is to conduct the analysis conditional on truncation times, and the method is relatively simple but may not be...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13394
更新日期:2020-10-15 00:00:00
abstract::In a study designed to assess the relationship between a dichotomous exposure and the eventual occurrence of a dichotomous outcome, frequency matching has been proposed as a way to balance the exposure cohorts with respect to the sampling distribution of potential confounding factors. This paper discusses the pooled e...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1985-03-01 00:00:00
abstract::In this article, we develop new methods for estimating average treatment effects in observational studies, in settings with more than two treatment levels, assuming unconfoundedness given pretreatment variables. We emphasize propensity score subclassification and matching methods which have been among the most popular...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12505
更新日期:2016-12-01 00:00:00
abstract::This paper considers methods for estimating the relationship between a binary response Y and the genetic effects responsible for a second binary trait Z. The responses Y are observed only for target individuals, and the responses Z are observed only for the relatives of these targets. The analysis consists of two part...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00808.x
更新日期:2000-09-01 00:00:00
abstract::Spatial weed count data are modeled and predicted using a generalized linear mixed model combined with a Bayesian approach and Markov chain Monte Carlo. Informative priors for a data set with sparse sampling are elicited using a previously collected data set with extensive sampling. Furthermore, we demonstrate that so...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2002.00280.x
更新日期:2002-06-01 00:00:00
abstract::We consider hidden Markov models as a versatile class of models for weakly dependent random phenomena. The topic of the present paper is likelihood-ratio testing for hidden Markov models, and we show that, under appropriate conditions, the standard asymptotic theory of likelihood-ratio tests is valid. Such tests are c...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00742.x
更新日期:2000-09-01 00:00:00
abstract::We consider modeling correlated survival data when cluster sizes may be informative to the outcome of interest based on a within-cluster resampling (WCR) approach and a weighted score function (WSF) method. We derive the large sample properties for the WCR estimators under the Cox proportional hazards model. We establ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00730.x
更新日期:2007-09-01 00:00:00
abstract::Laska and Meisner (1989, Biometrics 45, 1139-1151) dealt with the problem of testing whether an identified treatment belonging to a set of k + 1 treatments is better than each of the other k treatments. They calculated sample size tables for k = 2 when using multiple t-tests or Wilcoxon-Mann-Whitney tests, both under ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00879.x
更新日期:2000-09-01 00:00:00
abstract::The classical concordance correlation coefficient (CCC) to measure agreement among a set of observers assumes data to be distributed as normal and a linear relationship between the mean and the subject and observer effects. Here, the CCC is generalized to afford any distribution from the exponential family by means of...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2009.01335.x
更新日期:2010-09-01 00:00:00
abstract::Tsiatis, Rosner, and Mehta (1984, Biometrics 40, 797-803) proposed a procedure for constructing confidence intervals following group sequential tests of a normal mean. This method is first extended for group sequential tests for which the sample sizes between interim analyses are not identical or the times are not equ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1987-12-01 00:00:00
abstract::In epidemiologic studies, subjects are often misclassified as to their level of exposure. Ignoring this misclassification error in the analysis introduces bias in the estimates of certain parameters and invalidates many hypothesis tests. For situations in which there is misclassification of exposure in a follow-up stu...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1991-06-01 00:00:00
abstract::This article describes the theoretical and practical issues in experimental design for gene expression microarrays. Specifically, this article 1) discusses the basic principles of design (randomization, replication, and blocking) as they pertain to microarrays, and 2) provides some general guidelines for statisticians...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2003.00096.x
更新日期:2003-12-01 00:00:00
abstract::The accuracy of a new diagnostic test is often determined by comparison with a reference test which also has unknown error rates. Maximum likelihood estimation of the error rates of both tests is possible if they are simultaneously applied to two populations with different disease prevalences. The estimation procedure...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1985-12-01 00:00:00
abstract:SUMMARY:Many hormones are secreted in pulses. The pulsatile relationship between hormones regulates many biological processes. To understand endocrine system regulation, time series of hormone concentrations are collected. The goal is to characterize pulsatile patterns and associations between hormones. Currently each ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2008.01117.x
更新日期:2009-06-01 00:00:00
abstract::This paper focuses on the methodology developed for analyzing a multivariate interval-censored data set from an AIDS observational study. A purpose of the study was to determine the natural history of the opportunistic infection cytomeglovirus (CMV) in an HIV-infected individual. For this observational study, laborato...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00940.x
更新日期:2000-09-01 00:00:00
abstract::Mixed latent Markov (MLM) models represent an important tool of analysis of longitudinal data when response variables are affected by time-fixed and time-varying unobserved heterogeneity, in which the latter is accounted for by a hidden Markov chain. In order to avoid bias when using a model of this type in the presen...
journal_title:Biometrics
pub_type: 杂志文章,随机对照试验
doi:10.1111/biom.12224
更新日期:2015-03-01 00:00:00
abstract::The situation considered is the prediction of a future observation from a simple exponential survival distribution in a hierarchical Bayes context. It is shown that when the hyperparameters need to be estimated from the data, a sample reuse approach is superior to maximum likelihood and method of moments estimation pr...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1990-03-01 00:00:00
abstract::A common measure in clinical trials and epidemiologic studies is the number of events such as seizures, hospitalizations, or bouts of disease. Frequently, a binary measure of severity for each event is available but is not incorporated in the analysis. This paper proposes methodology for jointly modeling the number of...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1997-09-01 00:00:00
abstract::Suppose that the response variable in a well-executed clinical or observational study to evaluate a treatment is the time to a certain event, and a set of baseline covariates or predictors was collected for each study patient. Furthermore, suppose that a significant number of study patients had nontrivial, long-term a...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2003.00116.x
更新日期:2003-12-01 00:00:00
abstract::Several models for the course of microbial infections during the incubation period are examined. Each model fits sore throat incubation data very well. Together with the lack of precision of the resulting parameter estimates, this suggests that incubation data alone are insufficient for elucidating the finer details o...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1980-06-01 00:00:00
abstract::Epidemiological studies involving biomarkers are often hindered by prohibitively expensive laboratory tests. Strategically pooling specimens prior to performing these lab assays has been shown to effectively reduce cost with minimal information loss in a logistic regression setting. When the goal is to perform regress...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12134
更新日期:2014-03-01 00:00:00
abstract::For the analysis of time trends in incidence and mortality rates, the age-period-cohort (apc) model has became a widely accepted method. The considered data are arranged in a two-way table by age group and calendar period, which are mostly subdivided into 5- or 10-year intervals. The disadvantage of this approach is t...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1997-03-01 00:00:00
abstract::Tang, Gnecco, and Geller (1989, Biometrika 76, 577-583) proposed an approximate likelihood ratio (ALR) test of the null hypothesis that a normal mean vector equals a null vector against the alternative that all of its components are nonnegative with at least one strictly positive. This test is useful for comparing a t...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2002.00650.x
更新日期:2002-09-01 00:00:00
abstract::With advancements in technology, the collection of multiple types of measurements on a common set of subjects is becoming routine in science. Some notable examples include multimodal neuroimaging studies for the simultaneous investigation of brain structure and function and multi-omics studies for combining genetic an...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13351
更新日期:2020-08-13 00:00:00
abstract::I discuss diagnostic methods for discriminant analysis. The equivalence with linear regression is noted and regression diagnostics are considered. The leverage is a function of the linear discriminant function and the Mahalanobis distance of the observation from the group mean. The distribution of this distance is app...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1997-12-01 00:00:00
abstract::We propose a sequential approach for constructing multiple-objective locally optimal designs for nonlinear models. The technique used here is a general one and we demonstrate the added benefits of using a multiple-objective design over a single-objective design with examples from biomedical studies. ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1998-12-01 00:00:00