Efficient regression analysis with ranked-set sampling.

Abstract:

This article is motivated by a lung cancer study where a regression model is involved and the response variable is too expensive to measure but the predictor variable can be measured easily at relatively negligible cost. This situation occurs quite often in medical studies, quantitative genetics, and ecological and environmental studies. In this article, by using the idea of ranked-set sampling (RSS), we develop sampling strategies that can reduce cost and increase efficiency of the regression analysis for the above-mentioned situation. The developed method is applied retrospectively to a lung cancer study. In the lung cancer study, the interest is to investigate the association between smoking status and three biomarkers: polyphenol DNA adducts, micronuclei, and sister chromatid exchanges. Optimal sampling schemes with different optimality criteria such as A-, D-, and integrated mean square error (IMSE)-optimality are considered in the application. With set size 10 in RSS, the improvement of the optimal schemes over simple random sampling (SRS) is substantial. For instance, by using the optimal scheme with IMSE-optimality, the IMSEs of the estimated regression functions for the three biomarkers are reduced to about half of those incurred by using SRS.

journal_name

Biometrics

journal_title

Biometrics

authors

Chen Z,Wang YG

doi

10.1111/j.0006-341X.2004.00255.x

subject

Has Abstract

pub_date

2004-12-01 00:00:00

pages

997-1004

issue

4

eissn

1541-0420

issn

0006-341X

pii

BIOM255

journal_volume

60

pub_type

Journal Article
  • Spatial-temporal modeling of the association between air pollution exposure and preterm birth: identifying critical windows of exposure.

    abstract::Exposure to high levels of air pollution during the pregnancy is associated with increased probability of preterm birth (PTB), a major cause of infant morbidity and mortality. New statistical methodology is required to specifically determine when a particular pollutant impacts the PTB outcome, to determine the role of...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2012.01774.x

    authors: Warren J,Fuentes M,Herring A,Langlois P

    update_date:2012-12-01 00:00:00

  • A note on case-control sampling to estimate kappa coefficients.

    abstract::The feasibility and cost-effectiveness of estimation of kappa using a case-control method of sampling, proposed by Jannarone, Macera, and Garrison (1987, Biometrics 43, 433-437), is provided support. However, in this article unrealistic assumptions in their presentation are identified and more general results for more...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Kraemer HC,Bloch DA

    update_date:1990-03-01 00:00:00

  • A comparison of several point estimators of the odds ratio in a single 2 x 2 contingency table.

    abstract::The relative performance of the unconditioned maximum likelihood estimators (UMLEs), conditional MLEs (CMLEs), and Jewell-type estimators of the odds ratio (OR) and its logarithm were investigated in sets of single 2 x 2 contingency tables. The tables were generated by complete enumeration of all possible cell frequen...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Walter SD,Cook RJ

    update_date:1991-09-01 00:00:00

  • Nonparametric comparison of two survival-time distributions in the presence of dependent censoring.

    abstract::When testing the null hypothesis that treatment arm-specific survival-time distributions are equal, the log-rank test is asymptotically valid when the distribution of time to censoring is conditionally independent of randomized treatment group given survival time. We introduce a test of the null hypothesis for use whe...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/1541-0420.00059

    authors: DiRienzo AG

    update_date:2003-09-01 00:00:00

  • The analysis of pair-matched case-control studies, a multivariate approach.

    abstract::In matched case-control studies one frequently must consider more than one variable in the analysis and in this paper a log-linear model is presented to meet this objective. A conditional argument yields a method for making inferences on the parameters measuring the association between the variables and disease. The r...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Holford TR

    update_date:1978-12-01 00:00:00

  • A general model for the analysis of mark-resight, mark-recapture, and band-recovery data under tag loss.

    abstract::Estimates of waterfowl demographic parameters often come from resighting studies where birds fit with individually identifiable neck collars are resighted at a distance. Concerns have been raised about the effects of collar loss on parameter estimates, and the reliability of extrapolating from collared individuals to ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00245.x

    authors: Conn PB,Kendall WL,Samuel MD

    update_date:2004-12-01 00:00:00

  • A marginal mixed baseline hazards model for multivariate failure time data.

    abstract::In multivariate failure time data analysis, a marginal regression modeling approach is often preferred to avoid assumptions on the dependence structure among correlated failure times. In this paper, a marginal mixed baseline hazards model is introduced. Estimating equations are proposed for the estimation of the margi...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.1999.00805.x

    authors: Clegg LX,Cai J,Sen PK

    update_date:1999-09-01 00:00:00

  • An empirical Bayes' approach to joint analysis of multiple microarray gene expression studies.

    abstract::With the prevalence of gene expression studies and the relatively low reproducibility caused by insufficient sample sizes, it is natural to consider joint analysis that could combine data from different experiments effectively to achieve improved accuracy. We present in this article a model-based approach for better i...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01602.x

    authors: Ruan L,Yuan M

    update_date:2011-12-01 00:00:00

  • A Bayesian approach to jointly modeling toxicity and biomarker expression in a phase I/II dose-finding trial.

    abstract::In this article, we propose a Bayesian approach to phase I/II dose-finding oncology trials by jointly modeling a binary toxicity outcome and a continuous biomarker expression outcome. We apply our method to a clinical trial of a new gene therapy for bladder cancer patients. In this trial, the biomarker expression indi...

    journal_title:Biometrics

    pub_type: Clinical Trial, Journal Article

    doi:10.1111/j.1541-0420.2005.00314.x

    authors: Bekele BN,Shen Y

    update_date:2005-06-01 00:00:00

  • Basal body temperature, ovulation and the risk of conception, with special reference to the lifetimes of sperm and egg.

    abstract::The risks of conception, due to sexual intercourse at various times before and after the periovulatory rise in the woman's basal body temperature, are evaluated. In general, the risk is small nine or more days before, and two or more days after, the first day of elevated temperature. The model for the conception proba...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Royston JP

    update_date:1982-06-01 00:00:00

  • N-mixture models for estimating population size from spatially replicated counts.

    abstract::Spatial replication is a common theme in count surveys of animals. Such surveys often generate sparse count data from which it is difficult to estimate population size while formally accounting for detection probability. In this article, I describe a class of models (N-mixture models) which allow for estimation of pop...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00142.x

    authors: Royle JA

    update_date:2004-03-01 00:00:00

  • Variable selection for logistic regression using a prediction-focused information criterion.

    abstract::In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables....

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00567.x

    authors: Claeskens G,Croux C,Van Kerckhoven J

    update_date:2006-12-01 00:00:00

  • Variance estimation for systematic designs in spatial surveys.

    abstract::In spatial surveys for estimating the density of objects in a survey region, systematic designs will generally yield lower variance than random designs. However, estimating the systematic variance is well known to be a difficult problem. Existing methods tend to overestimate the variance, so although the variance is g...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01604.x

    authors: Fewster RM

    update_date:2011-12-01 00:00:00

  • Fitting a multiplicative incidence model to age- and time-specific prevalence data.

    abstract::We discuss the assessment of age- and time-specific disease incidence using prevalence data. A method is described for conveniently fitting a discrete-time multiplicative model, subject to positivity constraints, using the EM-algorithm. Together with smoothing, it allows essentially nonparametric assessment of inciden...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Marschner IC

    update_date:1996-06-01 00:00:00

  • Tests for monotone mean residual life, using randomly censored data.

    abstract::At any age the mean residual life function gives the expected remaining life at that age. Reliabilists and biometricians have found it useful to categorize failure distributions by the monotonicity properties of the mean residual life function. Hollander and Proschan (1975, Biometrika 62, 585-593) have derived tests o...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Chen YY,Hollander M,Langberg NA

    update_date:1983-03-01 00:00:00

  • On symmetric semiparametric two-sample problem.

    abstract::We consider a two-sample problem where data come from symmetric distributions. Usual two-sample data with only magnitudes recorded, arising from case-control studies or logistic discriminant analyses, may constitute a symmetric two-sample problem. We propose a semiparametric model such that, in addition to symmetry, t...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13233

    authors: Li M,Diao G,Qin J

    update_date:2020-12-01 00:00:00

  • Analysis of longitudinal data in the presence of informative observational times and a dependent terminal event, with application to medical cost data.

    abstract::In longitudinal observational studies, repeated measures are often taken at informative observation times. Also, there may exist a dependent terminal event such as death that stops the follow-up. For example, patients in poorer health are more likely to seek medical treatment and their medical cost for each visit tend...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2007.00954.x

    authors: Liu L,Huang X,O'Quigley J

    update_date:2008-09-01 00:00:00

  • Comparison of different methods for decision-making in bioequivalence assessment.

    abstract::If the regulatory requirements are symmetrical, the use of symmetrical confidence intervals as a decision rule for bioequivalence assessment leads, as shown by simulations, to better level properties and an inferior power compared to a rule based on shortest confidence intervals. A choice between these two approaches ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Mandallaz D,Mau J

    update_date:1981-06-01 00:00:00

  • Assessing the goodness-of-fit of hidden Markov models.

    abstract::In this article, we propose a graphical technique for assessing the goodness-of-fit of a stationary hidden Markov model (HMM). We show that plots of the estimated distribution against the empirical distribution detect lack of fit with high probability for large sample sizes. By considering plots of the univariate and ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00189.x

    authors: MacKay Altman R

    update_date:2004-06-01 00:00:00

  • A new method to explore the distribution of interindividual random effects in non-linear mixed effects models.

    abstract::This article presents a new approach for exploring the distribution of interindividual random effects in nonlinear mixed effect models. The approach introduces a spline function, which transforms an assumed normally distributed interindividual random effect to an arbitrary distribution approximating that of the data. ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Fattinger KE,Sheiner LB,Verotta D

    update_date:1995-12-01 00:00:00

  • Capture-recapture when time and behavioral response affect capture probabilities.

    abstract::We consider a capture-recapture model in which capture probabilities vary with time and with behavioral response. Two inference procedures are developed under the assumption that recapture probabilities bear a constant relationship to initial capture probabilities. These two procedures are the maximum likelihood metho...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.00427.x

    authors: Chao A,Chu W,Hsu CH

    update_date:2000-06-01 00:00:00

  • Confidence intervals for the conditional probability of misallocation in discriminant analysis.

    abstract::In this study we are concerned with the construction of confidence intervals for the conditional probability of misallocation associated with Anderson's classification statistics, W. The available methods of computing confidence intervals for the conditional probability are not satisfactory in practice, mainly because...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: McLachlan GJ

    update_date:1975-03-01 00:00:00

  • On pooling across strata when frequency matching has been followed in a cohort study.

    abstract::In a study designed to assess the relationship between a dichotomous exposure and the eventual occurrence of a dichotomous outcome, frequency matching has been proposed as a way to balance the exposure cohorts with respect to the sampling distribution of potential confounding factors. This paper discusses the pooled e...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Weinberg CR

    update_date:1985-03-01 00:00:00

  • Assessing reproducibility by the within-subject coefficient of variation with random effects models.

    abstract::In this paper we consider the use of within-subject coefficient of variation (WCV) for assessing the reproducibility or reliability of a measurement. Application to assessing reproducibility of biochemical markers for measuring bone turnover is described and the comparison with intraclass correlation is discussed. Bot...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Quan H,Shih WJ

    update_date:1996-12-01 00:00:00

  • Bayesian estimation of the probability of asbestos exposure from lung fiber counts.

    abstract::Asbestos exposure is a well-known risk factor for various lung diseases, and when they occur, workmen's compensation boards need to make decisions concerning the probability the cause is work related. In the absence of a definitive work history, measures of short and long asbestos fibers as well as counts of asbestos ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01279.x

    authors: Weichenthal S,Joseph L,Bélisle P,Dufresne A

    update_date:2010-06-01 00:00:00

  • Bias in estimating association parameters for longitudinal binary responses with drop-outs.

    abstract::This paper considers the impact of bias in the estimation of the association parameters for longitudinal binary responses when there are drop-outs. A number of different estimating equation approaches are considered for the case where drop-out cannot be assumed to be a completely random process. In particular, standar...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00015.x

    authors: Fitzmaurice GM,Lipsitz SR,Molenberghs G,Ibrahim JG

    update_date:2001-03-01 00:00:00

  • Connecting the latent multinomial.

    abstract::Link et al. (2010, Biometrics 66, 178-185) define a general framework for analyzing capture-recapture data with potential misidentifications. In this framework, the observed vector of counts, y, is considered as a linear function of a vector of latent counts, x, such that y=Ax, with x assumed to follow a multinomial d...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12333

    authors: Schofield MR,Bonner SJ

    update_date:2015-12-01 00:00:00

  • Sensitivity analysis: distributional assumptions and confounding assumptions.

    abstract::In a presentation of various methods for assessing the sensitivity of regression results to unmeasured confounding, Lin, Psaty, and Kronmal (1998, Biometrics 54, 948-963) use a conditional independence assumption to derive algebraic relationships between the true exposure effect and the apparent exposure effect in a re...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.01024.x

    authors: Vanderweele TJ

    update_date:2008-06-01 00:00:00

  • Sparse generalized eigenvalue problem with application to canonical correlation analysis for integrative analysis of methylation and gene expression data.

    abstract::We present a method for individual and integrative analysis of high dimension, low sample size data that capitalizes on the recurring theme in multivariate analysis of projecting higher dimensional data onto a few meaningful directions that are solutions to a generalized eigenvalue problem. We propose a general framew...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12886

    authors: Safo SE,Ahn J,Jeon Y,Jung S

    update_date:2018-12-01 00:00:00

  • Stability of population growth determined by 2 X 2 Leslie matrix with density-dependent elements.

    abstract::The matrix considered contains four elements, each a function of total number. A special case for which the matrix may be appropriate is when the population may be divided into juveniles and adults, and the survival rates and fecundity are the same for all members of each group. This is true, at least approximately, f...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Cooke D,Leon JA

    update_date:1976-06-01 00:00:00