Multilevel functional clustering analysis.

Abstract:

:In this article, we investigate clustering methods for multilevel functional data, which consist of repeated random functions observed for a large number of units (e.g., genes) at multiple subunits (e.g., bacteria types). To describe the within- and between variability induced by the hierarchical structure in the data, we take a multilevel functional principal component analysis (MFPCA) approach. We develop and compare a hard clustering method applied to the scores derived from the MFPCA and a soft clustering method using an MFPCA decomposition. In a simulation study, we assess the estimation accuracy of the clustering membership and the cluster patterns under a series of settings: small versus moderate number of time points; various noise levels; and varying number of subunits per unit. We demonstrate the applicability of the clustering analysis to a real data set consisting of expression profiles from genes activated by immunity system cells. Prevalent response patterns are identified by clustering the expression profiles using our multilevel clustering analysis.

journal_name

Biometrics

journal_title

Biometrics

authors

Serban N,Jiang H

doi

10.1111/j.1541-0420.2011.01714.x

subject

Has Abstract

pub_date

2012-09-01 00:00:00

pages

805-14

issue

3

eissn

0006-341X

issn

1541-0420

journal_volume

68

pub_type

杂志文章
  • Regression analysis of multivariate grouped survival data.

    abstract::Multivariate failure time data arise when each study subject may experience several types of event or when there are clusterings of observational units such that failure times within the same cluster are correlated. The failure times are often subject to interval grouping or have truly discrete measurements. In this p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Guo SW,Lin DY

    更新日期:1994-09-01 00:00:00

  • Extension of the rank sum test for clustered data: two-group comparisons with group membership defined at the subunit level.

    abstract::The Wilcoxon rank sum test is widely used for two-group comparisons for nonnormal data. An assumption of this test is independence of sampling units both between and within groups. In ophthalmology, data are often collected on two eyes of an individual, which are highly correlated. In ophthalmological clinical trials,...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00582.x

    authors: Rosner B,Glynn RJ,Lee ML

    更新日期:2006-12-01 00:00:00

  • Capture-recapture estimation using finite mixtures of arbitrary dimension.

    abstract::Reversible jump Markov chain Monte Carlo (RJMCMC) methods are used to fit Bayesian capture-recapture models incorporating heterogeneity in individuals and samples. Heterogeneity in capture probabilities comes from finite mixtures and/or fixed sample effects allowing for interactions. Estimation by RJMCMC allows automa...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01289.x

    authors: Arnold R,Hayakawa Y,Yip P

    更新日期:2010-06-01 00:00:00

  • Modeling of time trends and interactions in vital rates using restricted regression splines.

    abstract::For the analysis of time trends in incidence and mortality rates, the age-period-cohort (apc) model has became a widely accepted method. The considered data are arranged in a two-way table by age group and calendar period, which are mostly subdivided into 5- or 10-year intervals. The disadvantage of this approach is t...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Heuer C

    更新日期:1997-03-01 00:00:00

  • A test of homogeneity of distributions when observations are subject to measurement errors.

    abstract::When the observed data are contaminated with errors, the standard two-sample testing approaches that ignore measurement errors may produce misleading results, including a higher type-I error rate than the nominal level. To tackle this inconsistency, a nonparametric test is proposed for testing equality of two distribu...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13207

    authors: Lee D,Lahiri SN,Sinha S

    更新日期:2020-09-01 00:00:00

  • Semiparametric maximum likelihood for nonlinear regression with measurement errors.

    abstract::This article demonstrates semiparametric maximum likelihood estimation of a nonlinear growth model for fish lengths using imprecisely measured ages. Data on the species corvina reina, found in the Gulf of Nicoya, Costa Rica, consist of lengths and imprecise ages for 168 fish and precise ages for a subset of 16 fish. T...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2002.00448.x

    authors: Suh EY,Schafer DW

    更新日期:2002-06-01 00:00:00

  • Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.

    abstract::We consider a semiparametric regression model that relates a normal outcome to covariates and a genetic pathway, where the covariate effects are modeled parametrically and the pathway effect of multiple gene expressions is modeled parametrically or nonparametrically using least-squares kernel machines (LSKMs). This un...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2007.00799.x

    authors: Liu D,Lin X,Ghosh D

    更新日期:2007-12-01 00:00:00

  • Sparse generalized eigenvalue problem with application to canonical correlation analysis for integrative analysis of methylation and gene expression data.

    abstract::We present a method for individual and integrative analysis of high dimension, low sample size data that capitalizes on the recurring theme in multivariate analysis of projecting higher dimensional data onto a few meaningful directions that are solutions to a generalized eigenvalue problem. We propose a general framew...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12886

    authors: Safo SE,Ahn J,Jeon Y,Jung S

    更新日期:2018-12-01 00:00:00

  • Models for circular-linear and circular-circular data constructed from circular distributions based on nonnegative trigonometric sums.

    abstract::Johnson and Wehrly (1978, Journal of the American Statistical Association 73, 602-606) and Wehrly and Johnson (1980, Biometrika 67, 255-256) show one way to construct the joint distribution of a circular and a linear random variable, or the joint distribution of a pair of circular random variables from their marginal ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00716.x

    authors: Fernández-Durán JJ

    更新日期:2007-06-01 00:00:00

  • Estimating the ventilation-perfusion distribution: an ill-posed integral equation problem.

    abstract::The distribution of ventilation-perfusion ratio over the lung is a useful indicator of the efficiency of lung function. Information about this distribution can be obtained by observing the retention in blood of inert gases passed through the lung. These retentions are related to the ventilation-perfusion distribution ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lim LL,Whitehead J

    更新日期:1992-03-01 00:00:00

  • Model selection and inference for censored lifetime medical expenditures.

    abstract::Identifying factors associated with increased medical cost is important for many micro- and macro-institutions, including the national economy and public health, insurers and the insured. However, assembling comprehensive national databases that include both the cost and individual-level predictors can prove challengi...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12464

    authors: Johnson BA,Long Q,Huang Y,Chansky K,Redman M

    更新日期:2016-09-01 00:00:00

  • Bayesian partitioning for modeling and mapping spatial case-control data.

    abstract::Methods for modeling and mapping spatial variation in disease risk continue to motivate much research. In particular, spatial analyses provide a useful tool for exploring geographical heterogeneity in health outcomes, and consequently can yield clues as to disease etiology, direct public health management, and generat...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01193.x

    authors: Costain DA

    更新日期:2009-12-01 00:00:00

  • Buckley-James-type estimator with right-censored and length-biased data.

    abstract::We present a natural generalization of the Buckley-James-type estimator for traditional survival data to right-censored length-biased data under the accelerated failure time (AFT) model. Length-biased data are often encountered in prevalent cohort studies and cancer screening trials. Informative right censoring induce...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01568.x

    authors: Ning J,Qin J,Shen Y

    更新日期:2011-12-01 00:00:00

  • Calculating sample size for studies with expected all-or-none nonadherence and selection bias.

    abstract:SUMMARY:We develop sample size formulas for studies aiming to test mean differences between a treatment and control group when all-or-none nonadherence (noncompliance) and selection bias are expected. Recent work by Fay, Halloran, and Follmann (2007, Biometrics 63, 465-474) addressed the increased variances within grou...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01114.x

    authors: Shardell MD,El-Kamary SS

    更新日期:2009-06-01 00:00:00

  • Regression calibration in semiparametric accelerated failure time models.

    abstract::In large cohort studies, it often happens that some covariates are expensive to measure and hence only measured on a validation set. On the other hand, relatively cheap but error-prone measurements of the covariates are available for all subjects. Regression calibration (RC) estimation method (Prentice, 1982, Biometri...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01295.x

    authors: Yu M,Nan B

    更新日期:2010-06-01 00:00:00

  • A new method to explore the distribution of interindividual random effects in non-linear mixed effects models.

    abstract::This article presents a new approach for exploring the distribution of interindividual random effects in nonlinear mixed effect models. The approach introduces a spline function, which transforms an assumed normally distributed interindividual random effect to an arbitrary distribution approximating that of the data. ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Fattinger KE,Sheiner LB,Verotta D

    更新日期:1995-12-01 00:00:00

  • An empirical Bayes' approach to joint analysis of multiple microarray gene expression studies.

    abstract::With the prevalence of gene expression studies and the relatively low reproducibility caused by insufficient sample sizes, it is natural to consider joint analysis that could combine data from different experiments effectively to achieve improved accuracy. We present in this article a model-based approach for better i...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01602.x

    authors: Ruan L,Yuan M

    更新日期:2011-12-01 00:00:00

  • Fast Bayesian inference in large Gaussian graphical models.

    abstract::Despite major methodological developments, Bayesian inference in Gaussian graphical models remains challenging in high dimension due to the tremendous size of the model space. This article proposes a method to infer the marginal and conditional independence structures between variables by multiple testing, which bypas...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13064

    authors: Leday GGR,Richardson S

    更新日期:2019-12-01 00:00:00

  • Nonparametric analysis of covariance by matching.

    abstract::The basic problem under consideration is the comparison of treatments with respect to a response Y when a covariable X is taken into account. Various methods involving matching may be regarded as compromises between the standard analysis of covariance and the standard analysis of independent matched pairs. First, ther...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Quade D

    更新日期:1982-09-01 00:00:00

  • Regional spatial modeling of topsoil geochemistry.

    abstract::Geographic information about the levels of toxics in environmental media is commonly used in regional environmental health studies when direct measurements of personal exposure is limited or unavailable. In this article, we propose a statistical framework for analyzing the spatial distribution of topsoil geochemical p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01041.x

    authors: Calder CA,Craigmile PF,Zhang J

    更新日期:2009-03-01 00:00:00

  • Doubly robust estimator for net survival rate in analyses of cancer registry data.

    abstract::Cancer population studies based on cancer registry databases are widely conducted to address various research questions. In general, cancer registry databases do not collect information on cause of death. The net survival rate is defined as the survival rate if a subject would not die for any causes other than cancer....

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12568

    authors: Komukai S,Hattori S

    更新日期:2017-03-01 00:00:00

  • Optimal Bayesian design for patient selection in a clinical study.

    abstract::Bayesian experimental design for a clinical trial involves specifying a utility function that models the purpose of the trial, in this case the selection of patients for a diagnostic test. The best sample of patients is selected by maximizing expected utility. This optimization task poses difficulties due to a high-di...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01156.x

    authors: Buzoianu M,Kadane JB

    更新日期:2009-09-01 00:00:00

  • Case-control analysis with partial knowledge of exposure misclassification probabilities.

    abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00598.x

    authors: Gustafson P,Le ND,Saskin R

    更新日期:2001-06-01 00:00:00

  • Bayesian approaches to joint cure-rate and longitudinal models with applications to cancer vaccine trials.

    abstract::Complex issues arise when investigating the association between longitudinal immunologic measures and time to an event, such as time to relapse, in cancer vaccine trials. Unlike many clinical trials, we may encounter patients who are cured and no longer susceptible to the time-to-event endpoint. If there are cured pat...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/1541-0420.00079

    authors: Brown ER,Ibrahim JG

    更新日期:2003-09-01 00:00:00

  • On identifiability in capture-recapture models.

    abstract::We study the issue of identifiability of mixture models in the context of capture-recapture abundance estimation for closed populations. Such models are used to take account of individual heterogeneity in capture probabilities, but their validity was recently questioned by Link (2003, Biometrics 59, 1123-1130) on the ...

    journal_title:Biometrics

    pub_type: 评论,杂志文章

    doi:10.1111/j.1541-0420.2006.00637_1.x

    authors: Holzmann H,Munk A,Zucchini W

    更新日期:2006-09-01 00:00:00

  • An improved method for measuring heart-rate variability: assessment of cardiac autonomic function.

    abstract::Heart rate oscillates in synchrony with respiration. Several methods have been employed to assess this 'sinus arrhythmia', as an index of autonomic nervous system function. This paper proposes a new, easily computed measure, R, which is relatively resistant to the major nonrespiratory sources of variation, including p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Weinberg CR,Pfeifer MA

    更新日期:1984-09-01 00:00:00

  • Bayesian survival analysis using a MARS model.

    abstract::A Bayesian multivariate adaptive regression spline fitting approach is used to model univariate and multivariate survival data with censoring. The possible models contain the proportional hazards model as a subclass and automatically detect departures from this. A reversible jump Markov chain Monte Carlo algorithm is ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.01071.x

    authors: Mallick BK,Denison DG,Smith AF

    更新日期:1999-12-01 00:00:00

  • Markov models for covariate dependence of binary sequences.

    abstract::Suppose that a heterogeneous group of individuals is followed over time and that each individual can be in state 0 or state 1 at each time point. The sequence of states is assumed to follow a binary Markov chain. In this paper we model the transition probabilities for the 0 to 0 and 1 to 0 transitions by two logistic ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Muenz LR,Rubinstein LV

    更新日期:1985-03-01 00:00:00

  • Type I error robustness of ANOVA and ANOVA on ranks when the number of treatments is large.

    abstract::Agricultural screening trials often involve a large number (t) of treatments in a complete block design with limited replication (b = 3 or 4 blocks). The null hypothesis of interest is that of no differences between treatments. For the commonly used analysis of variance (ANOVA) procedure, most texts do not discuss agr...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Brownie C,Boos DD

    更新日期:1994-06-01 00:00:00

  • Variable selection in canonical discriminant analysis for family studies.

    abstract::In family studies, canonical discriminant analysis can be used to find linear combinations of phenotypes that exhibit high ratios of between-family to within-family variabilities. But with large numbers of phenotypes, canonical discriminant analysis may overfit. To estimate the predicted ratios associated with the coe...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01414.x

    authors: Jin M,Fang Y

    更新日期:2011-03-01 00:00:00