Abstract:
Identifying disease-associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high-throughput methylation profiles at single-base resolution of DNA. However, optimally modeling and analyzing these sparse and discrete sequencing data is still very challenging due to variable read depth, missing data patterns, long-range correlations, data errors, and confounding from cell type mixtures. We propose a regression-based hierarchical model that allows covariate effects to vary smoothly along genomic positions, and we have built a specialized EM algorithm, which explicitly allows for experimental errors and cell type mixtures, to make inference about smooth covariate effects in the model. Simulations show that the proposed method provides accurate estimates of covariate effects and captures the major underlying methylation patterns with excellent power. We also apply our method to analyze data from rheumatoid arthritis patients and controls. The method has been implemented in the R package SOMNiBUS.
journal_name: Biometrics
journal_title: Biometrics
authors: Zhao K, Oualkacha K, Lakhal-Chaieb L, Labbe A, Klein K, Ciampi A, Hudson M, Colmegna I, Pastinen T, Zhang T, Daley D, Greenwood CMT
doi: 10.1111/biom.13307
subject: Has Abstract
pub_date: 2020-05-21 00:00:00
eissn: 1541-0420
issn: 0006-341X
pub_type: Journal article
Related literature
BIOMETRICS literature collection
abstract::We investigate two-stage parametric and two-stage semi-parametric estimation procedures for the association parameter in copula models for bivariate survival data where censoring in either or both components is allowed. We derive asymptotic properties of the estimators and compare their performance by simulations. Bot...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1995-12-01 00:00:00
abstract::This note discusses the use of capture-mark-recapture methods and log-linear, linear, product models for incomplete tables (Espeland, 1986, Communications in Statistics--Simulation and Computing 15, 405-424) to estimate completeness of reporting in a disease registry and to estimate incompleteness-adjusted incidence r...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1992-12-01 00:00:00
abstract::Prentice and Sheppard (1995, Biometrika 82, 113-125) proposed a method for estimating relative risks associated with poorly measured exposures using disease rates from multiple populations and exposure and confounding factor data from sample surveys of persons in each population. The method involved an assumption of i...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1998-12-01 00:00:00
abstract::Detecting changes in longitudinal data is important in medical research. However, the existence of measurement outliers can cause an unexpected increase in the false alarm rate in claiming changes. To reduce the outliers, a new method has been developed. In this scheme, two measures are initially taken and, if they ar...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1994-03-01 00:00:00
abstract::Three properties of interest in bioavailability studies using compartmental models are the area under the concentration curve, the maximum concentration, and the time to maximum concentration. Methods are described for finding designs that minimize the variance of the estimates of these quantities in such a model. The...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1993-06-01 00:00:00
abstract::It is not, in general, possible to include all relevant risk factors in a model of survival or disease incidence. This heterogeneity must be accounted for in the interpretation, as it can imply otherwise unexpected results. This is illustrated by diabetic nephropathy, a serious complication experienced by some diabeti...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1994-12-01 00:00:00
abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/biom.13205
update_date: 2020-09-01 00:00:00
abstract::In spatial surveys for estimating the density of objects in a survey region, systematic designs will generally yield lower variance than random designs. However, estimating the systematic variance is well known to be a difficult problem. Existing methods tend to overestimate the variance, so although the variance is g...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.1541-0420.2011.01604.x
update_date: 2011-12-01 00:00:00
abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.0006-341x.2001.00598.x
update_date: 2001-06-01 00:00:00
abstract::I discuss diagnostic methods for discriminant analysis. The equivalence with linear regression is noted and regression diagnostics are considered. The leverage is a function of the linear discriminant function and the Mahalanobis distance of the observation from the group mean. The distribution of this distance is app...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1997-12-01 00:00:00
abstract::We consider the problem of comparing two outcome measures when the pairs are clustered. Using the general principle of within-cluster resampling, we obtain a novel signed-rank test for clustered paired data. We show by a simple informative cluster size simulation model that only our test maintains the correct size und...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.1541-0420.2007.00923.x
update_date: 2008-06-01 00:00:00
abstract::An estimator proposed by Greenland and Holland (1991, Biometrics 47, 319-322) for a standardized risk difference parameter is shown to be a maximum likelihood estimator if the consistent estimator of the common odds ratio is appropriately chosen. The statistical problem under consideration is reparameterized. Likeliho...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1992-09-01 00:00:00
abstract::We propose methods for Bayesian inference for a new class of semiparametric survival models with a cure fraction. Specifically, we propose a semiparametric cure rate model with a smoothing parameter that controls the degree of parametricity in the right tail of the survival distribution. We show that such a parameter ...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.0006-341x.2001.00383.x
update_date: 2001-06-01 00:00:00
abstract::A stochastic model is presented for the analysis of incomplete repeated-measures experiments. The general linear model is used to relate the response measures to other variables which are thought to account for inherent variation; an autoregressive moving average (ARMA) time series representation is used to model dist...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1989-03-01 00:00:00
abstract::Geographic information about the levels of toxics in environmental media is commonly used in regional environmental health studies when direct measurements of personal exposure is limited or unavailable. In this article, we propose a statistical framework for analyzing the spatial distribution of topsoil geochemical p...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.1541-0420.2008.01041.x
update_date: 2009-03-01 00:00:00
abstract::The first part of the article reviews the Data Augmentation algorithm and presents two approximations to the Data Augmentation algorithm for the analysis of missing-data problems: the Poor Man's Data Augmentation algorithm and the Asymptotic Data Augmentation algorithm. These two algorithms are then implemented in the...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1991-12-01 00:00:00
abstract::In many medical experiments, data are collected across time, over a number of similar trials, or over a number of experimental units. As is the case of neuron spike train studies, these data may be in the form of counts of events per unit of time. These counts may be correlated within each trial. It is often of intere...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1998-03-01 00:00:00
abstract::In a period starting around 2007, the Hand, Foot, and Mouth Disease (HFMD) became wide-spreading in China, and the Chinese public health was seriously threatened. To prevent the outbreak of infectious diseases like HFMD, effective disease surveillance systems would be especially helpful to give signals of disease outb...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/biom.12301
update_date: 2015-09-01 00:00:00
abstract::There is a simple robust variance estimator for cluster-correlated data. While this estimator is well known, it is poorly documented, and its wide range of applicability is often not understood. The estimator is widely used in sample survey research, but the results in the sample survey literature are not easily appli...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.0006-341x.2000.00645.x
update_date: 2000-06-01 00:00:00
abstract::Missing values appear very often in many applications, but the problem of missing values has not received much attention in testing order-restricted alternatives. Under the missing at random (MAR) assumption, we impute the missing values nonparametrically using kernel regression. For data with imputation, the classica...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/biom.12658
update_date: 2017-09-01 00:00:00
abstract::We describe an adaptive Bayesian design for a clinical trial of an experimental treatment for patients with hematologic malignancies who initially received an allogeneic bone marrow transplant but subsequently suffered a disease recurrence. Treatment consists of up to two courses of targeted immunotherapy followed by ...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.0006-341x.2002.00560.x
update_date: 2002-09-01 00:00:00
abstract::This article presents a new approach for exploring the distribution of interindividual random effects in nonlinear mixed effect models. The approach introduces a spline function, which transforms an assumed normally distributed interindividual random effect to an arbitrary distribution approximating that of the data. ...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1995-12-01 00:00:00
abstract::We study publication bias in meta-analysis by supposing there is a population (y, sigma) of studies which give treatment effect estimates y approximately N(theta, sigma(2)). A selection function describes the probability that each study is selected for review. The overall estimate of theta depends on the studies selec...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.1541-0420.2006.00705.x
update_date: 2007-06-01 00:00:00
abstract::We introduce two test procedures for comparing two survival distributions on the basis of randomly right-censored data consisting of both paired and unpaired observations. Our procedures are based on generalizations of a pooled rank test statistic previously proposed for uncensored data. One generalization adapts the ...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.0006-341x.2000.00154.x
update_date: 2000-03-01 00:00:00
abstract::A considerable body of literature has arisen over the past 15 years for analyzing univariate repeated measures data. However, it is rare in applied biomedical research for interest to be restricted to a single outcome measure. In this paper, we consider the case of bivariate repeated measures. We apply a generalized e...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1996-06-01 00:00:00
abstract::The fitting of finite mixture models via the EM algorithm is considered for data which are available only in grouped form and which may also be truncated. A practical example is presented where a mixture of two doubly truncated log-normal distributions is adopted to model the distribution of the volume of red blood ce...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1988-06-01 00:00:00
abstract::In longitudinal observational studies, repeated measures are often taken at informative observation times. Also, there may exist a dependent terminal event such as death that stops the follow-up. For example, patients in poorer health are more likely to seek medical treatment and their medical cost for each visit tend...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.1541-0420.2007.00954.x
update_date: 2008-09-01 00:00:00
abstract::In this article, we describe a Bayesian approach to the calibration of a stochastic computer model of chemical kinetics. As with many applications in the biological sciences, the data available to calibrate the model come from different sources. Furthermore, these data appear to provide somewhat conflicting informatio...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.1541-0420.2009.01245.x
update_date: 2010-03-01 00:00:00
abstract::A new test using incidence data is developed for testing whether two or more groups have the same seasonal pattern. The method fits sine waves to the data with a fundamental period of one cycle per year, and has the possibility of using higher harmonics, when necessary, to adequately model the data. The seasonal patte...
journal_title:Biometrics
pub_type: Journal article
doi:
update_date: 1988-12-01 00:00:00
abstract::The use of the uniformly most powerful among the unbiased (UMPU) test was recently suggested for the study of gametic association between two polymorphic loci as an alternative to the Fisher's exact test (Zapata and Alvarez, 1997, Annals of Human Genetics 61, 71-77). However, the proposed test is not UMPU for two-side...
journal_title:Biometrics
pub_type: Journal article
doi:10.1111/j.0006-341x.2001.00535.x
update_date: 2001-06-01 00:00:00