Abstract:
Testing for association between two random vectors is a common and important task in many fields. However, existing tests, such as Escoufier's RV test, are suitable only for low-dimensional data, not for high-dimensional data. In moderate to high dimensions, it is necessary to consider sparse signals, in which only a few of the variables are expected to be associated with each other. We generalize the RV test to moderate-to-high dimensions. The key idea is to weight each variable pair data-adaptively based on its empirical association. As a consequence, the proposed test is adaptive, alleviating the effects of noise accumulation in high-dimensional data and thus maintaining power under both dense and sparse alternative hypotheses. We show the connections between the proposed test and several existing tests, such as a generalized estimating equations-based adaptive test, multivariate kernel machine regression (KMR), and kernel distance methods. Furthermore, we modify the proposed adaptive test so that it can be powerful for nonlinear or nonmonotonic associations. We use both real and simulated data to demonstrate the advantages and usefulness of the proposed test. The new test is freely available in the R package aSPC on CRAN at https://cran.r-project.org/web/packages/aSPC/index.html and https://github.com/jasonzyx/aSPC.
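The weighting idea in the abstract can be illustrated with a rough sketch. The code below is not the aSPC package's implementation. It assumes one plausible reading of "adaptive weighting": each pairwise sample correlation is raised to a power gamma, so larger gamma values emphasize the strongest pairs (sparse signals) and smaller ones spread weight evenly (dense signals). The function names, the candidate powers in `gammas`, and the permutation-based calibration are all assumptions made for illustration. Escoufier's RV coefficient is included for comparison.

```python
import numpy as np


def rv_coefficient(x, y):
    """Escoufier's RV coefficient between two data matrices (rows = samples)."""
    x = x - x.mean(axis=0)
    y = y - y.mean(axis=0)
    sxy = x.T @ y
    sxx = x.T @ x
    syy = y.T @ y
    return np.trace(sxy @ sxy.T) / np.sqrt(np.trace(sxx @ sxx) * np.trace(syy @ syy))


def powered_correlation_stat(x, y, gamma):
    """Sum over all variable pairs of the sample correlation raised to gamma."""
    xs = (x - x.mean(axis=0)) / x.std(axis=0)
    ys = (y - y.mean(axis=0)) / y.std(axis=0)
    r = xs.T @ ys / x.shape[0]  # p-by-q matrix of Pearson correlations
    return np.sum(r ** gamma)


def adaptive_association_test(x, y, gammas=(1, 2, 4, 8), n_perm=200, seed=0):
    """Permutation p-value for the minimum p-value across candidate powers.

    Hypothetical sketch: the set of powers and the calibration scheme are
    illustrative assumptions, not taken from the source.
    """
    rng = np.random.default_rng(seed)
    n = x.shape[0]
    # Row 0 holds the observed statistics; rows 1..n_perm hold permuted ones.
    stats = np.empty((n_perm + 1, len(gammas)))
    for b in range(n_perm + 1):
        yb = y if b == 0 else y[rng.permutation(n)]
        stats[b] = [abs(powered_correlation_stat(x, yb, g)) for g in gammas]
    # pvals[i, g]: fraction of rows whose statistic is >= that of row i.
    pvals = (stats[None, :, :] >= stats[:, None, :]).mean(axis=1)
    # Take the smallest p-value over powers, then calibrate it with the
    # same permutations so the final p-value accounts for the selection.
    min_p = pvals.min(axis=1)
    return (min_p <= min_p[0]).mean()
```

Calling `adaptive_association_test(x, y)` with two sample-by-variable arrays that share the same number of rows returns a single p-value. Reusing the same permutations for both the per-power p-values and the final calibration avoids a nested permutation loop.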
journal_name:Genet Epidemiol
journal_title:Genetic epidemiology
authors:Xu Z, Xu G, Pan W, Alzheimer's Disease Neuroimaging Initiative.
doi:10.1002/gepi.22059
subject:Has Abstract
pub_date:2017-11-01 00:00:00
pages:599-609
issue:7
eissn:0741-0395
issn:1098-2272
journal_volume:41
pub_type: journal article
abstract::Path analysis of nuclear family data has been widely applied to resolve genetic and environmental sources of familial resemblance. Here we report the results of a systematic evaluation of the effects of departures from five modeling assumptions often made when analyzing nuclear family data; i) the observed environment...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370060207
update_date:1989-01-01 00:00:00
abstract::In genetic association studies, a single marker is often associated with multiple, correlated phenotypes (e.g., obesity and cardiovascular disease, or nicotine dependence and lung cancer). A pervasive question is then whether that marker exerts independent effects on all phenotypes. In this paper, we address this ques...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.21660
update_date:2012-09-01 00:00:00
abstract::Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics of Alcoholism (COG...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20120
update_date:2005-01-01 00:00:00
abstract::As part of Genetic Analysis Workshop 17 (GAW17), our group considered the application of novel and standard approaches to the analysis of genotype-phenotype association in next-generation sequencing data. Our group identified a major issue in the analysis of the GAW17 next-generation sequencing data: type I error and ...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20650
update_date:2011-01-01 00:00:00
abstract::Using a recently developed semiparametric method for combined linkage/linkage-disequilibrium analysis, we analyzed the Collaborative Study on the Genetics of Alcoholism data subset developed for Genetic Analysis Workshop 11 (GAW11). This semiparametric approach estimates recombination fractions for linkage, marker log...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370170708
update_date:1999-01-01 00:00:00
abstract::Meta-analysis has been little explored to make an overall assessment of linkage from different studies. In practice, it is likely that published linkage studies will only report p-values. We compared the performance of the widely used Fisher method for combining p-values with that of pooling raw data. More loci were c...
journal_title:Genetic epidemiology
pub_type: journal article, meta-analysis
doi:10.1002/gepi.1370170798
update_date:1999-01-01 00:00:00
abstract::Jewish women have been reported to have a higher risk for familial breast cancer than non-Jewish women and to be more likely to carry mutations in breast cancer genes such as BRCA1. Because BRCA1 mutations also increase women's risk for ovarian cancer, we asked whether Jewish women are at higher risk for familial ovar...
journal_title:Genetic epidemiology
pub_type: clinical trial, journal article, randomized controlled trial
doi:10.1002/(SICI)1098-2272(1998)15:1<51::AID-GEPI4>3.
update_date:1998-01-01 00:00:00
abstract::The availability of high-density haplotype data has motivated several fine-scale linkage disequilibrium mapping methods for locating disease-causing mutations. These methods identify loci around which haplotypes of case chromosomes exhibit greater similarity than do those of control chromosomes. A difficulty arising i...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20016
update_date:2004-11-01 00:00:00
abstract::In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that single marker methods ...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.21746
update_date:2013-09-01 00:00:00
abstract::Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousan...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20482
update_date:2010-05-01 00:00:00
abstract::Clinical trial results have recently demonstrated that inhibiting inflammation by targeting the interleukin-1β pathway can offer a significant reduction in lung cancer incidence and mortality, highlighting a pressing and unmet need to understand the benefits of inflammation-focused lung cancer therapies at the genetic...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.22358
update_date:2021-02-01 00:00:00
abstract::Kin-cohort design can be used to study the effect of a genetic mutation on the risk of multiple events, using the same study. In this design, the outcome data consist of the event history of the relatives of a sample of genotyped subjects. Existing methods for kin-cohort estimation allow estimation of the risk of one ...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.10269
update_date:2003-12-01 00:00:00
abstract::Contributions to Group 17 of the Genetic Analysis Workshop 15 considered dense markers in linkage disequilibrium (LD) in the context of either linkage or association analysis. Three contributions reported on methods for modeling LD or selecting a subset of markers in linkage equilibrium to perform linkage analysis. Wh...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20291
update_date:2007-01-01 00:00:00
abstract::Multilocus linkage disequilibrium (LD) tests that consider inter-marker LD are more powerful than single-locus tests when disease etiology is contributed simultaneously by several linked and correlated loci. However, inclusion of redundant non-informative markers may result in reduced testing power and/or inflated f...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20165
update_date:2006-09-01 00:00:00
abstract::Variable selection is growing in importance with the advent of high throughput genotyping methods requiring analysis of hundreds to thousands of single nucleotide polymorphisms (SNPs) and the increased interest in using these genetic studies to better understand common, complex diseases. Up to now, the standard approa...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20353
update_date:2009-01-01 00:00:00
abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.21740
update_date:2013-09-01 00:00:00
abstract::Genome-wide association studies of discrete traits generally use simple methods of analysis based on chi-square tests for contingency tables or logistic regression, at least for an initial scan of the entire genome. Nevertheless, more power might be obtained by using various methods that analyze multiple markers in combin...
journal_title:Genetic epidemiology
pub_type:
doi:10.1002/gepi.20465
update_date:2009-01-01 00:00:00
abstract::Variance component linkage analysis is commonly used to map quantitative trait loci (QTLs) in general pedigrees. Large pedigrees are especially attractive for these studies because they provide greater power per genotyped individual than small pedigrees. We propose accurate and computationally efficient methods to cal...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20160
update_date:2006-09-01 00:00:00
abstract::A computer-simulation method is presented for determining and correcting for the effect of maximizing the lod score over disease definitions, penetrance values, and perhaps other model parameters. The method consists of simulating the complete analysis using marker genotypes randomly generated under the assumption of ...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370070402
update_date:1990-01-01 00:00:00
abstract::Apolipoprotein A-IV (APO A-IV) is a major protein component of mesenteric lymph chylomicrons and very-low-density lipoproteins. It is found in plasma predominantly unassociated with major lipoprotein fractions and in high density lipoproteins. APO A-IV exhibits structural heterogeneity owing to two codominant alleles,...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370060404
update_date:1989-01-01 00:00:00
abstract::Smalley et al. [(1992) Genet Epidemiol 9:333-345] found evidence of a mixture of two distributions in memory performance among offspring of patients with dementia of the Alzheimer type (DAT), suggesting that these groups reflect genotypic subgroups of carriers and non-carriers of a putative DAT gene. One prediction of...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370110506
update_date:1994-01-01 00:00:00
abstract::Investigators interested in whether a disease aggregates in families often collect case-control family data, which consist of disease status and covariate information for members of families selected via case or control probands. Here, we focus on the use of case-control family data to investigate the relative contrib...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20454
update_date:2010-04-01 00:00:00
abstract::Artificial neural networks were applied to the alcoholism data to reveal nonlinear relationships between intermediate phenotypes, marker identity-by-descent sharing, and the affection status. A variable number of hidden units were considered to achieve a balance between the minimal mean-squared error and over-fitting ...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370170738
update_date:1999-01-01 00:00:00
abstract::We used a case-control design to scan the genome for any associations between genetic markers and disease susceptibility loci using the first two replicates of the Mycenaean population from the GAW11 (Problem 2) data. Using a case-control approach, we constructed a series of 2-by-3 tables for each allele of every mark...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.13701707128
update_date:1999-01-01 00:00:00
abstract::Our goal was to detect genes contributing to the P300 component of the event related potential (ERP). We found that all of the ERP traits were highly correlated. Most of them distinguished alcoholics from nonalcoholics. To have one summary variable for the ERP traits, we calculated the first principal component (PRIN1...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370170728
update_date:1999-01-01 00:00:00
abstract::A genetic epidemiologic investigation of breast cancer involving 389 breast cancer pedigrees including information on 14,721 individuals from the Icelandic population-based cancer registry is presented. Probands were women born in or after 1920 and reported to have breast cancer in the cancer registry. The average age...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/(SICI)1098-2272(200001)18:1<81::AID-GEPI6>
update_date:2000-01-01 00:00:00
abstract::The restricted partition method (RPM) is a partitioning algorithm for examining multi-locus genotypes as (potentially non-additive) predictors of a quantitative trait. The motivating application was to develop a robust method to examine quantitative phenotypes for epistasis (gene-gene interactions), but the method can...
journal_title:Genetic epidemiology
pub_type: journal article, review
doi:10.1002/gepi.20006
update_date:2004-09-01 00:00:00
abstract::With new technologies, multiple types of genomic data are commonly collected on a single set of samples. However, standard analysis methods concentrate on a single data type at a time and ignore the relationships between genes, proteins, and biochemical reactions that give rise to complex phenotypes. In this paper, we...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.21628
update_date:2012-05-01 00:00:00
abstract::Genetic epidemiology is a relatively new discipline that seeks to unravel the role of genetic factors and their interactions with environmental factors in the etiology of diseases, using population and family study approaches. To characterize the overall direction and emphasis of research strategies used in this field...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.1370100505
update_date:1993-01-01 00:00:00
abstract::Genetic association studies of obstetric complications may genotype case and control mothers, or their respective newborns, or both case-control mothers and their children. The relatively high prevalence of many obstetric complications and the availability of both maternal and offspring's genotype data have provided m...
journal_title:Genetic epidemiology
pub_type: journal article
doi:10.1002/gepi.20406
update_date:2009-09-01 00:00:00