The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases.

Abstract:

Genetic epidemiologists have taken up the challenge of identifying genetic polymorphisms involved in the development of diseases. Many have collected data on large numbers of genetic markers but are not familiar with the available methods for assessing their association with complex diseases. Statistical methods have been developed for analyzing the relation of large numbers of genetic and environmental predictors to disease or disease-related variables in genetic association studies. In this commentary we discuss logistic regression analysis; neural networks, including the parameter decreasing method (PDM) and genetic programming optimized neural networks (GPNN); and several non-parametric methods, which include the set association approach, the combinatorial partitioning method (CPM), the restricted partitioning method (RPM), the multifactor dimensionality reduction (MDR) method and the random forests approach. The relative strengths and weaknesses of these methods are highlighted. Logistic regression and neural networks can handle only a limited number of predictor variables, depending on the number of observations in the dataset. Therefore, they are less useful than the non-parametric methods for association studies with large numbers of predictor variables. GPNN, on the other hand, may be a useful approach for selecting and modeling important predictors, but its ability to select the important effects in the presence of large numbers of predictors needs to be examined. Both the set association approach and the random forests approach can handle a large number of predictors and are useful for reducing them to a subset with an important contribution to disease. The combinatorial methods give more insight into combination patterns for sets of genetic and/or environmental predictor variables that may be related to the outcome variable. As the non-parametric methods have different strengths and weaknesses, we conclude that, for genetic association studies using the case-control design, applying a combination of several methods, including the set association approach, MDR and the random forests approach, will likely be a useful strategy for finding the important genes and interaction patterns involved in complex diseases.
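To make the combinatorial idea behind MDR concrete, the following is a minimal sketch, not the authors' implementation. It assumes genotypes coded 0/1/2, a binary case-control status with both classes present, and only pairwise SNP combinations; the function name `mdr_pair_scores` and the toy data are hypothetical. It labels each genotype combination as high or low risk by comparing its case:control ratio with the overall ratio, then scores each SNP pair by how well that labeling classifies the subjects. The cross-validation and permutation testing normally used with MDR are omitted.

```python
from collections import Counter
from itertools import combinations


def mdr_pair_scores(genotypes, status):
    """Score every SNP pair by MDR-style classification accuracy.

    genotypes: list of per-subject genotype lists (hypothetical 0/1/2 coding).
    status:    list of 1 (case) or 0 (control), one entry per subject.
    Assumes at least one case and one control are present.
    """
    n_cases = sum(status)
    n_controls = len(status) - n_cases
    threshold = n_cases / n_controls  # overall case:control ratio
    scores = {}
    for i, j in combinations(range(len(genotypes[0])), 2):
        cases, controls = Counter(), Counter()
        for g, y in zip(genotypes, status):
            (cases if y == 1 else controls)[(g[i], g[j])] += 1
        # A genotype cell is "high risk" if its case:control ratio
        # exceeds the overall ratio (written without division).
        high_risk = {
            cell
            for cell in set(cases) | set(controls)
            if cases[cell] > threshold * controls[cell]
        }
        # Accuracy of predicting "case" for subjects in high-risk cells.
        correct = sum(
            ((g[i], g[j]) in high_risk) == (y == 1)
            for g, y in zip(genotypes, status)
        )
        scores[(i, j)] = correct / len(status)
    return scores


# Toy data: disease status depends only on the interaction of SNP 0 and SNP 1;
# SNP 2 is unrelated to the outcome.
toy_genotypes = [[a, b, c] for a in (0, 1) for b in (0, 1) for c in (0, 1)]
toy_status = [a ^ b for a, b, _ in toy_genotypes]

scores = mdr_pair_scores(toy_genotypes, toy_status)
best_pair = max(scores, key=scores.get)
print(best_pair, scores[best_pair])
```

In this toy setting neither SNP 0 nor SNP 1 is associated with disease on its own, so a single-marker test would miss both, while the pairwise partition separates cases from controls. Accuracy computed on the same data used to define the cells is optimistic, which is why cross-validation would be needed in a real analysis.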

journal_name

BMC Genet

journal_title

BMC genetics

authors

Heidema AG,Boer JM,Nagelkerke N,Mariman EC,van der A DL,Feskens EJ

doi

10.1186/1471-2156-7-23

subject

Has Abstract

pub_date

2006-04-21 00:00:00

pages

23

issn

1471-2156

pii

1471-2156-7-23

journal_volume

7

pub_type

Editorial
  • A multi-marker test based on family data in genome-wide association study.

    abstract:BACKGROUND:Complex diseases are believed to be the results of many genes and environmental factors. Hence, multi-marker methods that can use the information of markers from different genes are appropriate for mapping complex disease genes. There already have been several multi-marker methods proposed for case-control s...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-8-65

    authors: Zhang Z,Zhang S,Sha Q

    updated:2007-09-25 00:00:00

  • The effect of rare alleles on estimated genomic relationships from whole genome sequence data.

    abstract:BACKGROUND:Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleoti...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-015-0185-0

    authors: Eynard SE,Windig JJ,Leroy G,van Binsbergen R,Calus MP

    updated:2015-03-12 00:00:00

  • Resampling-based tests for Lasso in genome-wide association studies.

    abstract:BACKGROUND:Genome-wide association studies involve detecting association between millions of genetic variants and a trait, which typically use univariate regression to test association between each single variant and the phenotype. Alternatively, Lasso penalized regression allows one to jointly model the relationship b...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-017-0533-3

    authors: Arbet J,McGue M,Chatterjee S,Basu S

    updated:2017-07-24 00:00:00

  • Evidence for paternal DNA transmission to gynogenetic grass carp.

    abstract:BACKGROUND:Grass carp (Ctenopharyngodon idellus, GC), as the highest-output fish in China, is economically important. The production of gynogenetic grass carp (GGC) will provide important germplasm resource for producing improved GC. At present, knowledge regarding the heterologous sperm DNA in gynogenetic offspring is...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-018-0712-x

    authors: Mao Z,Fu Y,Wang Y,Wang S,Zhang M,Gao X,Luo K,Qin Q,Zhang C,Tao M,Yao Z,Liu S

    updated:2019-01-07 00:00:00

  • Conserved genomic organisation of Group B Sox genes in insects.

    abstract:BACKGROUND:Sox domain containing genes are important metazoan transcriptional regulators implicated in a wide range of developmental processes. The vertebrate B subgroup contains the Sox1, Sox2 and Sox3 genes that have early functions in neural development. Previous studies show that Drosophila Group B genes have been f...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-6-26

    authors: McKimmie C,Woerfel G,Russell S

    updated:2005-05-19 00:00:00

  • qDTY12.1: a locus with a consistent effect on grain yield under drought in rice.

    abstract:BACKGROUND:Selection for grain yield under drought is an efficient criterion for improving the drought tolerance of rice. Recently, some drought-tolerant rice varieties have been developed using this selection criterion and successfully released for cultivation in drought-prone target environments. The process can be m...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-14-12

    authors: Mishra KK,Vikram P,Yadaw RB,Swamy BP,Dixit S,Cruz MT,Maturan P,Marker S,Kumar A

    updated:2013-02-26 00:00:00

  • Population study of 1311 C/T polymorphism of Glucose 6 Phosphate Dehydrogenase gene in Pakistan - an analysis of 715 X-chromosomes.

    abstract:BACKGROUND:Nucleotide 1311 polymorphism at exon 11 of G6PD gene is widely prevalent in various populations of the world. The aim of the study was to evaluate 1311 polymorphism in subjects carrying G6PD Mediterranean gene and in general population living in Pakistan. RESULTS:Patients already known to be G6PD deficient ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-10-41

    authors: Moiz B,Nasir A,Moatter T,Naqvi ZA,Khurshid M

    updated:2009-07-30 00:00:00

  • A copy number variation in human NCF1 and its pseudogenes.

    abstract:BACKGROUND:Neutrophil cytosolic factor-1 (NCF1) is a component of NADPH oxidase. The NCF1 gene colocalizes with two pseudogenes (NCF1B and NCF1C). These two pseudogenes have a GT deletion in exon 2, resulting in a frameshift and an early stop codon. Here, we report a copy number variation (CNV) of the NCF1 pseudogenes ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-11-13

    authors: Brunson T,Wang Q,Chambers I,Song Q

    updated:2010-02-23 00:00:00

  • Genomic and expression analysis of multiple Sry loci from a single Rattus norvegicus Y chromosome.

    abstract:BACKGROUND:Sry is a gene known to be essential for testis determination but is also transcribed in adult male tissues. The laboratory rat, Rattus norvegicus, has multiple Y chromosome copies of Sry while most mammals have only a single copy. DNA sequence comparisons with other rodents with multiple Sry copies are incon...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-8-11

    authors: Turner ME,Martin C,Martins AS,Dunmire J,Farkas J,Ely DL,Milsted A

    updated:2007-04-04 00:00:00

  • Characterization of two MHC II genes (DOB, DRB) in white-tailed deer (Odocoileus virginianus).

    abstract:BACKGROUND:The major histocompatibility complex (MHC) is responsible for detecting and addressing foreign pathogens inside the body. While the general structure of MHC genes is relatively well conserved among mammalian species, it is notably different among ruminants due to a chromosomal inversion that splits MHC type ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-020-00889-5

    authors: Ivy-Israel NMD,Moore CE,Schwartz TS,Ditchkoff SS

    updated:2020-07-29 00:00:00

  • Exome sequencing of a family with lone, autosomal dominant atrial flutter identifies a rare variation in ABCB4 significantly enriched in cases.

    abstract:BACKGROUND:Lone atrial flutter (AFL) and atrial fibrillation (AF) are common and sometimes consequential cardiac conduction disorders with a strong heritability, as underlined by recent genome-wide association studies that identified genetic modifiers. Follow-up family-based genetic analysis also identified Mendelian t...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-015-0177-0

    authors: Maciąg A,Villa F,Ferrario A,Spinelli CC,Carrizzo A,Malovini A,Torella A,Montenero C,Parisi A,Condorelli G,Vecchione C,Nigro V,Montenero AS,Puca AA

    updated:2015-02-11 00:00:00

  • Cytoplasm affects grain weight and filled-grain ratio in indica rice.

    abstract:BACKGROUND:Cytoplasmic effects on agronomic traits--involving cytoplasmic and nuclear genomes of either different species or different cultivars--are well documented in wheat but have seldom been demonstrated in rice (Oryza sativa L.). To detect cytoplasmic effects, we introgressed the nuclear genomes of three indica c...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-12-53

    authors: Tao D,Xu P,Zhou J,Deng X,Li J,Deng W,Yang J,Yang G,Li Q,Hu F

    updated:2011-06-01 00:00:00

  • Interval estimation of disease loci: development and applications of new linkage methods.

    abstract::Three variants of the confidence set inference (CSI) procedure were proposed and applied to both the simulated and the Collaborative Study on the Genetics of Alcoholism (COGA) data. For each of the two applications, we first performed a preliminary genome scan study based on the microsatellite markers using the GENEHU...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-6-S1-S21

    authors: Papachristou C,Lin S

    updated:2005-12-30 00:00:00

  • Hierarchical modeling in association studies of multiple phenotypes.

    abstract::The genetic study of disease-associated phenotypes has become common because such phenotypes are often easier to measure and in many cases are under greater genetic control than the complex disease itself. Some disease-associated phenotypes are rare, however, making it difficult to evaluate their effects due to small ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-6-S1-S104

    authors: Liu X,Jorgenson E,Witte JS

    updated:2005-12-30 00:00:00

  • Meta-analysis of haplotype-association studies: comparison of methods and empirical evaluation of the literature.

    abstract:BACKGROUND:Meta-analysis is a popular methodology in several fields of medical research, including genetic association studies. However, the methods used for meta-analysis of association studies that report haplotypes have not been studied in detail. In this work, methods for performing meta-analysis of haplotype assoc...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-12-8

    authors: Bagos PG

    updated:2011-01-19 00:00:00

  • B chromosome in the beetle Coprophanaeus cyanescens (Scarabaeidae): emphasis in the organization of repetitive DNA sequences.

    abstract:BACKGROUND:To contribute to the knowledge of coleopteran cytogenetics, especially with respect to the genomic content of B chromosomes, we analyzed the composition and organization of repetitive DNA sequences in the Coprophanaeus cyanescens karyotype. We used conventional staining and the application of fluorescence in...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-13-96

    authors: Gomes de Oliveira S,Cassia de Moura R,Martins C

    updated:2012-11-06 00:00:00

  • Genome-wide association analyses for carcass quality in crossbred beef cattle.

    abstract:BACKGROUND:Genetic improvement of beef quality will benefit both producers and consumers, and can be achieved by selecting animals that carry desired quantitative trait nucleotides (QTN), which result from intensive searches using genetic markers. This paper presents a genome-wide association approach utilizing single ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-14-80

    authors: Lu D,Sargolzaei M,Kelly M,Vander Voort G,Wang Z,Mandell I,Moore S,Plastow G,Miller SP

    updated:2013-09-11 00:00:00

  • A power study of bivariate LOD score analysis of a complex trait and fear/discomfort with strangers.

    abstract::Complex diseases are often reported along with disease-related traits (DRT). Sometimes investigators consider both disease and DRT phenotypes separately and sometimes they consider individuals as affected if they have either the disease or the DRT, or both. We propose instead to consider the joint distribution of the ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-6-S1-S113

    authors: Ji F,Lee D,Mendell NR

    updated:2005-12-30 00:00:00

  • Molecular cytogenetic differentiation of paralogs of Hox paralogs in duplicated and re-diploidized genome of the North American paddlefish (Polyodon spathula).

    abstract:BACKGROUND:Acipenseriformes is a basal lineage of ray-finned fishes and comprises 27 extant species of sturgeons and paddlefishes. They are characterized by several specific genomic features such as broad ploidy variation, high chromosome numbers, presence of numerous microchromosomes and propensity to interspecific hybridiz...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-017-0484-8

    authors: Symonová R,Havelka M,Amemiya CT,Howell WM,Kořínková T,Flajšhans M,Gela D,Ráb P

    updated:2017-03-02 00:00:00

  • The TCF7L2 rs7903146 polymorphism, dietary intakes and type 2 diabetes risk in an Algerian population.

    abstract:BACKGROUND:The transcription factor 7-like 2 (TCF7L2) gene is the most significant genetic risk factor for type 2 diabetes (T2D). Association analyses were performed on participants (n = 751, aged between 30 and 64) in the ISOR population-based study in the city of Oran. Dietary intakes were estimated using a weekly fo...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-014-0134-3

    authors: Ouhaibi-Djellouli H,Mediene-Benchekor S,Lardjam-Hetraf SA,Hamani-Medjaoui I,Meroufel DN,Boulenouar H,Hermant X,Saidi-Mehtar N,Amouyel P,Houti L,Goumidi L,Meirhaeghe A

    updated:2014-12-10 00:00:00

  • Comparing self-reported ethnicity to genetic background measures in the context of the Multi-Ethnic Study of Atherosclerosis (MESA).

    abstract:BACKGROUND:Questions remain regarding the utility of self-reported ethnicity (SRE) in genetic and epidemiologic research. It is not clear whether conditioning on SRE provides adequate protection from inflated type I error rates due to population stratification and admixture. We address this question using data obtained...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-12-28

    authors: Divers J,Redden DT,Rice KM,Vaughan LK,Padilla MA,Allison DB,Bluemke DA,Young HJ,Arnett DK

    updated:2011-03-04 00:00:00

  • Genetic diversity assessment of sesame core collection in China by phenotype and molecular markers and extraction of a mini-core collection.

    abstract:BACKGROUND:Sesame (Sesamum indicum L.) is one of the four major oil crops in China. A sesame core collection (CC) was established in China in 2000, but no complete study on its genetic diversity has been carried out at either the phenotypic or molecular level. To provide technical guidance, a theoretical basis for furt...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-13-102

    authors: Zhang Y,Zhang X,Che Z,Wang L,Wei W,Li D

    updated:2012-11-15 00:00:00

  • MCP1 haplotypes associated with protection from pulmonary tuberculosis.

    abstract:BACKGROUND:The monocyte chemoattractant protein 1 (MCP-1) is involved in the recruitment of lymphocytes and monocytes and their migration to sites of injury and cellular immune reactions. In a Ghanaian tuberculosis (TB) case-control study group, associations of the MCP1 -362C and the MCP1 -2581G alleles with resistance...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-12-34

    authors: Intemann CD,Thye T,Förster B,Owusu-Dabo E,Gyapong J,Horstmann RD,Meyer CG

    updated:2011-04-19 00:00:00

  • Multiple telophase arrest bypassed (tab) mutants alleviate the essential requirement for Cdc15 in exit from mitosis in S. cerevisiae.

    abstract:BACKGROUND:The Mitotic Exit Network (MEN) proteins - including the protein kinase Cdc15 and the protein phosphatase Cdc14 - are essential for exit from mitosis in Saccharomyces cerevisiae. To identify downstream targets of the MEN, we sought telophase arrest bypassed (tab) mutations that bypassed the essential requirem...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-3-4

    authors: Shou W,Deshaies RJ

    updated:2002-03-12 00:00:00

  • Molecular organization and chromosomal localization of 5S rDNA in Amazonian Engystomops (Anura, Leiuperidae).

    abstract:BACKGROUND:For anurans, knowledge of 5S rDNA is scarce. For Engystomops species, chromosomal homeologies are difficult to recognize due to the high level of inter- and intraspecific cytogenetic variation. In an attempt to better compare the karyotypes of the Amazonian species Engystomops freibergi and Engystomops peter...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/1471-2156-13-17

    authors: Rodrigues DS,Rivera M,Lourenço LB

    updated:2012-03-20 00:00:00

  • Whole genome population genetics analysis of Sudanese goats identifies regions harboring genes associated with major traits.

    abstract:BACKGROUND:Sudan is endowed with a variety of indigenous goat breeds which are used for meat and milk production and which are well adapted to the local environment. The aim of the present study was to determine the genetic diversity and relationship within and between the four main Sudanese breeds of Nubian, Desert, T...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-017-0553-z

    authors: Rahmatalla SA,Arends D,Reissmann M,Said Ahmed A,Wimmers K,Reyer H,Brockmann GA

    updated:2017-10-23 00:00:00

  • MicroRNA-146a rs2910164 is associated with severe preeclampsia in Black South African women on HAART.

    abstract:BACKGROUND:South African (SA) Black women have a high prevalence of preeclampsia and HIV, both conditions associated with increased inflammation. miR-146a is an inflammatory-associated miR and a common single nucleotide polymorphism (rs2910164) has been associated with several disease conditions. To date, this SNP has ...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-016-0469-z

    authors: Maharaj NR,Ramkaran P,Pillay S,Chuturgoon AA

    updated:2017-01-19 00:00:00

  • Analysis of an independent tumor suppressor locus telomeric to Tp53 suggested Inpp5k and Myo1c as novel tumor suppressor gene candidates in this region.

    abstract:BACKGROUND:Several reports indicate a commonly deleted chromosomal region independent from, and distal to the TP53 locus in a variety of human tumors. In a previous study, we reported a similar finding in a rat tumor model for endometrial carcinoma (EC) and through developing a deletion map, narrowed the candidate regi...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-015-0238-4

    authors: Hedberg Oldfors C,Dios DG,Linder A,Visuttijai K,Samuelson E,Karlsson S,Nilsson S,Behboudi A

    updated:2015-07-14 00:00:00

  • Association study of stuttering candidate genes GNPTAB, GNPTG and NAGPA with dyslexia in Chinese population.

    abstract:BACKGROUND:Dyslexia is a polygenic speech and language disorder characterized by an unexpected difficulty in reading in children and adults despite normal intelligence and schooling. Increasing evidence reveals that different speech and language disorders could share common genetic factors. As previous study reported a...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-015-0172-5

    authors: Chen H,Xu J,Zhou Y,Gao Y,Wang G,Xia J,Huen MS,Siok WT,Jiang Y,Tan LH,Sun Y

    updated:2015-02-03 00:00:00

  • Generation of an 870 kb deletion encompassing the Skt/Etl4 locus by combination of inter- and intra-chromosomal recombination.

    abstract:BACKGROUND:Etl4(lacZ) (Enhancer trap locus 4) and Skt(Gt) (Sickle tail) are lacZ reporter gene integrations into the same locus on mouse chromosome 2 targeting a gene that is expressed in the notochord of early embryos and in multiple epithelia during later development. Both insertions caused recessive mutations that r...

    journal_title:BMC genetics

    pub_type: Journal Article

    doi:10.1186/s12863-015-0302-0

    authors: Serth K,Beckers A,Schuster-Gossler K,Pavlova MN,Müller J,Paul MC,Reinhardt R,Gossler A

    updated:2015-12-18 00:00:00