Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data.

Abstract:

BACKGROUND:Typically, the first phase of a genome wide association study (GWAS) includes genotyping across hundreds of individuals and validation of the most significant SNPs. Allelotyping of pooled genomic DNA is a common approach to reduce the overall cost of the study. Knowledge of haplotype structure can provide additional information to single locus analyses. Several methods have been proposed for estimating haplotype frequencies in a population from pooled DNA data. RESULTS:We introduce a technique for haplotype frequency estimation in a population from pooled DNA samples focusing on datasets containing a small number of individuals per pool (2 or 3 individuals) and a large number of markers. We compare our method with the publicly available state-of-the-art algorithms HIPPO and HAPLOPOOL on datasets of varying number of pools and marker sizes. We demonstrate that our algorithm provides improvements in terms of accuracy and computational time over competing methods for large number of markers while demonstrating comparable performance for smaller marker sizes. Our method is implemented in the "Tree-Based Deterministic Sampling Pool" (TDSPool) package which is available for download at http://www.ee.columbia.edu/~anastas/tdspool. CONCLUSIONS:Using a tree-based determinstic sampling technique we present an algorithm for haplotype frequency estimation from pooled data. Our method demonstrates superior performance in datasets with large number of markers and could be the method of choice for haplotype frequency estimation in such datasets.

journal_name

BMC Genet

journal_title

BMC genetics

authors

Iliadis A,Anastassiou D,Wang X

doi

10.1186/1471-2156-13-94

subject

Has Abstract

pub_date

2012-10-30 00:00:00

pages

94

issn

1471-2156

pii

1471-2156-13-94

journal_volume

13

pub_type

杂志文章
  • Genome-wide association and systems genetic analyses of residual feed intake, daily feed consumption, backfat and weight gain in pigs.

    abstract:BACKGROUND:Feed efficiency is one of the major components determining costs of animal production. Residual feed intake (RFI) is defined as the difference between the observed and the expected feed intake given a certain production. Residual feed intake 1 (RFI1) was calculated based on regression of individual daily fee...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-15-27

    authors: Do DN,Ostersen T,Strathe AB,Mark T,Jensen J,Kadarmideen HN

    更新日期:2014-02-17 00:00:00

  • Abnormal X: autosome ratio, but normal X chromosome inactivation in human triploid cultures.

    abstract:BACKGROUND:X chromosome inactivation (XCI) is that aspect of mammalian dosage compensation that brings about equivalence of X-linked gene expression between females and males by inactivating one of the two X chromosomes (Xi) in normal female cells, leaving them with a single active X (Xa) as in male cells. In cells wit...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-7-41

    authors: Gartler SM,Varadarajan KR,Luo P,Norwood TH,Canfield TK,Hansen RS

    更新日期:2006-07-03 00:00:00

  • Model-specific tests on variance heterogeneity for detection of potentially interacting genetic loci.

    abstract:BACKGROUND:Trait variances among genotype groups at a locus are expected to differ in the presence of an interaction between this locus and another locus or environment. A simple maximum test on variance heterogeneity can thus be used to identify potentially interacting single nucleotide polymorphisms (SNPs). RESULTS:...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-13-59

    authors: Hothorn LA,Libiger O,Gerhard D

    更新日期:2012-07-18 00:00:00

  • Microsatellites versus single-nucleotide polymorphisms in linkage analysis for quantitative and qualitative measures.

    abstract:BACKGROUND:Genetic maps based on single-nucleotide polymorphisms (SNP) are increasingly being used as an alternative to microsatellite maps. This study compares linkage results for both types of maps for a neurophysiology phenotype and for an alcohol dependence phenotype. Our analysis used two SNP maps on the Illumina ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-6-S1-S122

    authors: Dunn G,Hinrichs AL,Bertelsen S,Jin CH,Kauwe JS,Suarez BK,Bierut LJ

    更新日期:2005-12-30 00:00:00

  • Fine-scale genetic analysis of the exploited Nile monitor (Varanus niloticus) in Sahelian Africa.

    abstract:BACKGROUND:Overexploitation of wildlife populations results in direct consequences, such as extinction and local extirpation, as well as indirect effects including genetic diversity loss and changes in genetic structure. A clear understanding of the underlying genetic patterns of harvested species is necessary for sust...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-015-0188-x

    authors: Dowell SA,de Buffrénil V,Kolokotronis SO,Hekkala ER

    更新日期:2015-03-28 00:00:00

  • Characterization of molecular diversity and genome-wide association study of stripe rust resistance at the adult plant stage in Northern Chinese wheat landraces.

    abstract:BACKGROUND:Stripe rust is a serious fungal disease of wheat (Triticum aestivum L.) caused by Puccinia striiformis f. sp. tritici (Pst), which results in yield reduction and decreased grain quality. Breeding for genetic resistance to stripe rust is the most cost-effective method to control the disease. In the present st...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-019-0736-x

    authors: Yao F,Zhang X,Ye X,Li J,Long L,Yu C,Li J,Wang Y,Wu Y,Wang J,Jiang Q,Li W,Ma J,Wei Y,Zheng Y,Chen G

    更新日期:2019-03-26 00:00:00

  • Worldwide distribution of NAT2 diversity: implications for NAT2 evolutionary history.

    abstract:BACKGROUND:The N-acetyltransferase 2 (NAT2) gene plays a crucial role in the metabolism of many drugs and xenobiotics. As it represents a likely target of population-specific selection pressures, we fully sequenced the NAT2 coding region in 97 Mandenka individuals from Senegal, and compared these sequences to extant da...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-21

    authors: Sabbagh A,Langaney A,Darlu P,Gérard N,Krishnamoorthy R,Poloni ES

    更新日期:2008-02-27 00:00:00

  • Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype.

    abstract:BACKGROUND:RNA sequencing (RNA-seq) technology has identified multiple differentially expressed (DE) genes associated to complex disease, however, these genes only explain a modest part of variance. Omnigenic model assumes that disease may be driven by genes with indirect relevance to disease and be propagated by funct...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-019-0739-7

    authors: Shao F,Wang Y,Zhao Y,Yang S

    更新日期:2019-03-19 00:00:00

  • Reassessment of the function of somatolactin alpha in lipid metabolism using medaka mutant and transgenic strains.

    abstract:BACKGROUND:Somatolactin alpha (SLa) is a fish-specific peptide hormone secreted from the pituitary. In medaka, SLa functions to darken the skin color and lack of SLa makes it pale. Transcription of SLa is enhanced or suppressed when fish are kept in dark or bright conditions, respectively, indicating SLa's important ro...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-13-64

    authors: Sasano Y,Yoshimura A,Fukamachi S

    更新日期:2012-07-24 00:00:00

  • Identification and mapping of stable QTL with main and epistasis effect on rice grain yield under upland drought stress.

    abstract:BACKGROUND:Drought is one of the most important abiotic stresses that cause drastic reduction in rice grain yield (GY) in rainfed environments. The identification and introgression of QTL leading to high GY under drought have been advocated to be the preferred breeding strategy to improve drought tolerance of popular r...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-15-63

    authors: Sandhu N,Singh A,Dixit S,Sta Cruz MT,Maturan PC,Jain RK,Kumar A

    更新日期:2014-05-27 00:00:00

  • Age-stratified heritability estimation in the Framingham Heart Study families.

    abstract::The Framingham Heart Study provides a unique source of longitudinal family data related to CVD risk factors. Age-stratified heritability estimates were obtained over three age groups (31-49 years, 50-60 years, and 61-79 years), reflecting the longitudinal nature of the data, for four quantitative traits. Age-adjusted ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-4-S1-S32

    authors: Brown WM,Beck SR,Lange EM,Davis CC,Kay CM,Langefeld CD,Rich SS,Framingham Heart Study.

    更新日期:2003-12-31 00:00:00

  • A genome-wide scanning and fine mapping study of COGA data.

    abstract::A thorough genetic mapping study was performed to identify predisposing genes for alcoholism dependence using the Collaborative Study on the Genetics of Alcoholism (COGA) data. The procedure comprised whole-genome linkage and confirmation analyses, single locus and haplotype fine mapping analyses, and gene x environme...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-6-S1-S30

    authors: Yang HC,Chang CC,Lin CY,Chen CL,Lin CY,Fann CS

    更新日期:2005-12-30 00:00:00

  • Probability genotype imputation method and integrated weighted lasso for QTL identification.

    abstract:BACKGROUND:Many QTL studies have two common features: (1) often there is missing marker information, (2) among many markers involved in the biological process only a few are causal. In statistics, the second issue falls under the headings "sparsity" and "causal inference". The goal of this work is to develop a two-step...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-14-125

    authors: Demetrashvili N,Van den Heuvel ER,Wit EC

    更新日期:2013-12-30 00:00:00

  • MicroRNA-146a rs2910164 is associated with severe preeclampsia in Black South African women on HAART.

    abstract:BACKGROUND:South African (SA) Black women have a high prevalence of preeclampsia and HIV, both conditions associated with increased inflammation. miR-146a is an inflammatory-associated miR and a common single nucleotide polymorphism (rs2910164) has been associated with several disease conditions. To date, this SNP has ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-016-0469-z

    authors: Maharaj NR,Ramkaran P,Pillay S,Chuturgoon AA

    更新日期:2017-01-19 00:00:00

  • AB-QTL analysis reveals new alleles associated to proline accumulation and leaf wilting under drought stress conditions in barley (Hordeum vulgare L.).

    abstract:BACKGROUND:Land plants have evolved several measures to maintain their life against abiotic stresses. The accumulation of proline is the most generalized response of plants under drought, heat or salt stress conditions. It is known as an osmoprotectant which also acts as an instant source of energy during drought recov...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-13-61

    authors: Sayed MA,Schumann H,Pillen K,Naz AA,Léon J

    更新日期:2012-07-20 00:00:00

  • Comparison of the accuracy of methods of computational haplotype inference using a large empirical dataset.

    abstract:BACKGROUND:Analyses of genetic data at the level of haplotypes provide increased accuracy and power to infer genotype-phenotype correlations and evolutionary history of a locus. However, empirical determination of haplotypes is expensive and laborious. Therefore, several methods of inferring haplotypes from unphased ge...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-5-22

    authors: Adkins RM

    更新日期:2004-08-03 00:00:00

  • An investigation of the effects of lipid-lowering medications: genome-wide linkage analysis of lipids in the HyperGEN study.

    abstract:BACKGROUND:Use of anti-hyperlipidemic medications compromises genetic analysis because of altered lipid profiles. We propose an empirical method to adjust lipid levels for medication effects so that the adjusted lipid values substitute the unmedicated lipid values in the genetic analysis. RESULTS:Published clinical tr...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-8-60

    authors: Wu J,Province MA,Coon H,Hunt SC,Eckfeldt JH,Arnett DK,Heiss G,Lewis CE,Ellison RC,Rao DC,Rice T,Kraja AT

    更新日期:2007-09-10 00:00:00

  • Three novel mutations in KIF21A highlight the importance of the third coiled-coil stalk domain in the etiology of CFEOM1.

    abstract:BACKGROUND:Congenital fibrosis of the extraocular muscles types 1 and 3 (CFEOM1/CFEOM3) are autosomal dominant strabismus disorders that appear to result from maldevelopment of ocular nuclei and nerves. We previously reported that most individuals with CFEOM1 and rare individuals with CFEOM3 harbor heterozygous mutatio...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-8-26

    authors: Chan WM,Andrews C,Dragan L,Fredrick D,Armstrong L,Lyons C,Geraghty MT,Hunter DG,Yazdani A,Traboulsi EI,Pott JW,Gutowski NJ,Ellard S,Young E,Hanisch F,Koc F,Schnall B,Engle EC

    更新日期:2007-05-18 00:00:00

  • Correction to: Landraces of snake melon, an ancient Middle Eastern crop, reveal extensive morphological and DNA diversity for potential genetic improvement.

    abstract::Following publication of the original article [1], the authors reported the need for a more detailed acknowledgement of the source of the samples that were analyzed and their coordinates, which are discussed in the 'Methods' section of the article. This Correction provides an addition to the 'Methods' section, and a s...

    journal_title:BMC genetics

    pub_type: 杂志文章,已发布勘误

    doi:10.1186/s12863-018-0632-9

    authors: Omari S,Kamenir Y,Benichou JIC,Pariente S,Sela H,Perl-Treves R

    更新日期:2018-07-12 00:00:00

  • Statistically efficient association analysis of quantitative traits with haplotypes and untyped SNPs in family studies.

    abstract:BACKGROUND:Associations between haplotypes and quantitative traits provide valuable information about the genetic basis of complex human diseases. Haplotypes also provide an effective way to deal with untyped SNPs. Two major challenges arise in haplotype-based association analysis of family data. First, haplotypes may ...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-020-00902-x

    authors: Diao G,Lin DY

    更新日期:2020-09-07 00:00:00

  • Wild barley introgression lines revealed novel QTL alleles for root and related shoot traits in the cultivated barley (Hordeum vulgare L.).

    abstract:BACKGROUND:Root is the prime organ that sucks water and nutrients from deep layer of soil. Wild barley diversity exhibits remarkable variation in root system architecture that seems crucial in its adaptation to abiotic stresses like drought. In the present study, we performed quantitative trait locus (QTL) mapping of r...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-014-0107-6

    authors: Naz AA,Arifuzzaman M,Muzammil S,Pillen K,Léon J

    更新日期:2014-10-07 00:00:00

  • Construction of an integrative regulatory element and variation map of the murine Tst locus.

    abstract:BACKGROUND:Given the abundance of new genomic projects and gene annotations, researchers trying to pinpoint causal genetic variants are faced with a challenging task of how to efficiently integrate all current genomic information. The objective of the study was to develop an approach to integrate various genomic annota...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-016-0381-6

    authors: Beltram J,Morton NM,Kunej T,Horvat S

    更新日期:2016-06-11 00:00:00

  • Identification of major QTLs underlying tomato spotted wilt virus resistance in peanut cultivar Florida-EP(TM) '113'.

    abstract:BACKGROUND:Spotted wilt caused by tomato spotted wilt virus (TSWV) is one of the major peanut (Arachis hypogaea L.) diseases in the southeastern United States. Occurrence, severity, and symptoms of spotted wilt disease are highly variable from season to season, making it difficult to efficiently evaluate breeding popul...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-016-0435-9

    authors: Tseng YC,Tillman BL,Peng Z,Wang J

    更新日期:2016-09-06 00:00:00

  • Association of dopaminergic pathway gene polymorphisms with chronic renal insufficiency among Asian Indians with type-2 diabetes.

    abstract:BACKGROUND:Genetic markers conferring susceptibility to diabetes specific renal disease remains to be identified for early prediction and development of effective drugs and therapies. Inconsistent results obtained from analysis of genes from classical pathways generate need for examination of unconventional genetic mar...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-26

    authors: Prasad P,Kumar KM,Ammini AC,Gupta A,Gupta R,Thelma BK

    更新日期:2008-03-22 00:00:00

  • Preliminary study on microR-148a and microR-10a in dermal papilla cells of Hu sheep.

    abstract:BACKGROUND:Hu sheep, a unique Chinese breed with high reproductive performance, are also well known for their rare white lambskin in China. The quality of lambskin is affected by hair follicles, and dermal papilla cells are an important component of hair follicles that plays a key role in hair follicle growth and devel...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-019-0770-8

    authors: Lv X,Gao W,Jin C,Wang L,Wang Y,Chen W,Zou S,Huang S,Li Z,Wang J,Sun W

    更新日期:2019-08-27 00:00:00

  • Quantitative trait locus analysis for pod- and kernel-related traits in the cultivated peanut (Arachis hypogaea L.).

    abstract:BACKGROUND:The cultivated peanut (Arachis hypogaea L.) is an important oil and food crop in the world. Pod- and kernel-related traits are direct factors involved in determining the yield of the peanut. However, the genetic basis underlying pod- and kernel-related traits in the peanut remained largely unknown, which ham...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-016-0337-x

    authors: Chen W,Jiao Y,Cheng L,Huang L,Liao B,Tang M,Ren X,Zhou X,Chen Y,Jiang H

    更新日期:2016-01-25 00:00:00

  • The chromosomal constitution of fish hybrid lineage revealed by 5S rDNA FISH.

    abstract:BACKGROUND:The establishment of the bisexual fertile fish hybrid lineage including the allodiploid and allotetraploid hybrids, from interspecific hybridization of red crucian carp (Carassius auratus red var. 2n = 100, 2n = AA) (♀) × common carp (Cyprinus carpio L. 2n = 100, 2n = BB) (♂), provided a good platform to inv...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-015-0295-8

    authors: Zhang C,Ye L,Chen Y,Xiao J,Wu Y,Tao M,Xiao Y,Liu S

    更新日期:2015-12-03 00:00:00

  • Genomic characterisation, chromosomal assignment and in vivo localisation of the canine high mobility group A1 (HMGA1) gene.

    abstract:BACKGROUND:The high mobility group A1 proteins (HMGA1a/HMGA1b) are highly conserved between mammalian species and widely described as participating in various cellular processes. By inducing DNA conformation changes the HMGA1 proteins indirectly influence the binding of various transcription factors and therefore effec...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-9-49

    authors: Beuing C,Soller JT,Muth M,Wagner S,Dolf G,Schelling C,Richter A,Willenbrock S,Reimann-Berg N,Winkler S,Nolte I,Bullerdiek J,Murua Escobar H

    更新日期:2008-07-23 00:00:00

  • The use of the SLC16A1 gene as a potential marker to predict race performance in Arabian horses.

    abstract:BACKGROUND:Arabian horses are commonly believed to be one of the oldest and the most popular horse breeds in the world, characterized by favourable stamina traits and exercise phenotypes. During intensive training, the rates of lactate production and utilization are critical to avoid muscle fatigue and a decrease in ex...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/s12863-019-0774-4

    authors: Ropka-Molik K,Stefaniuk-Szmukier M,Szmatoła T,Piórkowska K,Bugno-Poniewierska M

    更新日期:2019-09-11 00:00:00

  • Impact of the diagnosis definition on linkage detection.

    abstract::Previous genome scan linkage analyses of the disease Kofendrerd Personality Disorder (KPD) with microsatellites led to detect some regions on chromosomes 1, 3, 5, and 9 that were identical for the three populations AI, KA, and DA but with large differences in significance levels. These differences in results may be ex...

    journal_title:BMC genetics

    pub_type: 杂志文章

    doi:10.1186/1471-2156-6-S1-S140

    authors: Dizier MH,Génin E,Babron MC,Bourgain C

    更新日期:2005-12-30 00:00:00