An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.

Abstract:

BACKGROUND:Europeans and American Indians were major genetic ancestry of Hispanics in the U.S. These ancestral groups have markedly different incidence rates and outcomes in many types of cancers. Therefore, the genetic admixture may cause biased genetic association study with cancer susceptibility variants specifically in Hispanics. For example, the incidence rate of liver cancer has been shown with substantial disparity between Hispanic, Asian and non-Hispanic white populations. Currently, ancestry informative marker (AIM) panels have been widely utilized with up to a few hundred ancestry-informative single nucleotide polymorphisms (SNPs) to infer ancestry admixture. Notably, current available AIMs are predominantly located in intron and intergenic regions, while the whole exome sequencing (WES) protocols commonly used in translational research and clinical practice do not cover these markers. Thus, it remains challenging to accurately determine a patient's admixture proportion without additional DNA testing. RESULTS:In this study we designed an unique AIM panel that infers 3-way genetic admixture from three distinct and selective continental populations (African (AFR), European (EUR), and East Asian (EAS)) within evolutionarily conserved exonic regions. Initially, about 1 million exonic SNPs from selective three populations in the 1000 Genomes Project were trimmed by their linkage disequilibrium (LD), restricted to biallelic variants, and finally we optimized to an AIM panel with 250 SNP markers, or the UT-AIM250 panel, using their ancestral informativeness statistics. Comparing to published AIM panels, UT-AIM250 performed better accuracy when we tested with three ancestral populations (accuracy: 0.995 ± 0.012 for AFR, 0.997 ± 0.007 for EUR, and 0.994 ± 0.012 for EAS). We further demonstrated the performance of the UT-AIM250 panel to admixed American (AMR) samples of the 1000 Genomes Project and obtained similar results (AFR, 0.085 ± 0.098; EUR, 0.665 ± 0.182; and EAS, 0.250 ± 0.205) to previously published AIM panels (Phillips-AIM34: AFR, 0.096 ± 0.127, EUR, 0.575 ± 0.290, and EAS, 0.330 ± 0.315; Wei-AIM278: AFR, 0.070 ± 0.096, EUR, 0.537 ± 0.267, and EAS, 0.393 ± 0.300). Subsequently, we applied the UT-AIM250 panel to a clinical dataset of 26 self-reported Hispanic patients in South Texas with hepatocellular carcinoma (HCC). We estimated the admixture proportions using WES data of adjacent non-cancer liver tissues (AFR, 0.065 ± 0.043; EUR, 0.594 ± 0.150; and EAS, 0.341 ± 0.160). Similar admixture proportions were identified from corresponding tumor tissues. In addition, we estimated admixture proportions of The Cancer Genome Atlas (TCGA) collection of hepatocellular carcinoma (TCGA-LIHC) samples (376 patients) using the UT-AIM250 panel. The panel obtained consistent admixture proportions from tumor and matched normal tissues, identified 3 possible incorrectly reported race/ethnicity, and/or provided race/ethnicity determination if necessary. CONCLUSIONS:Here we demonstrated the feasibility of using evolutionarily conserved exonic regions to infer admixture proportions and provided a robust and reliable control for sample collection or patient stratification for genetic analysis. R implementation of UT-AIM250 is available at https://github.com/chenlabgccri/UT-AIM250.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Wang LJ,Zhang CW,Su SC,Chen HH,Chiu YC,Lai Z,Bouamar H,Ramirez AG,Cigarroa FG,Sun LZ,Chen Y

doi

10.1186/s12864-019-6333-6

subject

Has Abstract

pub_date

2019-12-30 00:00:00

pages

1007

issue

Suppl 12

issn

1471-2164

pii

10.1186/s12864-019-6333-6

journal_volume

20

pub_type

杂志文章
  • The genetic regulation of size variation in the transcriptome of the cerebrum in the chicken and its role in domestication and brain size evolution.

    abstract:BACKGROUND:Large difference in cerebrum size exist between avian species and populations of the same species and is believed to reflect differences in processing power, i.e. in the speed and efficiency of processing information in this brain region. During domestication chickens developed a larger cerebrum compared to ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06908-0

    authors: Höglund A,Strempfl K,Fogelholm J,Wright D,Henriksen R

    更新日期:2020-07-29 00:00:00

  • Extensive analysis of D-J-C arrangements allows the identification of different mechanisms enhancing the diversity in sheep T cell receptor beta-chain repertoire.

    abstract:BACKGROUND:In most species of mammals, the TRB locus has the common feature of a library of TRBV genes positioned at the 5'- end of two in tandem aligned D-J-C gene clusters, each composed of a single TRBD gene, 6-7 TRBJ genes and one TRBC gene. An enhancer located at the 3'end of the last TRBC and a well-defined promo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-3

    authors: Di Tommaso S,Antonacci R,Ciccarese S,Massari S

    更新日期:2010-01-04 00:00:00

  • In silico and in situ characterization of the zebrafish (Danio rerio) gnrh3 (sGnRH) gene.

    abstract:BACKGROUND:Gonadotropin releasing hormone (GnRH) is responsible for stimulation of gonadotropic hormone (GtH) in the hypothalamus-pituitary-gonadal axis (HPG). The regulatory mechanisms responsible for brain specificity make the promoter attractive for in silico analysis and reporter gene studies in zebrafish (Danio re...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-3-25

    authors: Torgersen J,Nourizadeh-Lillabadi R,Husebye H,Aleström P

    更新日期:2002-08-21 00:00:00

  • High-throughput cis-regulatory element discovery in the vector mosquito Aedes aegypti.

    abstract:BACKGROUND:Despite substantial progress in mosquito genomic and genetic research, few cis-regulatory elements (CREs), DNA sequences that control gene expression, have been identified in mosquitoes or other non-model insects. Formaldehyde-assisted isolation of regulatory elements paired with DNA sequencing, FAIRE-seq, i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2468-x

    authors: Behura SK,Sarro J,Li P,Mysore K,Severson DW,Emrich SJ,Duman-Scheel M

    更新日期:2016-05-10 00:00:00

  • Transcriptome profiling of antiviral immune and dietary fatty acid dependent responses of Atlantic salmon macrophage-like cells.

    abstract:BACKGROUND:Due to the limited availability and high cost of fish oil in the face of increasing aquaculture production, there is a need to reduce usage of fish oil in aquafeeds without compromising farm fish health. Therefore, the present study was conducted to determine if different levels of vegetable and fish oils ca...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4099-2

    authors: Eslamloo K,Xue X,Hall JR,Smith NC,Caballero-Solares A,Parrish CC,Taylor RG,Rise ML

    更新日期:2017-09-08 00:00:00

  • Developing and applying a gene functional association network for anti-angiogenic kinase inhibitor activity assessment in an angiogenesis co-culture model.

    abstract:BACKGROUND:Tumor angiogenesis is a highly regulated process involving intercellular communication as well as the interactions of multiple downstream signal transduction pathways. Disrupting one or even a few angiogenesis pathways is often insufficient to achieve sustained therapeutic benefits due to the complexity of a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-264

    authors: Chen Y,Wei T,Yan L,Lawrence F,Qian HR,Burkholder TP,Starling JJ,Yingling JM,Shou J

    更新日期:2008-06-02 00:00:00

  • Effect of CAR activation on selected metabolic pathways in normal and hyperlipidemic mouse livers.

    abstract:BACKGROUND:Detoxification in the liver involves activation of nuclear receptors, such as the constitutive androstane receptor (CAR), which regulate downstream genes of xenobiotic metabolism. Frequently, the metabolism of endobiotics is also modulated, resulting in potentially harmful effects. We therefore used 1,4-Bis ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-384

    authors: Rezen T,Tamasi V,Lövgren-Sandblom A,Björkhem I,Meyer UA,Rozman D

    更新日期:2009-08-19 00:00:00

  • Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds.

    abstract:BACKGROUND:Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-eff...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-131

    authors: Shi CY,Yang H,Wei CL,Yu O,Zhang ZZ,Jiang CJ,Sun J,Li YY,Chen Q,Xia T,Wan XC

    更新日期:2011-02-28 00:00:00

  • Correction to: An Arabidopsis introgression zone studied at high spatio-temporal resolution: interglacial and multiple genetic contact exemplified using whole nuclear and plastid genomes.

    abstract::ᅟ: Upon publication of the original article [1], the authors had flagged that there was an error in Fig. 1c, as the key in this figure was displaying incorrectly. The colours had not displayed in the key in the final published article, and instead appear as plain white. ...

    journal_title:BMC genomics

    pub_type: 杂志文章,已发布勘误

    doi:10.1186/s12864-018-4614-0

    authors: Hohmann N,Koch MA

    更新日期:2018-04-11 00:00:00

  • Expanding dynamics of the virulence-related gene variations in the toxigenic Vibrio cholerae serogroup O1.

    abstract:BACKGROUND:Toxigenic Vibrio cholerae serogroup O1 is the causative pathogen in the sixth and seventh cholera pandemics. Cholera toxin is the major virulent factor but other virulence and virulence-related factors play certain roles in the pathogenesis and survival in the host. Along with the evolution of the epidemic s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5725-y

    authors: Li Z,Pang B,Wang D,Li J,Xu J,Fang Y,Lu X,Kan B

    更新日期:2019-05-09 00:00:00

  • Complex sense-antisense architecture of TNFAIP1/POLDIP2 on 17q11.2 represents a novel transcriptional structural-functional gene module involved in breast cancer progression.

    abstract:BACKGROUND:A sense-antisense gene pair (SAGP) is a gene pair where two oppositely transcribed genes share a common nucleotide sequence region. In eukaryotic genomes, SAGPs can be organized in complex sense-antisense architectures (CSAGAs) in which at least one sense gene shares loci with two or more antisense partners....

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-S1-S9

    authors: Grinchuk OV,Motakis E,Kuznetsov VA

    更新日期:2010-02-10 00:00:00

  • Genes to predict VO2max trainability: a systematic review.

    abstract:BACKGROUND:Cardiorespiratory fitness (VO2max) is an excellent predictor of chronic disease morbidity and mortality risk. Guidelines recommend individuals undertake exercise training to improve VO2max for chronic disease reduction. However, there are large inter-individual differences between exercise training responses...

    journal_title:BMC genomics

    pub_type: 杂志文章,评审

    doi:10.1186/s12864-017-4192-6

    authors: Williams CJ,Williams MG,Eynon N,Ashton KJ,Little JP,Wisloff U,Coombes JS

    更新日期:2017-11-14 00:00:00

  • Identification of dysfunctional modules and disease genes in congenital heart disease by a network-based approach.

    abstract:BACKGROUND:The incidence of congenital heart disease (CHD) is continuously increasing among infants born alive nowadays, making it one of the leading causes of infant morbidity worldwide. Various studies suggest that both genetic and environmental factors lead to CHD, and therefore identifying its candidate genes and d...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-592

    authors: He D,Liu ZP,Chen L

    更新日期:2011-12-02 00:00:00

  • Colorectal cancer cell-derived microvesicles are enriched in cell cycle-related mRNAs that promote proliferation of endothelial cells.

    abstract:BACKGROUND:Various cancer cells, including those of colorectal cancer (CRC), release microvesicles (exosomes) into surrounding tissues and peripheral circulation. These microvesicles can mediate communication between cells and affect various tumor-related processes in their target cells. RESULTS:We present potential r...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-556

    authors: Hong BS,Cho JH,Kim H,Choi EJ,Rho S,Kim J,Kim JH,Choi DS,Kim YK,Hwang D,Gho YS

    更新日期:2009-11-25 00:00:00

  • Effects of dietary physical or nutritional factors on morphology of rumen papillae and transcriptome changes in lactating dairy cows based on three different forage-based diets.

    abstract:BACKGROUND:Rumen epithelial tissue plays an important role in nutrient absorption and rumen health. However, whether forage quality and particle size impact the rumen epithelial morphology is unclear. The current study was conducted to elucidate the effects of forage quality and forage particle size on rumen epithelial...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3726-2

    authors: Wang B,Wang D,Wu X,Cai J,Liu M,Huang X,Wu J,Liu J,Guan L

    更新日期:2017-05-06 00:00:00

  • Cytosine methylation is a conserved epigenetic feature found throughout the phylum Platyhelminthes.

    abstract:BACKGROUND:The phylum Platyhelminthes (flatworms) contains an important group of bilaterian organisms responsible for many debilitating and chronic infectious diseases of human and animal populations inhabiting the planet today. In addition to their biomedical and veterinary relevance, some platyhelminths are also freq...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-462

    authors: Geyer KK,Chalmers IW,Mackintosh N,Hirst JE,Geoghegan R,Badets M,Brophy PM,Brehm K,Hoffmann KF

    更新日期:2013-07-09 00:00:00

  • EPAS1 gene variants are associated with sprint/power athletic performance in two cohorts of European athletes.

    abstract:BACKGROUND:The endothelial PAS domain protein 1 (EPAS1) activates genes that are involved in erythropoiesis and angiogenesis, thus favoring a better delivery of oxygen to the tissues and is a plausible candidate to influence athletic performance. Using innovative statistical methods we compared genotype distributions a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-382

    authors: Voisin S,Cieszczyk P,Pushkarev VP,Dyatlov DA,Vashlyayev BF,Shumaylov VA,Maciejewska-Karlowska A,Sawczuk M,Skuza L,Jastrzebski Z,Bishop DJ,Eynon N

    更新日期:2014-05-18 00:00:00

  • Identification of prohormones and pituitary neuropeptides in the African cichlid, Astatotilapia burtoni.

    abstract:BACKGROUND:Cichlid fishes have evolved remarkably diverse reproductive, social, and feeding behaviors. Cell-to-cell signaling molecules, notably neuropeptides and peptide hormones, are known to regulate these behaviors across vertebrates. This class of signaling molecules derives from prohormone genes that have undergo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2914-9

    authors: Hu CK,Southey BR,Romanova EV,Maruska KP,Sweedler JV,Fernald RD

    更新日期:2016-08-19 00:00:00

  • Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles.

    abstract:BACKGROUND:Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agriculture crop world-wide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and m...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-454

    authors: Guo S,Liu J,Zheng Y,Huang M,Zhang H,Gong G,He H,Ren Y,Zhong S,Fei Z,Xu Y

    更新日期:2011-09-21 00:00:00

  • Gene regulation of Sclerotinia sclerotiorum during infection of Glycine max: on the road to pathogenesis.

    abstract:BACKGROUND:Sclerotinia sclerotiorum is a broad-host range necrotrophic pathogen which is the causative agent of Sclerotinia stem rot (SSR), and a major disease of soybean (Glycine max). A time course transcriptomic analysis was performed in both compatible and incompatible soybean lines to identify pathogenicity and de...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5517-4

    authors: Westrick NM,Ranjan A,Jain S,Grau CR,Smith DL,Kabbage M

    更新日期:2019-02-26 00:00:00

  • The aspartic proteinase family of three Phytophthora species.

    abstract:BACKGROUND:Phytophthora species are oomycete plant pathogens with such major social and economic impact that genome sequences have been determined for Phytophthora infestans, P. sojae and P. ramorum. Pepsin-like aspartic proteinases (APs) are produced in a wide variety of species (from bacteria to humans) and contain c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-254

    authors: Kay J,Meijer HJ,ten Have A,van Kan JA

    更新日期:2011-05-20 00:00:00

  • Disease gene identification by random walk on multigraphs merging heterogeneous genomic and phenotype data.

    abstract:BACKGROUND:High throughput experiments resulted in many genomic datasets and hundreds of candidate disease genes. To discover the real disease genes from a set of candidate genes, computational methods have been proposed and worked on various types of genomic data sources. As a single source of genomic data is prone of...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S7-S27

    authors: Li Y,Li J

    更新日期:2012-01-01 00:00:00

  • Developing high throughput genotyped chromosome segment substitution lines based on population whole-genome re-sequencing in rice (Oryza sativa L.).

    abstract:BACKGROUND:Genetic populations provide the basis for a wide range of genetic and genomic studies and have been widely used in genetic mapping, gene discovery and genomics-assisted breeding. Chromosome segment substitution lines (CSSLs) are the most powerful tools for the detection and precise mapping of quantitative tr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-656

    authors: Xu J,Zhao Q,Du P,Xu C,Wang B,Feng Q,Liu Q,Tang S,Gu M,Han B,Liang G

    更新日期:2010-11-24 00:00:00

  • Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples.

    abstract:BACKGROUND:Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. RESULTS:We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogast...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-509

    authors: Liu Z,Venkatesh SS,Maley CC

    更新日期:2008-10-30 00:00:00

  • Functional genomics and microbiome profiling of the Asian longhorned beetle (Anoplophora glabripennis) reveal insights into the digestive physiology and nutritional ecology of wood feeding beetles.

    abstract:BACKGROUND:Wood-feeding beetles harbor an ecologically rich and taxonomically diverse assemblage of gut microbes that appear to promote survival in woody tissue, which is devoid of nitrogen and essential nutrients. Nevertheless, the contributions of these apparent symbionts to digestive physiology and nutritional ecolo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1096

    authors: Scully ED,Geib SM,Carlson JE,Tien M,McKenna D,Hoover K

    更新日期:2014-12-12 00:00:00

  • Genome-wide transcriptional analysis of T cell activation reveals differential gene expression associated with psoriasis.

    abstract:BACKGROUND:Psoriasis is a chronic autoimmune disease in which T cells have a predominant role in initiating and perpetuating the chronic inflammation in skin. However, the mechanisms that regulate T cell activation in psoriasis are still incompletely understood. The objective of the present study was to characterize th...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-825

    authors: Palau N,Julià A,Ferrándiz C,Puig L,Fonseca E,Fernández E,López-Lasanta M,Tortosa R,Marsal S

    更新日期:2013-11-23 00:00:00

  • Comparative transcriptomics between high and low rubber producing Taraxacum kok-saghyz R. plants.

    abstract:BACKGROUND:Taraxacum kok-saghyz R. (Tks) is a promising alternative species to Hevea brasiliensis for production of high quality natural rubber (NR). A comparative transcriptome analysis of plants with differential production of NR will contribute to elucidate which genes are involved in the synthesis, regulation and a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5287-4

    authors: Panara F,Lopez L,Daddiego L,Fantini E,Facella P,Perrotta G

    更新日期:2018-12-04 00:00:00

  • Topology and expressed repertoire of the Felis catus T cell receptor loci.

    abstract:BACKGROUND:The domestic cat (Felis catus) is an important companion animal and is used as a large animal model for human disease. However, the comprehensive study of adaptive immunity in this species is hampered by the lack of data on lymphocyte antigen receptor genes and usage. The objectives of this study were to ann...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6431-5

    authors: Radtanakatikanon A,Keller SM,Darzentas N,Moore PF,Folch G,Nguefack Ngoune V,Lefranc MP,Vernau W

    更新日期:2020-01-06 00:00:00

  • First comprehensive analysis of lysine acetylation in Alvinocaris longirostris from the deep-sea hydrothermal vents.

    abstract:BACKGROUND:Deep-sea hydrothermal vents are unique chemoautotrophic ecosystems with harsh conditions. Alvinocaris longirostris is one of the dominant crustacean species inhabiting in these extreme environments. It is significant to clarify mechanisms in their adaptation to the vents. Lysine acetylation has been known to...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4745-3

    authors: Hui M,Cheng J,Sha Z

    更新日期:2018-05-10 00:00:00

  • Flux of transcript patterns during soybean seed development.

    abstract:BACKGROUND:To understand gene expression networks leading to functional properties of the soybean seed, we have undertaken a detailed examination of soybean seed development during the stages of major accumulation of oils, proteins, and starches, as well as the desiccating and mature stages, using microarrays consistin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-136

    authors: Jones SI,Gonzalez DO,Vodkin LO

    更新日期:2010-02-24 00:00:00