Assessing structural variation in a personal genome: towards a human reference diploid genome.

Abstract:

BACKGROUND: Characterizing large genomic variants is essential to expanding the research and clinical applications of genome sequencing. While multiple data types and methods are available to detect these structural variants (SVs), they remain less characterized than smaller variants because of SV diversity, complexity, and size. These challenges are exacerbated by the experimental and computational demands of SV analysis. Here, we characterize the SV content of a personal genome with Parliament, a publicly available consensus SV-calling infrastructure that merges multiple data types and SV detection methods.

RESULTS: We demonstrate Parliament's efficacy via integrated analyses of whole-genome array comparative genomic hybridization, short-read next-generation sequencing, long-read (Pacific BioSciences RSII), long-insert (Illumina Nextera), and whole-genome architecture (BioNano Irys) data from the personal genome of a single subject (HS1011). From this genome, Parliament identified 31,007 genomic loci between 100 bp and 1 Mbp that are inconsistent with the hg19 reference assembly. Of these loci, 9,777 are supported as putative SVs by hybrid local assembly, long-read PacBio data, or multi-source heuristics. These SVs span 59 Mbp of the reference genome (1.8%) and include 3,801 events identified only with long-read data. The HS1011 data and complete Parliament infrastructure, including a BAM-to-SV workflow, are available on the cloud-based service DNAnexus.

CONCLUSIONS: HS1011 SV analysis reveals the limits and advantages of multiple sequencing technologies, specifically the impact of long-read SV discovery. With the full Parliament infrastructure, the HS1011 data constitute a public resource for novel SV discovery, software calibration, and personal genome structural variation analysis.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

English AC,Salerno WJ,Hampton OA,Gonzaga-Jauregui C,Ambreth S,Ritter DI,Beck CR,Davis CF,Dahdouli M,Ma S,Carroll A,Veeraraghavan N,Bruestle J,Drees B,Hastie A,Lam ET,White S,Mishra P,Wang M,Han Y,Zhang F,Stankie

doi

10.1186/s12864-015-1479-3

subject

Has Abstract

pub_date

2015-04-11 00:00:00

pages

286

issn

1471-2164

pii

10.1186/s12864-015-1479-3

journal_volume

16

pub_type

Journal Article
  • Comparative analysis of neural transcriptomes and functional implication of unannotated intronic expression.

    abstract:BACKGROUND:The transcriptome and its regulation bridge the genome and the phenome. Recent RNA-seq studies unveiled complex transcriptomes with previously unknown transcripts and functions. To investigate the characteristics of neural transcriptomes and possible functions of previously unknown transcripts, we analyzed a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-494

    authors: Sun Y,Wang Y,Hu Y,Chen G,Ma H

    update_date:2011-10-10 00:00:00

  • The genomic features of parasitism, Polyembryony and immune evasion in the endoparasitic wasp Macrocentrus cingulum.

    abstract:BACKGROUND:Parasitoid wasps are well-known natural enemies of major agricultural pests and arthropod borne diseases. The parasitoid wasp Macrocentrus cingulum (Hymenoptera: Braconidae) has been widely used to control the notorious insect pests Ostrinia furnacalis (Asian Corn Borer) and O. nubilalis (European corn borer...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4783-x

    authors: Yin C,Li M,Hu J,Lang K,Chen Q,Liu J,Guo D,He K,Dong Y,Luo J,Song Z,Walters JR,Zhang W,Li F,Chen X

    update_date:2018-05-30 00:00:00

  • Phylogenetic reconstruction from transpositions.

    abstract:BACKGROUND:Because of the advent of high-throughput sequencing and the consequent reduction in the cost of sequencing, many organisms have been completely sequenced and most of their genes identified. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the rearrangem...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-S2-S15

    authors: Yue F,Zhang M,Tang J

    update_date:2008-09-16 00:00:00

  • Deciphering gamma-decalactone biosynthesis in strawberry fruit using a combination of genetic mapping, RNA-Seq and eQTL analyses.

    abstract:BACKGROUND:Understanding the basis for volatile organic compound (VOC) biosynthesis and regulation is of great importance for the genetic improvement of fruit flavor. Lactones constitute an essential group of fatty acid-derived VOCs conferring peach-like aroma to a number of fruits including peach, plum, pineapple and ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-218

    authors: Sánchez-Sevilla JF,Cruz-Rus E,Valpuesta V,Botella MA,Amaya I

    update_date:2014-04-17 00:00:00

  • Modular assembly of transposable element arrays by microsatellite targeting in the guayule and rice genomes.

    abstract:BACKGROUND:Guayule (Parthenium argentatum A. Gray) is a rubber-producing desert shrub native to Mexico and the United States. Guayule represents an alternative to Hevea brasiliensis as a source for commercial natural rubber. The efficient application of modern molecular/genetic tools to guayule improvement requires cha...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4653-6

    authors: Valdes Franco JA,Wang Y,Huo N,Ponciano G,Colvin HA,McMahan CM,Gu YQ,Belknap WR

    update_date:2018-04-19 00:00:00

  • Gene expression patterns that predict sensitivity to epidermal growth factor receptor tyrosine kinase inhibitors in lung cancer cell lines and human lung tumors.

    abstract:BACKGROUND:Increased focus surrounds identifying patients with advanced non-small cell lung cancer (NSCLC) who will benefit from treatment with epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKI). EGFR mutation, gene copy number, coexpression of ErbB proteins and ligands, and epithelial to mesenchy...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-289

    authors: Balko JM,Potti A,Saunders C,Stromberg A,Haura EB,Black EP

    update_date:2006-11-10 00:00:00

  • Genome-wide association and transcriptional studies reveal novel genes for unsaturated fatty acid synthesis in a panel of soybean accessions.

    abstract:BACKGROUND:The nutritional value of soybean oil is largely influenced by the proportions of unsaturated fatty acids (FAs), including oleic acid (OA, 18:1), linoleic acid (LLA, 18:2), and linolenic acid (LNA, 18:3). Genome-wide association (GWAS) studies along with gene expression studies in soybean [Glycine max (L.) Me...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5449-z

    authors: Zhao X,Jiang H,Feng L,Qu Y,Teng W,Qiu L,Zheng H,Han Y,Li W

    update_date:2019-01-21 00:00:00

  • Microbial "social networks".

    abstract:BACKGROUND:It is well understood that distinct communities of bacteria are present at different sites of the body, and that changes in the structure of these communities have strong implications for human health. Yet, challenges remain in understanding the complex interconnections between the bacterial taxa within thes...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S11-S6

    authors: Fernandez M,Riveros JD,Campos M,Mathee K,Narasimhan G

    update_date:2015-01-01 00:00:00

  • Dose-dependent effects of small-molecule antagonists on the genomic landscape of androgen receptor binding.

    abstract:BACKGROUND:The androgen receptor plays a critical role throughout the progression of prostate cancer and is an important drug target for this disease. While chromatin immunoprecipitation coupled with massively parallel sequencing (ChIP-Seq) is becoming an essential tool for studying transcription and chromatin modifica...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-355

    authors: Zhu Z,Shi M,Hu W,Estrella H,Engebretsen J,Nichols T,Briere D,Hosea N,Los G,Rejto PA,Fanjul A

    update_date:2012-07-31 00:00:00

  • Coevolution of paired receptors in Xenopus carcinoembryonic antigen-related cell adhesion molecule families suggests appropriation as pathogen receptors.

    abstract:BACKGROUND:In mammals, CEACAM1 and closely related members represent paired receptors with similar extracellular ligand-binding regions and cytoplasmic domains with opposing functions. Human CEACAM1 and CEACAM3 which have inhibitory ITIM/ITSM and activating ITAM-like motifs, respectively, in their cytoplasmic regions a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3279-9

    authors: Zimmermann W,Kammerer R

    update_date:2016-11-16 00:00:00

  • Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes).

    abstract:BACKGROUND:Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-482

    authors: Kukekova AV,Johnson JL,Teiling C,Li L,Oskina IN,Kharlamova AV,Gulevich RG,Padte R,Dubreuil MM,Vladimirova AV,Shepeleva DV,Shikhevich SG,Sun Q,Ponnala L,Temnykh SV,Trut LN,Acland GM

    update_date:2011-10-03 00:00:00

  • Profiling and metaanalysis of epidermal keratinocytes responses to epidermal growth factor.

    abstract:BACKGROUND:One challenge of systems biology is the integration of new data into the preexisting, and then re-interpretation of the integrated data. Here we use readily available metaanalysis computational methods to integrate new data on the transcriptomic effects of EGF in primary human epidermal keratinocytes with pr...

    journal_title:BMC genomics

    pub_type: Journal Article, Meta-Analysis

    doi:10.1186/1471-2164-14-85

    authors: Blumenberg M

    update_date:2013-02-08 00:00:00

  • Mining microsatellite markers from public expressed sequence tags databases for the study of threatened plants.

    abstract:BACKGROUND:Simple Sequence Repeats (SSRs) are widely used in population genetic studies but their classical development is costly and time-consuming. The ever-increasing available DNA datasets generated by high-throughput techniques offer an inexpensive alternative for SSRs discovery. Expressed Sequence Tags (ESTs) hav...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2031-1

    authors: Lopez L,Barreiro R,Fischer M,Koch MA

    update_date:2015-10-13 00:00:00

  • Antagonistic, overlapping and distinct responses to biotic stress in rice (Oryza sativa) and interactions with abiotic stress.

    abstract:BACKGROUND:Every year, substantial crop loss occurs globally, as a result of bacterial, fungal, parasite and viral infections in rice. Here, we present an in-depth investigation of the transcriptomic response to infection with the destructive bacterial pathogen Xanthomonas oryzae pv. oryzae (Xoo) in both resistant and s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-93

    authors: Narsai R,Wang C,Chen J,Wu J,Shou H,Whelan J

    update_date:2013-02-12 00:00:00

  • Identification of favorable SNP alleles and candidate genes for traits related to early maturity via GWAS in upland cotton.

    abstract:BACKGROUND:Early maturity is one of the most important and complex agronomic traits in upland cotton (Gossypium hirsutum L). To dissect the genetic architecture of this agronomically important trait, a population consisting of 355 upland cotton germplasm accessions was genotyped using the specific-locus amplified fragm...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2875-z

    authors: Su J,Pang C,Wei H,Li L,Liang B,Wang C,Song M,Wang H,Zhao S,Jia X,Mao G,Huang L,Geng D,Wang C,Fan S,Yu S

    update_date:2016-08-30 00:00:00

  • Identification of recent cases of hepatitis C virus infection using physical-chemical properties of hypervariable region 1 and a radial basis function neural network classifier.

    abstract:BACKGROUND:Identification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission. Epidemiological investigations and chemistry-based laboratory tests are 2 main approaches that are available for identifi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4269-2

    authors: Lara J,Teka M,Khudyakov Y

    update_date:2017-12-06 00:00:00

  • ATP-binding cassette systems in Burkholderia pseudomallei and Burkholderia mallei.

    abstract:BACKGROUND:ATP binding cassette (ABC) systems are responsible for the import and export of a wide variety of molecules across cell membranes and comprise one of the largest protein superfamilies found in prokarya, eukarya and archaea. ABC systems play important roles in bacterial lifestyle, virulence and survival. In this s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-83

    authors: Harland DN,Dassa E,Titball RW,Brown KA,Atkins HS

    update_date:2007-03-28 00:00:00

  • A SNP in intron 8 of CD46 causes a novel transcript associated with mastitis in Holsteins.

    abstract:BACKGROUND:The membrane protein CD46, a ubiquitous cell surface pathogen receptor, can bind Streptococcus to trigger cell autophagy, which is a critical step in the control of infection. RESULTS:In this study, we found a new splice variant designated CD46 transcript variant (CD46-TV). The splice variant is characteriz...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-630

    authors: Wang X,Zhong J,Gao Y,Ju Z,Huang J

    update_date:2014-07-28 00:00:00

  • Methods for high-throughput MethylCap-Seq data analysis.

    abstract:BACKGROUND:Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropriate statistical measu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S6-S14

    authors: Rodriguez BA,Frankhouser D,Murphy M,Trimarchi M,Tam HH,Curfman J,Huang R,Chan MW,Lai HC,Parikh D,Ball B,Schwind S,Blum W,Marcucci G,Yan P,Bundschuh R

    update_date:2012-01-01 00:00:00

  • Identification of Nicotiana benthamiana microRNAs and their targets using high throughput sequencing and degradome analysis.

    abstract:BACKGROUND:Nicotiana benthamiana is a widely used model plant species for research on plant-pathogen interactions as well as other areas of plant science. It can be easily transformed or agroinfiltrated, therefore it is commonly used in studies requiring protein localization, interaction, or plant-based systems for pro...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2209-6

    authors: Baksa I,Nagy T,Barta E,Havelda Z,Várallyay É,Silhavy D,Burgyán J,Szittya G

    update_date:2015-12-01 00:00:00

  • Scalable optimal Bayesian classification of single-cell trajectories under regulatory model uncertainty.

    abstract:BACKGROUND:Single-cell gene expression measurements offer opportunities in deriving mechanistic understanding of complex diseases, including cancer. However, due to the complex regulatory machinery of the cell, gene regulatory network (GRN) model inference based on such data still manifests significant uncertainty. RE...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5720-3

    authors: Hajiramezanali E,Imani M,Braga-Neto U,Qian X,Dougherty ER

    update_date:2019-06-13 00:00:00

  • Characterization of a novel chicken muscle disorder through differential gene expression and pathway analysis using RNA-sequencing.

    abstract:BACKGROUND:Improvements in poultry production within the past 50 years have led to increased muscle yield and growth rate, which may be contributing to an increased rate and development of new muscle disorders in chickens. Previously reported muscle disorders and conditions are generally associated with poor meat quali...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1623-0

    authors: Mutryn MF,Brannick EM,Fu W,Lee WR,Abasht B

    update_date:2015-05-21 00:00:00

  • An analysis of the transcriptome of Teladorsagia circumcincta: its biological and biotechnological implications.

    abstract:BACKGROUND:Teladorsagia circumcincta (order Strongylida) is an economically important parasitic nematode of small ruminants (including sheep and goats) in temperate climatic regions of the world. Improved insights into the molecular biology of this parasite could underpin alternative methods required to control this an...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S7-S10

    authors: Menon R,Gasser RB,Mitreva M,Ranganathan S

    update_date:2012-01-01 00:00:00

  • Transcriptome analysis of bolting in A. tequilana reveals roles for florigen, MADS, fructans and gibberellins.

    abstract:BACKGROUND:Reliable indicators for the onset of flowering are not available for most perennial monocarpic species, representing a drawback for crops such as bamboo, agave and banana. The ability to predict and control the transition to the reproductive stage in A. tequilana would represent an advantage for field manage...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5808-9

    authors: Avila de Dios E,Delaye L,Simpson J

    update_date:2019-06-10 00:00:00

  • Massively parallel nanowell-based single-cell gene expression profiling.

    abstract:BACKGROUND:Technological advances have enabled transcriptome characterization of cell types at the single-cell level providing new biological insights. New methods that enable simple yet high-throughput single-cell expression profiling are highly desirable. RESULTS:Here we report a novel nanowell-based single-cell RNA...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3893-1

    authors: Goldstein LD,Chen YJ,Dunne J,Mir A,Hubschle H,Guillory J,Yuan W,Zhang J,Stinson J,Jaiswal B,Pahuja KB,Mann I,Schaal T,Chan L,Anandakrishnan S,Lin CW,Espinoza P,Husain S,Shapiro H,Swaminathan K,Wei S,Srinivasan M

    update_date:2017-07-07 00:00:00

  • Analysis of gene expression in soybean (Glycine max) roots in response to the root knot nematode Meloidogyne incognita using microarrays and KEGG pathways.

    abstract:BACKGROUND:Root-knot nematodes are sedentary endoparasites that can infect more than 3000 plant species. Root-knot nematodes cause an estimated $100 billion annual loss worldwide. For successful establishment of the root-knot nematode in its host plant, it causes dramatic morphological and physiological changes in plan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-220

    authors: Ibrahim HM,Hosseini P,Alkharouf NW,Hussein EH,Gamal El-Din Ael K,Aly MA,Matthews BF

    update_date:2011-05-10 00:00:00

  • Comparative genomic analysis of the Hafnia genus reveals an explicit evolutionary relationship between the species alvei and paralvei and provides insights into pathogenicity.

    abstract:BACKGROUND:The Hafnia genus is an opportunistic pathogen that has been implicated in both nosocomial and community-acquired infections. Although Hafnia is fairly often isolated from clinical material, its taxonomy has remained an unsolved riddle, and the involvement and importance of Hafnia in human disease is also unc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6123-1

    authors: Yin Z,Yuan C,Du Y,Yang P,Qian C,Wei Y,Zhang S,Huang D,Liu B

    update_date:2019-10-23 00:00:00

  • The comparison of four mitochondrial genomes reveals cytoplasmic male sterility candidate genes in cotton.

    abstract:BACKGROUND:The mitochondrial genomes of higher plants vary remarkably in size, structure and sequence content, as demonstrated by the accumulation and activity of repetitive DNA sequences. Incompatibility between mitochondrial genome and nuclear genome leads to non-functional male reproductive organs and results in cyt...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5122-y

    authors: Li S,Chen Z,Zhao N,Wang Y,Nie H,Hua J

    update_date:2018-10-26 00:00:00

  • Positively correlated miRNA-mRNA regulatory networks in mouse frontal cortex during early stages of alcohol dependence.

    abstract:BACKGROUND:Although the study of gene regulation via the action of specific microRNAs (miRNAs) has experienced a boom in recent years, the analysis of genome-wide interaction networks among miRNAs and respective targeted mRNAs has lagged behind. MicroRNAs simultaneously target many transcripts and fine-tune the express...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-725

    authors: Nunez YO,Truitt JM,Gorini G,Ponomareva ON,Blednov YA,Harris RA,Mayfield RD

    update_date:2013-10-22 00:00:00

  • HGCS: an online tool for prioritizing disease-causing gene variants by biological distance.

    abstract:BACKGROUND:Identifying the genotypes underlying human disease phenotypes is a fundamental step in human genetics and medicine. High-throughput genomic technologies provide thousands of genetic variants per individual. The causal genes of a specific phenotype are usually expected to be functionally close to each other. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-256

    authors: Itan Y,Mazel M,Mazel B,Abhyankar A,Nitschke P,Quintana-Murci L,Boisson-Dupuis S,Boisson B,Abel L,Zhang SY,Casanova JL

    update_date:2014-04-03 00:00:00