Determining multiallelic complex copy number and sequence variation from high coverage exome sequencing data.

Abstract:

BACKGROUND:Copy number variation (CNV) is a major component of genomic variation, yet methods to accurately type genomic CNV lag behind methods that type single nucleotide variation. High-throughput sequencing can contribute to these methods by using sequence read depth, which takes the number of reads that map to a given part of the reference genome as a proxy for copy number of that region, and compares across samples. Furthermore, high-throughput sequencing also provides information on the sequence differences between copies within and between individuals. METHODS:In this study we use high-coverage phase 3 exome sequences of the 1000 Genomes project to infer diploid copy number of the beta-defensin genomic region, a well-studied CNV that carries several beta-defensin genes involved in the antimicrobial response, signalling, and fertility. We also use these data to call sequence variants, a particular challenge given the multicopy nature of the region. RESULTS:We confidently call copy number and sequence variation of the beta-defensin genes on 1285 samples from 26 global populations, validate copy number using Nanostring nCounter and triplex paralogue ratio test data. We use the copy number calls to verify the genomic extent of the CNV and validate sequence calls using analysis of cloned PCR products. We identify novel variation, mostly individually rare, predicted to alter amino-acid sequence in the beta-defensin genes. Such novel variants may alter antimicrobial properties or have off-target receptor interactions, and may contribute to individuality in immunological response and fertility. CONCLUSIONS:Given that 81% of identified sequence variants were not previously in dbSNP, we show that sequence variation in multiallelic CNVs represent an unappreciated source of genomic diversity.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Forni D,Martin D,Abujaber R,Sharp AJ,Sironi M,Hollox EJ

doi

10.1186/s12864-015-2123-y

subject

Has Abstract

pub_date

2015-11-02 00:00:00

pages

891

issn

1471-2164

pii

10.1186/s12864-015-2123-y

journal_volume

16

pub_type

杂志文章
  • Genomic expression during human myelopoiesis.

    abstract:BACKGROUND:Human myelopoiesis is an exciting biological model for cellular differentiation since it represents a plastic process where multipotent stem cells gradually limit their differentiation potential, generating different precursor cells which finally evolve into distinct terminally differentiated cells. This stu...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-264

    authors: Ferrari F,Bortoluzzi S,Coppe A,Basso D,Bicciato S,Zini R,Gemelli C,Danieli GA,Ferrari S

    更新日期:2007-08-03 00:00:00

  • Characteristics and clustering of human ribosomal protein genes.

    abstract:BACKGROUND:The ribosome is a central player in the translation system, which in mammals consists of four RNA species and 79 ribosomal proteins (RPs). The control mechanisms of gene expression and the functions of RPs are believed to be identical. Most RP genes have common promoters and were therefore assumed to have a ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-37

    authors: Ishii K,Washio T,Uechi T,Yoshihama M,Kenmochi N,Tomita M

    更新日期:2006-02-28 00:00:00

  • Papain-like cysteine proteases in Carica papaya: lineage-specific gene duplication and expansion.

    abstract:BACKGROUND:Papain-like cysteine proteases (PLCPs), a large group of cysteine proteases structurally related to papain, play important roles in plant development, senescence, and defense responses. Papain, the first cysteine protease whose structure was determined by X-ray crystallography, plays a crucial role in protec...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4394-y

    authors: Liu J,Sharma A,Niewiara MJ,Singh R,Ming R,Yu Q

    更新日期:2018-01-06 00:00:00

  • Markedly different genome arrangements between serotype a strains and serotypes b or c strains of Aggregatibacter actinomycetemcomitans.

    abstract:BACKGROUND:Bacterial phenotype may be profoundly affected by the physical arrangement of their genes in the genome. The Gram-negative species Aggregatibacter actinomycetemcomitans is a major etiologic agent of human periodontitis. Individual clonal types of A. actinomycetemcomitans may exhibit variable virulence and di...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-489

    authors: Kittichotirat W,Bumgarner R,Chen C

    更新日期:2010-09-08 00:00:00

  • Association of the matrix attachment region recognition signature with coding regions in Caenorhabditis elegans.

    abstract:BACKGROUND:Matrix attachment regions (MAR) are the sites on genomic DNA that interact with the nuclear matrix. There is increasing evidence for the involvement of MAR in regulation of gene expression. The unsuitability of experimental detection of MAR for genome-wide analyses has led to the development of computational...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-418

    authors: Anthony A,Blaxter M

    更新日期:2007-11-15 00:00:00

  • Muscle regeneration in dystrophin-deficient mdx mice studied by gene expression profiling.

    abstract:BACKGROUND:Duchenne muscular dystrophy (DMD), caused by mutations in the dystrophin gene, is lethal. In contrast, dystrophin-deficient mdx mice recover due to effective regeneration of affected muscle tissue. To characterize the molecular processes associated with regeneration, we compared gene expression levels in hin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-98

    authors: Turk R,Sterrenburg E,de Meijer EJ,van Ommen GJ,den Dunnen JT,'t Hoen PA

    更新日期:2005-07-13 00:00:00

  • Comparative analysis of the predicted secretomes of Rosaceae scab pathogens Venturia inaequalis and V. pirina reveals expanded effector families and putative determinants of host range.

    abstract:BACKGROUND:Fungal plant pathogens belonging to the genus Venturia cause damaging scab diseases of members of the Rosaceae. In terms of economic impact, the most important of these are V. inaequalis, which infects apple, and V. pirina, which is a pathogen of European pear. Given that Venturia fungi colonise the sub-cuti...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3699-1

    authors: Deng CH,Plummer KM,Jones DAB,Mesarich CH,Shiller J,Taranto AP,Robinson AJ,Kastner P,Hall NE,Templeton MD,Bowen JK

    更新日期:2017-05-02 00:00:00

  • De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray.

    abstract:BACKGROUND:Senegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and tools were recently described in these species, further sequencing efforts are required to establish a comple...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-952

    authors: Benzekri H,Armesto P,Cousin X,Rovira M,Crespo D,Merlo MA,Mazurais D,Bautista R,Guerrero-Fernández D,Fernandez-Pozo N,Ponce M,Infante C,Zambonino JL,Nidelet S,Gut M,Rebordinos L,Planas JV,Bégout ML,Claros MG,Manchado

    更新日期:2014-11-03 00:00:00

  • Transcriptional response to sulfide in the Echiuran Worm Urechis unicinctus by digital gene expression analysis.

    abstract:BACKGROUND:Urechis unicinctus, an echiuran worm inhabiting the U-shaped burrows in the coastal mud flats, is an important commercial and ecological invertebrate in Northeast Asian countries, which has potential applications in the study of animal evolution, coastal sediment improvement and marine drug development. Furt...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2094-z

    authors: Liu X,Zhang L,Zhang Z,Ma X,Liu J

    更新日期:2015-10-21 00:00:00

  • Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes.

    abstract:BACKGROUND:Many gram-negative bacteria use type III secretion systems (T3SSs) to translocate effector proteins into host cells. T3SS effectors can give some bacteria a competitive edge over others within the same environment and can help bacteria to invade the host cells and allow them to multiply rapidly within the ho...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3363-1

    authors: Hobbs CK,Porter VL,Stow ML,Siame BA,Tsang HH,Leung KY

    更新日期:2016-12-19 00:00:00

  • ISMapper: identifying transposase insertion sites in bacterial genomes from short read sequence data.

    abstract:BACKGROUND:Insertion sequences (IS) are small transposable elements, commonly found in bacterial genomes. Identifying the location of IS in bacterial genomes can be useful for a variety of purposes including epidemiological tracking and predicting antibiotic resistance. However IS are commonly present in multiple copie...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1860-2

    authors: Hawkey J,Hamidian M,Wick RR,Edwards DJ,Billman-Jacobe H,Hall RM,Holt KE

    更新日期:2015-09-03 00:00:00

  • Screening populations for copy number variation using genotyping-by-sequencing: a proof of concept using soybean fast neutron mutants.

    abstract:BACKGROUND:The effective use of mutant populations for reverse genetic screens relies on the population-wide characterization of the induced mutations. Genome- and population-wide characterization of the mutations found in fast neutron populations has been hindered, however, by the wide range of mutations generated and...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5998-1

    authors: Lemay MA,Torkamaneh D,Rigaill G,Boyle B,Stec AO,Stupar RM,Belzile F

    更新日期:2019-08-06 00:00:00

  • Structured RNAs and synteny regions in the pig genome.

    abstract:BACKGROUND:Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However, a more direct strateg...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-459

    authors: Anthon C,Tafer H,Havgaard JH,Thomsen B,Hedegaard J,Seemann SE,Pundhir S,Kehr S,Bartschat S,Nielsen M,Nielsen RO,Fredholm M,Stadler PF,Gorodkin J

    更新日期:2014-06-10 00:00:00

  • Expansion of CORE-SINEs in the genome of the Tasmanian devil.

    abstract:BACKGROUND:The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species' survival. To better understand the Tasma...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-172

    authors: Nilsson MA,Janke A,Murchison EP,Ning Z,Hallström BM

    更新日期:2012-05-06 00:00:00

  • Microarray profiling for differential gene expression in PMSG-hCG stimulated preovulatory ovarian follicles of Chinese Taihu and Large White sows.

    abstract:BACKGROUND:The Chinese Taihu is one of the most prolific pig breeds in the world, which farrows at least five more piglets per litter than Western pig breeds partly due to a greater ovulation rate. Variation of ovulation rate maybe associated with the differences in the transcriptome of Chinese Taihu and Large White ov...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-111

    authors: Sun X,Mei S,Tao H,Wang G,Su L,Jiang S,Deng C,Xiong Y,Li F

    更新日期:2011-02-16 00:00:00

  • Comparison of leaf transcriptome in response to Rhizoctonia solani infection between resistant and susceptible rice cultivars.

    abstract:BACKGROUND:Sheath blight (SB), caused by Rhizoctonia solani, is a common rice disease worldwide. Currently, rice cultivars with robust resistance to R. solani are still lacking. To provide theoretic basis for molecular breeding of R. solani-resistant rice cultivars, the changes of transcriptome profiles in response to ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6645-6

    authors: Shi W,Zhao SL,Liu K,Sun YB,Ni ZB,Zhang GY,Tang HS,Zhu JW,Wan BJ,Sun HQ,Dai JY,Sun MF,Yan GH,Wang AM,Zhu GY

    更新日期:2020-03-19 00:00:00

  • A Gaijin-like miniature inverted repeat transposable element is mobilized in rice during cell differentiation.

    abstract:BACKGROUND:Miniature inverted repeat transposable element (MITE) is one type of transposable element (TE), which is largely found in eukaryotic genomes and involved in a wide variety of biological events. However, only few MITEs were proved to be currently active and their physiological function remains largely unknown...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-135

    authors: Dong HT,Zhang L,Zheng KL,Yao HG,Chen J,Yu FC,Yu XX,Mao BZ,Zhao D,Yao J,Li DB

    更新日期:2012-04-13 00:00:00

  • Alpha tubulin genes from Leishmania braziliensis: genomic organization, gene structure and insights on their expression.

    abstract:BACKGROUND:Alpha tubulin is a fundamental component of the cytoskeleton which is responsible for cell shape and is involved in cell division, ciliary and flagellar motility and intracellular transport. Alpha tubulin gene expression varies according to the morphological changes suffered by Leishmania in its life cycle. ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-454

    authors: Ramírez CA,Requena JM,Puerta CJ

    更新日期:2013-07-06 00:00:00

  • Stenotrophomonas comparative genomics reveals genes and functions that differentiate beneficial and pathogenic bacteria.

    abstract:BACKGROUND:In recent years, the number of human infections caused by opportunistic pathogens has increased dramatically. Plant rhizospheres are one of the most typical natural reservoirs for these pathogens but they also represent a great source for beneficial microbes with potential for biotechnological applications. ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-482

    authors: Alavi P,Starcher MR,Thallinger GG,Zachow C,Müller H,Berg G

    更新日期:2014-06-18 00:00:00

  • CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences.

    abstract:BACKGROUND:The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annota...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-715

    authors: Liu C,Shi L,Zhu Y,Chen H,Zhang J,Lin X,Guan X

    更新日期:2012-12-20 00:00:00

  • Fungal genomes mining to discover novel sterol esterases and lipases as catalysts.

    abstract:BACKGROUND:Sterol esterases and lipases are enzymes able to efficiently catalyze synthesis and hydrolysis reactions of both sterol esters and triglycerides and due to their versatility could be widely used in different industrial applications. Lipases with this ability have been reported in the yeast Candida rugosa tha...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-712

    authors: Barriuso J,Prieto A,Martínez MJ

    更新日期:2013-10-18 00:00:00

  • Optimizing de novo assembly of short-read RNA-seq data for phylogenomics.

    abstract:BACKGROUND:RNA-seq has shown huge potential for phylogenomic inferences in non-model organisms. However, error, incompleteness, and redundant assembled transcripts for each gene in de novo assembly of short reads cause noise in analyses and a large amount of missing data in the aligned matrix. To address these problems...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-328

    authors: Yang Y,Smith SA

    更新日期:2013-05-14 00:00:00

  • Exploring the genetics of trotting racing ability in horses using a unique Nordic horse model.

    abstract:BACKGROUND:Horses have been strongly selected for speed, strength, and endurance-exercise traits since the onset of domestication. As a result, highly specialized horse breeds have developed with many modern horse breeds often representing closed populations with high phenotypic and genetic uniformity. However, a great...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5484-9

    authors: Velie BD,Lillie M,Fegraeus KJ,Rosengren MK,Solé M,Wiklund M,Ihler CF,Strand E,Lindgren G

    更新日期:2019-02-04 00:00:00

  • Extensive analysis of D-J-C arrangements allows the identification of different mechanisms enhancing the diversity in sheep T cell receptor beta-chain repertoire.

    abstract:BACKGROUND:In most species of mammals, the TRB locus has the common feature of a library of TRBV genes positioned at the 5'- end of two in tandem aligned D-J-C gene clusters, each composed of a single TRBD gene, 6-7 TRBJ genes and one TRBC gene. An enhancer located at the 3'end of the last TRBC and a well-defined promo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-3

    authors: Di Tommaso S,Antonacci R,Ciccarese S,Massari S

    更新日期:2010-01-04 00:00:00

  • Biclustering of transcriptome sequencing data reveals human tissue-specific circular RNAs.

    abstract:BACKGROUND:Emerging evidence has been experimentally confirmed the tissue-specific expression of circRNAs (circRNAs). Global identification of human tissue-specific circRNAs is crucial for the functionality study, which facilitates the discovery of circRNAs for potential diagnostic biomarkers. RESULTS:In this study, c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4335-9

    authors: Liu YC,Chiu YJ,Li JR,Sun CH,Liu CC,Huang HD

    更新日期:2018-01-19 00:00:00

  • Polysaccharides utilization in human gut bacterium Bacteroides thetaiotaomicron: comparative genomics reconstruction of metabolic and regulatory networks.

    abstract:BACKGROUND:Bacteroides thetaiotaomicron, a predominant member of the human gut microbiota, is characterized by its ability to utilize a wide variety of polysaccharides using the extensive saccharolytic machinery that is controlled by an expanded repertoire of transcription factors (TFs). The availability of genomic seq...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-873

    authors: Ravcheev DA,Godzik A,Osterman AL,Rodionov DA

    更新日期:2013-12-12 00:00:00

  • H2A.Z marks antisense promoters and has positive effects on antisense transcript levels in budding yeast.

    abstract:BACKGROUND:The histone variant H2A.Z, which has been reported to have both activating and repressive effects on gene expression, is known to occupy nucleosomes at the 5' ends of protein-coding genes. RESULTS:We now find that H2A.Z is also significantly enriched in gene coding regions and at the 3' ends of genes in bud...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1247-4

    authors: Gu M,Naiyachit Y,Wood TJ,Millar CB

    更新日期:2015-02-19 00:00:00

  • Genomics of sex allocation in the parasitoid wasp Nasonia vitripennis.

    abstract:BACKGROUND:Whilst adaptive facultative sex allocation has been widely studied at the phenotypic level across a broad range of organisms, we still know remarkably little about its genetic architecture. Here, we explore the genome-wide basis of sex ratio variation in the parasitoid wasp Nasonia vitripennis, perhaps the b...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06904-4

    authors: Pannebakker BA,Cook N,van den Heuvel J,van de Zande L,Shuker DM

    更新日期:2020-07-20 00:00:00

  • Comparative transcriptomics uncovers alternative splicing changes and signatures of selection from maize improvement.

    abstract:BACKGROUND:Alternative splicing (AS) is an important regulatory mechanism that greatly contributes to eukaryotic transcriptome diversity. A substantial amount of evidence has demonstrated that AS complexity is relevant to eukaryotic evolution, development, adaptation, and complexity. In this study, six teosinte and ten...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1582-5

    authors: Huang J,Gao Y,Jia H,Liu L,Zhang D,Zhang Z

    更新日期:2015-05-08 00:00:00

  • Identification of transcription factors potential related to brown planthopper resistance in rice via microarray expression profiling.

    abstract:BACKGROUND:Brown planthopper (BPH), Nilaparvata lugens Stål, is one of the most destructive insect pests of rice. The molecular responses of plants to sucking insects resemble responses to pathogen infection. However, the molecular mechanism of BPH-resistance in rice remains unclear. Transcription factors (TF) are up-s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-687

    authors: Wang Y,Guo H,Li H,Zhang H,Miao X

    更新日期:2012-12-10 00:00:00