Impact of genomic structural variation in Drosophila melanogaster based on population-scale sequencing.

Abstract:

:Genomic structural variation (SV) is a major determinant for phenotypic variation. Although it has been extensively studied in humans, the nucleotide resolution structure of SVs within the widely used model organism Drosophila remains unknown. We report a highly accurate, densely validated map of unbalanced SVs comprising 8962 deletions and 916 tandem duplications in 39 lines derived from short-read DNA sequencing in a natural population (the "Drosophila melanogaster Genetic Reference Panel," DGRP). Most SVs (>90%) were inferred at nucleotide resolution, and a large fraction was genotyped across all samples. Comprehensive analyses of SV formation mechanisms using the short-read data revealed an abundance of SVs formed by mobile element and nonhomologous end-joining-mediated rearrangements, and clustering of variants into SV hotspots. We further observed a strong depletion of SVs overlapping genes, which, along with population genetics analyses, suggests that these SVs are often deleterious. We inferred several gene fusion events also highlighting the potential role of SVs in the generation of novel protein products. Expression quantitative trait locus (eQTL) mapping revealed the functional impact of our high-resolution SV map, with quantifiable effects at >100 genic loci. Our map represents a resource for population-level studies of SVs in an important model organism.

journal_name

Genome Res

journal_title

Genome research

authors

Zichner T,Garfield DA,Rausch T,Stütz AM,Cannavó E,Braun M,Furlong EE,Korbel JO

doi

10.1101/gr.142646.112

subject

Has Abstract

pub_date

2013-03-01 00:00:00

pages

568-79

issue

3

eissn

1088-9051

issn

1549-5469

pii

gr.142646.112

journal_volume

23

pub_type

杂志文章
  • Nonrandom domain organization of the Arabidopsis genome at the nuclear periphery.

    abstract::The nuclear space is not a homogeneous biochemical environment. Many studies have demonstrated that the transcriptional activity of a gene is linked to its positioning within the nuclear space. Following the discovery of lamin-associated domains (LADs), which are transcriptionally repressed chromatin regions, the nonr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.215186.116

    authors: Bi X,Cheng YJ,Hu B,Ma X,Wu R,Wang JW,Liu C

    更新日期:2017-07-01 00:00:00

  • Gene and alternative splicing annotation with AIR.

    abstract::Designing effective and accurate tools for identifying the functional and structural elements in a genome remains at the frontier of genome annotation owing to incompleteness and inaccuracy of the data, limitations in the computational models, and shifting paradigms in genomics, such as alternative splicing. We presen...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2889405

    authors: Florea L,Di Francesco V,Miller J,Turner R,Yao A,Harris M,Walenz B,Mobarry C,Merkulov GV,Charlab R,Dew I,Deng Z,Istrail S,Li P,Sutton G

    更新日期:2005-01-01 00:00:00

  • Estimating coarse gene network structure from large-scale gene perturbation data.

    abstract::Large scale gene perturbation experiments generate information about the number of genes whose activity is directly or indirectly affected by a gene perturbation. From this information, one can numerically estimate coarse structural network features such as the total number of direct regulatory interactions and the nu...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.193902

    authors: Wagner A

    更新日期:2002-02-01 00:00:00

  • Spatiotemporal clustering of the epigenome reveals rules of dynamic gene regulation.

    abstract::Spatial organization of different epigenomic marks was used to infer functions of the epigenome. It remains unclear what can be learned from the temporal changes of the epigenome. Here, we developed a probabilistic model to cluster genomic sequences based on the similarity of temporal changes of multiple epigenomic ma...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.144949.112

    authors: Yu P,Xiao S,Xin X,Song CX,Huang W,McDee D,Tanaka T,Wang T,He C,Zhong S

    更新日期:2013-02-01 00:00:00

  • Caenorhabditis elegans has scores of hedgehog-related genes: sequence and expression analysis.

    abstract::Previously, we have described novel families of genes, warthog (wrt) and groundhog (grd), in Caenorhabditis elegans. They are related to Hedgehog (Hh) through the carboxy-terminal autoprocessing domain (called Hog or Hint). A comprehensive survey revealed 10 genes with Hog/Hint modules in C. elegans. Five of these are...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.9.10.909

    authors: Aspöck G,Kagoshima H,Niklaus G,Bürglin TR

    更新日期:1999-10-01 00:00:00

  • Species-specific class I gene expansions formed the telomeric 1 mb of the mouse major histocompatibility complex.

    abstract::We have determined the complete sequence of 951,695 bp from the class I region of H2, the mouse major histocompatibility complex (Mhc) from strain 129/Sv (haplotype bc). The sequence contains 26 genes. The sequence spans from the last 50 kb of the H2-T region, including 2 class I genes and 3 class I pseudogenes, and i...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.975303

    authors: Takada T,Kumánovics A,Amadou C,Yoshino M,Jones EP,Athanasiou M,Evans GA,Fischer Lindahl K

    更新日期:2003-04-01 00:00:00

  • The mRNA-bound proteome of the early fly embryo.

    abstract::Early embryogenesis is characterized by the maternal to zygotic transition (MZT), in which maternally deposited messenger RNAs are degraded while zygotic transcription begins. Before the MZT, post-transcriptional gene regulation by RNA-binding proteins (RBPs) is the dominant force in embryo patterning. We used two mRN...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.200386.115

    authors: Wessels HH,Imami K,Baltz AG,Kolinski M,Beldovskaya A,Selbach M,Small S,Ohler U,Landthaler M

    更新日期:2016-07-01 00:00:00

  • A virome-wide clonal integration analysis platform for discovering cancer viral etiology.

    abstract::Oncoviral infection is responsible for 12%-15% of cancer in humans. Convergent evidence from epidemiology, pathology, and oncology suggests that new viral etiologies for cancers remain to be discovered. Oncoviral profiles can be obtained from cancer genome sequencing data; however, widespread viral sequence contaminat...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.242529.118

    authors: Chen X,Kost J,Sulovari A,Wong N,Liang WS,Cao J,Li D

    更新日期:2019-05-01 00:00:00

  • A computational genomics approach to identify cis-regulatory modules from chromatin immunoprecipitation microarray data--a case study using E2F1.

    abstract::Advances in high-throughput technologies, such as ChIP-chip, and the completion of human and mouse genomic sequences now allow analysis of the mechanisms of gene regulation on a systems level. In this study, we have developed a computational genomics approach (termed ChIPModules), which begins with experimentally dete...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5520206

    authors: Jin VX,Rabinovich A,Squazzo SL,Green R,Farnham PJ

    更新日期:2006-12-01 00:00:00

  • A nuclear matrix attachment site in the 4q35 locus has an enhancer-blocking activity in vivo: implications for the facio-scapulo-humeral dystrophy.

    abstract::Facio-scapulo-humeral dystrophy (FSHD), a muscular hereditary disease with a prevalence of 1 in 20,000, is caused by a partial deletion of a subtelomeric repeat array on chromosome 4q. Earlier, we demonstrated the existence in the vicinity of the D4Z4 repeat of a nuclear matrix attachment site, FR-MAR, efficient in no...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6620908

    authors: Petrov A,Allinne J,Pirozhkova I,Laoudj D,Lipinski M,Vassetzky YS

    更新日期:2008-01-01 00:00:00

  • YY1 and CTCF orchestrate a 3D chromatin looping switch during early neural lineage commitment.

    abstract::CTCF is an architectural protein with a critical role in connecting higher-order chromatin folding in pluripotent stem cells. Recent reports have suggested that CTCF binding is more dynamic during development than previously appreciated. Here, we set out to understand the extent to which shifts in genome-wide CTCF occ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.215160.116

    authors: Beagan JA,Duong MT,Titus KR,Zhou L,Cao Z,Ma J,Lachanski CV,Gillis DR,Phillips-Cremins JE

    更新日期:2017-07-01 00:00:00

  • Widespread plasticity in CTCF occupancy linked to DNA methylation.

    abstract::CTCF is a ubiquitously expressed regulator of fundamental genomic processes including transcription, intra- and interchromosomal interactions, and chromatin structure. Because of its critical role in genome function, CTCF binding patterns have long been assumed to be largely invariant across different cellular environ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.136101.111

    authors: Wang H,Maurano MT,Qu H,Varley KE,Gertz J,Pauli F,Lee K,Canfield T,Weaver M,Sandstrom R,Thurman RE,Kaul R,Myers RM,Stamatoyannopoulos JA

    更新日期:2012-09-01 00:00:00

  • A first version of the Caenorhabditis elegans Promoterome.

    abstract::An important aspect of the development of systems biology approaches in metazoans is the characterization of expression patterns of nearly all genes predicted from genome sequences. Such "localizome" maps should provide information on where (in what cells or tissues) and when (at what stage of development or under wha...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2497604

    authors: Dupuy D,Li QR,Deplancke B,Boxem M,Hao T,Lamesch P,Sequerra R,Bosak S,Doucette-Stamm L,Hope IA,Hill DE,Walhout AJ,Vidal M

    更新日期:2004-10-01 00:00:00

  • Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data.

    abstract::Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-seq) revolutionized...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.265173.120

    authors: Shao W,Wang T

    更新日期:2021-01-01 00:00:00

  • Dissecting transcription regulatory pathways through a new bacterial one-hybrid reporter system.

    abstract::Sequence-specific DNA-binding transcription factors have widespread biological significance in the regulation of gene expression. However, in lower prokaryotes and eukaryotic metazoans, it is usually difficult to find transcription regulatory factors that recognize specific target promoters. To address this, we have d...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.086595.108

    authors: Guo M,Feng H,Zhang J,Wang W,Wang Y,Li Y,Gao C,Chen H,Feng Y,He ZG

    更新日期:2009-07-01 00:00:00

  • Multiplex mapping of chromatin accessibility and DNA methylation within targeted single molecules identifies epigenetic heterogeneity in neural stem cells and glioblastoma.

    abstract::Human tumors are comprised of heterogeneous cell populations that display diverse molecular and phenotypic features. To examine the extent to which epigenetic differences contribute to intratumoral cellular heterogeneity, we have developed a high-throughput method, termed MAPit-patch. The method uses multiplexed ampli...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.161737.113

    authors: Nabilsi NH,Deleyrolle LP,Darst RP,Riva A,Reynolds BA,Kladde MP

    更新日期:2014-02-01 00:00:00

  • An abundance of bidirectional promoters in the human genome.

    abstract::The alignment of full-length human cDNA sequences to the finished sequence of the human genome provides a unique opportunity to study the distribution of genes throughout the genome. By analyzing the distances between 23,752 genes, we identified a class of divergently transcribed gene pairs, representing more than 10%...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1982804

    authors: Trinklein ND,Aldred SF,Hartman SJ,Schroeder DI,Otillar RP,Myers RM

    更新日期:2004-01-01 00:00:00

  • Comprehensive genome sequence analysis of a breast cancer amplicon.

    abstract::Gene amplification occurs in most solid tumors and is associated with poor prognosis. Amplification of 20q13.2 is common to several tumor types including breast cancer. The 1 Mb of sequence spanning the 20q13.2 breast cancer amplicon is one of the most exhaustively studied segments of the human genome. These studies h...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.gr1743r

    authors: Collins C,Volik S,Kowbel D,Ginzinger D,Ylstra B,Cloutier T,Hawkins T,Predki P,Martin C,Wernick M,Kuo WL,Alberts A,Gray JW

    更新日期:2001-06-01 00:00:00

  • Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis.

    abstract::Through comparative studies of the model organism Arabidopsis thaliana and its close relative Brassica oleracea, we have identified conserved regions that represent potentially functional sequences overlooked by previous Arabidopsis genome annotation methods. A total of 454,274 whole genome shotgun sequences covering ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3176505

    authors: Ayele M,Haas BJ,Kumar N,Wu H,Xiao Y,Van Aken S,Utterback TR,Wortman JR,White OR,Town CD

    更新日期:2005-04-01 00:00:00

  • Antisense transcripts with FANTOM2 clone set and their implications for gene regulation.

    abstract::We have used the FANTOM2 mouse cDNA set (60,770 clones), public mRNA data, and mouse genome sequence data to identify 2481 pairs of sense-antisense transcripts and 899 further pairs of nonantisense bidirectional transcription based upon genomic mapping. The analysis greatly expands the number of known examples of sens...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.982903

    authors: Kiyosawa H,Yamanaka I,Osato N,Kondo S,Hayashizaki Y,RIKEN GER Group.,GSL Members.

    更新日期:2003-06-01 00:00:00

  • A bioinformatics-based strategy identifies c-Myc and Cdc25A as candidates for the Apmt mammary tumor latency modifiers.

    abstract::The epistatically interacting modifier loci (Apmt1 and Apmt2) accelerate the polyoma Middle-T (PyVT)-induced mammary tumor. To identify potential candidate genes loci, a combined bioinformatics and genomics strategy was used. On the basis of the assumption that the loci were functioning in the same or intersecting pat...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.210502

    authors: Cozma D,Lukes L,Rouse J,Qiu TH,Liu ET,Hunter KW

    更新日期:2002-06-01 00:00:00

  • Computational and experimental identification of mirtrons in Drosophila melanogaster and Caenorhabditis elegans.

    abstract::Mirtrons are intronic hairpin substrates of the dicing machinery that generate functional microRNAs. In this study, we describe experimental assays that defined the essential requirements for entry of introns into the mirtron pathway. These data informed a bioinformatic screen that effectively identified functional mi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.113050.110

    authors: Chung WJ,Agius P,Westholm JO,Chen M,Okamura K,Robine N,Leslie CS,Lai EC

    更新日期:2011-02-01 00:00:00

  • Genome-wide parent-of-origin DNA methylation analysis reveals the intricacies of human imprinting and suggests a germline methylation-independent mechanism of establishment.

    abstract::Differential methylation between the two alleles of a gene has been observed in imprinted regions, where the methylation of one allele occurs on a parent-of-origin basis, the inactive X-chromosome in females, and at those loci whose methylation is driven by genetic variants. We have extensively characterized imprinted...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.164913.113

    authors: Court F,Tayama C,Romanelli V,Martin-Trujillo A,Iglesias-Platas I,Okamura K,Sugahara N,Simón C,Moore H,Harness JV,Keirstead H,Sanchez-Mut JV,Kaneki E,Lapunzina P,Soejima H,Wake N,Esteller M,Ogata T,Hata K,Nakabayashi

    更新日期:2014-04-01 00:00:00

  • High mutational rates of large-scale duplication and deletion in Daphnia pulex.

    abstract::Knowledge of the genome-wide rate and spectrum of mutations is necessary to understand the origin of disease and the genetic variation driving all evolutionary processes. Here, we provide a genome-wide analysis of the rate and spectrum of mutations obtained in two Daphnia pulex genotypes via separate mutation-accumula...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.191338.115

    authors: Keith N,Tucker AE,Jackson CE,Sung W,Lucas Lledó JI,Schrider DR,Schaack S,Dudycha JL,Ackerman M,Younge AJ,Shaw JR,Lynch M

    更新日期:2016-01-01 00:00:00

  • The pig X and Y Chromosomes: structure, sequence, and evolution.

    abstract::We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which a...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.188839.114

    authors: Skinner BM,Sargent CA,Churcher C,Hunt T,Herrero J,Loveland JE,Dunn M,Louzada S,Fu B,Chow W,Gilbert J,Austin-Guest S,Beal K,Carvalho-Silva D,Cheng W,Gordon D,Grafham D,Hardy M,Harley J,Hauser H,Howden P,Howe K,

    更新日期:2016-01-01 00:00:00

  • General gene movement off the X chromosome in the Drosophila genus.

    abstract::In Drosophila melanogaster, there is an excess of genes duplicated by retroposition from the X chromosome to the autosomes. Most of those retrogenes that originated on the X chromosome have testis expression pattern. These observations could be explained by natural selection favoring genes that avoided spermatogenesis...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.088609.108

    authors: Vibranovski MD,Zhang Y,Long M

    更新日期:2009-05-01 00:00:00

  • A periodic pattern of SNPs in the human genome.

    abstract::By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for exampl...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6223207

    authors: Madsen BE,Villesen P,Wiuf C

    更新日期:2007-10-01 00:00:00

  • Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error.

    abstract::It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.097543.109

    authors: Liu X,Fu YX,Maxwell TJ,Boerwinkle E

    更新日期:2010-01-01 00:00:00

  • The human obese (OB) gene: RNA expression pattern and mapping on the physical, cytogenetic, and genetic maps of chromosome 7.

    abstract::The recently identified mouse obese (ob) gene apparently encodes a secreted protein that may function in the signaling pathway of adipose tissue. Mutations in the mouse ob gene are associated with the early development of gross obesity. A detailed knowledge concerning the RNA expression pattern and precise genomic loc...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5.1.5

    authors: Green ED,Maffei M,Braden VV,Proenca R,DeSilva U,Zhang Y,Chua SC Jr,Leibel RL,Weissenbach J,Friedman JM

    更新日期:1995-08-01 00:00:00

  • Next-generation sequencing identifies the natural killer cell microRNA transcriptome.

    abstract::Natural killer (NK) cells are innate lymphocytes important for early host defense against infectious pathogens and surveillance against malignant transformation. Resting murine NK cells regulate the translation of effector molecule mRNAs (e.g., granzyme B, GzmB) through unclear molecular mechanisms. MicroRNAs (miRNAs)...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.107995.110

    authors: Fehniger TA,Wylie T,Germino E,Leong JW,Magrini VJ,Koul S,Keppel CR,Schneider SE,Koboldt DC,Sullivan RP,Heinz ME,Crosby SD,Nagarajan R,Ramsingh G,Link DC,Ley TJ,Mardis ER

    更新日期:2010-11-01 00:00:00