An integrated computational pipeline and database to support whole-genome sequence annotation.

Abstract:

:We describe here our experience in annotating the Drosophila melanogaster genome sequence, in the course of which we developed several new open-source software tools and a database schema to support large-scale genome annotation. We have developed these into an integrated and reusable software system for whole-genome annotation. The key contributions to overall annotation quality are the marshalling of high-quality sequences for alignments and the design of a system with an adaptable and expandable flexible architecture.

journal_name

Genome Biol

journal_title

Genome biology

authors

Mungall CJ,Misra S,Berman BP,Carlson J,Frise E,Harris N,Marshall B,Shu S,Kaminker JS,Prochnik SE,Smith CD,Smith E,Tupy JL,Wiel C,Rubin GM,Lewis SE

doi

10.1186/gb-2002-3-12-research0081

keywords:

subject

Has Abstract

pub_date

2002-01-01 00:00:00

pages

RESEARCH0081

issue

12

eissn

1474-7596

issn

1474-760X

journal_volume

3

pub_type

杂志文章,评审
  • Genome-wide investigation of light and carbon signaling interactions in Arabidopsis.

    abstract:BACKGROUND:Light and carbon are two essential signals influencing plant growth and development. Little is known about how carbon and light signaling pathways intersect or influence one another to affect gene expression. RESULTS:Microarrays are used to investigate carbon and light signaling interactions at a genome-wid...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2004-5-2-r10

    authors: Thum KE,Shin MJ,Palenchar PM,Kouranov A,Coruzzi GM

    更新日期:2004-01-01 00:00:00

  • Domain atrophy creates rare cases of functional partial protein domains.

    abstract:BACKGROUND:Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0655-8

    authors: Prakash A,Bateman A

    更新日期:2015-04-30 00:00:00

  • Target competition: transcription factors enter the limelight.

    abstract::Transcription factor binding sites compete for a limited pool of bioavailable transcription factor molecules to fine tune gene expression. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/gb4174

    authors: Karreth FA,Tay Y,Pandolfi PP

    更新日期:2014-04-28 00:00:00

  • Reproducible inference of transcription factor footprints in ATAC-seq and DNase-seq datasets using protocol-specific bias modeling.

    abstract:BACKGROUND:DNase-seq and ATAC-seq are broadly used methods to assay open chromatin regions genome-wide. The single nucleotide resolution of DNase-seq has been further exploited to infer transcription factor binding sites (TFBSs) in regulatory regions through footprinting. Recent studies have demonstrated the sequence b...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1654-y

    authors: Karabacak Calviello A,Hirsekorn A,Wurmus R,Yusuf D,Ohler U

    更新日期:2019-02-21 00:00:00

  • Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray.

    abstract::The ability to quantitatively measure the expression of all genes in a given tissue or cell with a single assay is an exciting promise of gene-expression profiling technology. An in situ-synthesized 60-mer oligonucleotide microarray designed to detect transcripts from all mouse genes was validated, as well as a set of...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2005-6-7-r61

    authors: Carter MG,Sharov AA,VanBuren V,Dudekula DB,Carmack CE,Nelson C,Ko MS

    更新日期:2005-01-01 00:00:00

  • A fuzzy gene expression-based computational approach improves breast cancer prognostication.

    abstract::Early gene expression studies classified breast tumors into at least three clinically relevant subtypes. Although most current gene signatures are prognostic for estrogen receptor (ER) positive/human epidermal growth factor receptor 2 (HER2) negative breast cancers, few are informative for ER negative/HER2 negative an...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2010-11-2-r18

    authors: Haibe-Kains B,Desmedt C,Rothé F,Piccart M,Sotiriou C,Bontempi G

    更新日期:2010-01-01 00:00:00

  • Where is genomics going next?

    abstract::We polled the Editorial Board of Genome Biology to ask where they see genomics going in the next few years. Here are some of their responses. ...

    journal_title:Genome biology

    pub_type: 社论,面试

    doi:10.1186/s13059-019-1626-2

    authors: Cheifet B

    更新日期:2019-01-22 00:00:00

  • A human lung tumor microenvironment interactome identifies clinically relevant cell-type cross-talk.

    abstract:BACKGROUND:Tumors comprise a complex microenvironment of interacting malignant and stromal cell types. Much of our understanding of the tumor microenvironment comes from in vitro studies isolating the interactions between malignant cells and a single stromal cell type, often along a single pathway. RESULT:To develop a...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02019-x

    authors: Gentles AJ,Hui AB,Feng W,Azizi A,Nair RV,Bouchard G,Knowles DA,Yu A,Jeong Y,Bejnood A,Forgó E,Varma S,Xu Y,Kuong A,Nair VS,West R,van de Rijn M,Hoang CD,Diehn M,Plevritis SK

    更新日期:2020-05-07 00:00:00

  • Methylome evolution in plants.

    abstract::Despite major progress in dissecting the molecular pathways that control DNA methylation patterns in plants, little is known about the mechanisms that shape plant methylomes over evolutionary time. Drawing on recent intra- and interspecific epigenomic studies, we show that methylome evolution over long timescales is l...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/s13059-016-1127-5

    authors: Vidalis A,Živković D,Wardenaar R,Roquis D,Tellier A,Johannes F

    更新日期:2016-12-20 00:00:00

  • Dramatic changes in transcription factor binding over evolutionary time.

    abstract::A recent study reveals a surprisingly high degree of change in the occupancy patterns of two transcription factors in the livers of five vertebrates. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/gb-2010-11-6-122

    authors: Weirauch MT,Hughes TR

    更新日期:2010-01-01 00:00:00

  • Surfing waves of data in San Diego: sophisticated analyses provide a broad view of human genetic diversity.

    abstract::A report on the 64th annual American Society of Human Genetics meeting held in San Diego, USA, 18-22 October, 2014. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/s13059-014-0562-4

    authors: Reppell M,Koch E,Peter BM,Novembre J

    更新日期:2014-12-17 00:00:00

  • High throughput single-cell detection of multiplex CRISPR-edited gene modifications.

    abstract::CRISPR-Cas9 gene editing has transformed our ability to rapidly interrogate the functional impact of somatic mutations in human cancers. Droplet-based technology enables the analysis of Cas9-introduced gene edits in thousands of single cells. Using this technology, we analyze Ba/F3 cells engineered to express single o...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02174-1

    authors: Ten Hacken E,Clement K,Li S,Hernández-Sánchez M,Redd R,Wang S,Ruff D,Gruber M,Baranowski K,Jacob J,Flynn J,Jones KW,Neuberg D,Livak KJ,Pinello L,Wu CJ

    更新日期:2020-10-20 00:00:00

  • Quantifying the mechanisms of domain gain in animal proteins.

    abstract:BACKGROUND:Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechan...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2010-11-7-r74

    authors: Buljan M,Frankish A,Bateman A

    更新日期:2010-01-01 00:00:00

  • Can sequence determine function?

    abstract::The functional annotation of proteins identified in genome sequencing projects is based on similarities to homologs in the databases. As a result of the possible strategies for divergent evolution, homologous enzymes frequently do not catalyze the same reaction, and we conclude that assignment of function from sequenc...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2000-1-5-reviews0005

    authors: Gerlt JA,Babbitt PC

    更新日期:2000-01-01 00:00:00

  • The Adult Mouse Anatomical Dictionary: a tool for annotating and integrating data.

    abstract::We have developed an ontology to provide standardized nomenclature for anatomical terms in the postnatal mouse. The Adult Mouse Anatomical Dictionary is structured as a directed acyclic graph, and is organized hierarchically both spatially and functionally. The ontology will be used to annotate and integrate different...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2005-6-3-r29

    authors: Hayamizu TF,Mangan M,Corradi JP,Kadin JA,Ringwald M

    更新日期:2005-01-01 00:00:00

  • Functional constraint and small insertions and deletions in the ENCODE regions of the human genome.

    abstract:BACKGROUND:We describe the distribution of indels in the 44 Encyclopedia of DNA Elements (ENCODE) regions (about 1% of the human genome) and evaluate the potential contributions of small insertion and deletion polymorphisms (indels) to human genetic variation. We relate indels to known genomic annotation features and m...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-9-r180

    authors: Clark TG,Andrew T,Cooper GM,Margulies EH,Mullikin JC,Balding DJ

    更新日期:2007-01-01 00:00:00

  • Inferring protein domain interactions from databases of interacting proteins.

    abstract::We describe domain pair exclusion analysis (DPEA), a method for inferring domain interactions from databases of interacting proteins. DPEA features a log odds score, Eij, reflecting confidence that domains i and j interact. We analyzed 177,233 potential domain interactions underlying 26,032 protein interactions. In to...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2005-6-10-r89

    authors: Riley R,Lee C,Sabatti C,Eisenberg D

    更新日期:2005-01-01 00:00:00

  • A versatile reporter system for CRISPR-mediated chromosomal rearrangements.

    abstract::Although chromosomal deletions and inversions are important in cancer, conventional methods for detecting DNA rearrangements require laborious indirect assays. Here we develop fluorescent reporters to rapidly quantify CRISPR/Cas9-mediated deletions and inversions. We find that inversion depends on the non-homologous e...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0680-7

    authors: Li Y,Park AI,Mou H,Colpan C,Bizhanova A,Akama-Garren E,Joshi N,Hendrickson EA,Feldser D,Yin H,Anderson DG,Jacks T,Weng Z,Xue W

    更新日期:2015-05-28 00:00:00

  • Molecular mechanisms of spindle function.

    abstract::The key molecules involved in regulating the assembly and function of the mitotic spindle are shared by evolutionarily divergent species. Studies in different model systems are leading to convergent conclusions about the central role of microtubule nucleation and dynamics and of kinesin-related motor proteins in spind...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2000-1-1-reviews101

    authors: Walczak CE

    更新日期:2000-01-01 00:00:00

  • FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease.

    abstract:BACKGROUND:Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically p...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-12-r170

    authors: Chen R,Morgan AA,Dudley J,Deshpande T,Li L,Kodama K,Chiang AP,Butte AJ

    更新日期:2008-01-01 00:00:00

  • Common gene expression strategies revealed by genome-wide analysis in yeast.

    abstract:BACKGROUND:Gene expression is a two-step synthesis process that ends with the necessary amount of each protein required to perform its function. Since the protein is the final product, the main focus of gene regulation should be centered on it. However, because mRNA is an intermediate step and the amounts of both mRNA ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-10-r222

    authors: García-Martínez J,González-Candelas F,Pérez-Ortín JE

    更新日期:2007-01-01 00:00:00

  • Extensive transcriptomic and epigenomic remodelling occurs during Arabidopsis thaliana germination.

    abstract:BACKGROUND:Seed germination involves progression from complete metabolic dormancy to a highly active, growing seedling. Many factors regulate germination and these interact extensively, forming a complex network of inputs that control the seed-to-seedling transition. Our understanding of the direct regulation of gene e...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1302-3

    authors: Narsai R,Gouil Q,Secco D,Srivastava A,Karpievitch YV,Liew LC,Lister R,Lewsey MG,Whelan J

    更新日期:2017-09-15 00:00:00

  • Accelerating research through reagent repositories: the genome editing example.

    abstract::Keith Joung, Dan Voytas and Joanne Kamens share insights into how the genome editing field was advanced by early access to biological resources and the role in this process that plasmid repositories play. ...

    journal_title:Genome biology

    pub_type: 面试

    doi:10.1186/s13059-015-0830-y

    authors: Joung JK,Voytas DF,Kamens J

    更新日期:2015-11-20 00:00:00

  • Systematic analysis of dark and camouflaged genes reveals disease-relevant genes hiding in plain sight.

    abstract:BACKGROUND:The human genome contains "dark" gene regions that cannot be adequately assembled or aligned using standard short-read sequencing technologies, preventing researchers from identifying mutations within these gene regions that may be relevant to human disease. Here, we identify regions with few mappable reads ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1707-2

    authors: Ebbert MTW,Jensen TD,Jansen-West K,Sens JP,Reddy JS,Ridge PG,Kauwe JSK,Belzil V,Pregent L,Carrasquillo MM,Keene D,Larson E,Crane P,Asmann YW,Ertekin-Taner N,Younkin SG,Ross OA,Rademakers R,Petrucelli L,Fryer JD

    更新日期:2019-05-20 00:00:00

  • Chemical genomics in yeast.

    abstract::Many drugs have unknown, controversial or multiple mechanisms of action. Four recent 'chemical genomic' studies, using genome-scale collections of yeast gene deletions that were either arrayed or barcoded, have presented complementary approaches to identifying gene-drug and pathway-drug interactions. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2004-5-9-240

    authors: Brenner C

    更新日期:2004-01-01 00:00:00

  • Assembling allopolyploid genomes: no longer formidable.

    abstract::A combined approach of whole genome shotgun sequencing and ultra-high density linkage mapping using skim sequencing of a segregating population is effective for assembling allopolyploid genomes. ...

    journal_title:Genome biology

    pub_type: 信件

    doi:10.1186/s13059-015-0585-5

    authors: Ming R,Man Wai C

    更新日期:2015-01-31 00:00:00

  • Comparative genomics of gene-family size in closely related bacteria.

    abstract:BACKGROUND:The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appears to have a fundamental role in this, but the factors influencing gene family size are unclear. RESULTS:The relative content o...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2004-5-4-r27

    authors: Pushker R,Mira A,Rodríguez-Valera F

    更新日期:2004-01-01 00:00:00

  • An integrated multi-omics approach to identify regulatory mechanisms in cancer metastatic processes.

    abstract:BACKGROUND:Metastatic progress is the primary cause of death in most cancers, yet the regulatory dynamics driving the cellular changes necessary for metastasis remain poorly understood. Multi-omics approaches hold great promise for addressing this challenge; however, current analysis tools have limited capabilities to ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02213-x

    authors: Ghaffari S,Hanson C,Schmidt RE,Bouchonville KJ,Offer SM,Sinha S

    更新日期:2021-01-07 00:00:00

  • Quantitative protein expression profiling reveals extensive post-transcriptional regulation and post-translational modifications in schizont-stage malaria parasites.

    abstract:BACKGROUND:Malaria is a one of the most important infectious diseases and is caused by parasitic protozoa of the genus Plasmodium. Previously, quantitative characterization of the P. falciparum transcriptome demonstrated that the strictly controlled progression of these parasites through their intra-erythrocytic develo...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-12-r177

    authors: Foth BJ,Zhang N,Mok S,Preiser PR,Bozdech Z

    更新日期:2008-01-01 00:00:00

  • Genome-wide mutagenesis of Zea mays L. using RescueMu transposons.

    abstract::Derived from the maize Mu1 transposon, RescueMu provides strategies for maize gene discovery and mutant phenotypic analysis. 9.92 Mb of gene-enriched sequences next to RescueMu insertion sites were co-assembled with expressed sequence tags and analyzed. Multiple plasmid recoveries identified probable germinal insertio...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2004-5-10-r82

    authors: Fernandes J,Dong Q,Schneider B,Morrow DJ,Nan GL,Brendel V,Walbot V

    更新日期:2004-01-01 00:00:00