Vertebrate gene finding from multiple-species alignments using a two-level strategy.

Abstract:

BACKGROUND:One way in which the accuracy of gene structure prediction in vertebrate DNA sequences can be improved is by analyzing alignments with multiple related species, since functional regions of genes tend to be more conserved. RESULTS:We describe DOGFISH, a vertebrate gene finder consisting of a cleanly separated site classifier and structure predictor. The classifier scores potential splice sites and other features, using sequence alignments between multiple vertebrate species, while the structure predictor hypothesizes coding transcripts by combining these scores using a simple model of gene structure. This also identifies and assigns confidence scores to possible additional exons. Performance is assessed on the ENCODE regions. We predict transcripts and exons across the whole human genome, and identify over 10,000 high confidence new coding exons not in the Ensembl gene set. CONCLUSION:We present a practical multiple species gene prediction method. Accuracy improves as additional species, up to at least eight, are introduced. The novel predictions of the whole-genome scan should support efficient experimental verification.

journal_name

Genome Biol

journal_title

Genome biology

authors

Carter D,Durbin R

doi

10.1186/gb-2006-7-s1-s6

subject

Has Abstract

pub_date

2006-01-01 00:00:00

pages

S6.1-12

eissn

1474-7596

issn

1474-760X

pii

gb-2006-7-s1-s6

journal_volume

7 Suppl 1

pub_type

杂志文章
  • Chromatin Central: towards the comparative proteome by accurate mapping of the yeast proteomic environment.

    abstract:BACKGROUND:Understanding the design logic of living systems requires the understanding and comparison of proteomes. Proteomes define the commonalities between organisms more precisely than genomic sequences. Because uncertainties remain regarding the accuracy of proteomic data, several issues need to be resolved before...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-11-r167

    authors: Shevchenko A,Roguev A,Schaft D,Buchanan L,Habermann B,Sakalar C,Thomas H,Krogan NJ,Shevchenko A,Stewart AF

    更新日期:2008-01-01 00:00:00

  • Sequence-based genomics.

    abstract::A report on the Genome-Based Pathogen Biology meeting, Hinxton, UK, 7-10 July 2002. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2002-3-9-reports4029

    authors: Simpson AJ

    更新日期:2002-08-27 00:00:00

  • Using comparative genomics to reorder the human genome sequence into a virtual sheep genome.

    abstract:BACKGROUND:Is it possible to construct an accurate and detailed subgene-level map of a genome using bacterial artificial chromosome (BAC) end sequences, a sparse marker map, and the sequences of other genomes? RESULTS:A sheep BAC library, CHORI-243, was constructed and the BAC end sequences were determined and mapped ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-7-r152

    authors: Dalrymple BP,Kirkness EF,Nefedov M,McWilliam S,Ratnakumar A,Barris W,Zhao S,Shetty J,Maddox JF,O'Grady M,Nicholas F,Crawford AM,Smith T,de Jong PJ,McEwan J,Oddy VH,Cockett NE,International Sheep Genomics Consortium.

    更新日期:2007-01-01 00:00:00

  • Asymmetric relationships between proteins shape genome evolution.

    abstract:BACKGROUND:The relationships between proteins are often asymmetric: one protein (A) depends for its function on another protein (B), but the second protein does not depend on the first. In metabolic networks there are multiple pathways that converge into one central pathway. The enzymes in the converging pathways depen...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-2-r19

    authors: Notebaart RA,Kensche PR,Huynen MA,Dutilh BE

    更新日期:2009-02-12 00:00:00

  • The Amborella genome: an evolutionary reference for plant biology.

    abstract::The nuclear genome sequence of Amborella trichopoda, the sister species to all other extant angiosperms, will be an exceptional resource for plant genomics. ...

    journal_title:Genome biology

    pub_type: 信件

    doi:10.1186/gb-2008-9-3-402

    authors: Soltis DE,Albert VA,Leebens-Mack J,Palmer JD,Wing RA,dePamphilis CW,Ma H,Carlson JE,Altman N,Kim S,Wall PK,Zuccolo A,Soltis PS

    更新日期:2008-01-01 00:00:00

  • Benchmarking of computational error-correction methods for next-generation sequencing data.

    abstract:BACKGROUND:Recent advancements in next-generation sequencing have rapidly improved our ability to study genomic material at an unprecedented scale. Despite substantial improvements in sequencing technologies, errors present in the data still risk confounding downstream analysis and limiting the applicability of sequenc...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-01988-3

    authors: Mitchell K,Brito JJ,Mandric I,Wu Q,Knyazev S,Chang S,Martin LS,Karlsberg A,Gerasimov E,Littman R,Hill BL,Wu NC,Yang HT,Hsieh K,Chen L,Littman E,Shabani T,Enik G,Yao D,Sun R,Schroeder J,Eskin E,Zelikovsky A,S

    更新日期:2020-03-17 00:00:00

  • The bread wheat epigenomic map reveals distinct chromatin architectural and evolutionary features of functional genetic elements.

    abstract:BACKGROUND:Bread wheat is an allohexaploid species with a 16-Gb genome that has large intergenic regions, which presents a big challenge for pinpointing regulatory elements and further revealing the transcriptional regulatory mechanisms. Chromatin profiling to characterize the combinatorial patterns of chromatin signat...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1746-8

    authors: Li Z,Wang M,Lin K,Xie Y,Guo J,Ye L,Zhuang Y,Teng W,Ran X,Tong Y,Xue Y,Zhang W,Zhang Y

    更新日期:2019-07-15 00:00:00

  • Conserved rules govern genetic interaction degree across species.

    abstract:BACKGROUND:Synthetic genetic interactions have recently been mapped on a genome scale in the budding yeast Saccharomyces cerevisiae, providing a functional view of the central processes of eukaryotic life. Currently, comprehensive genetic interaction networks have not been determined for other species, and we therefore...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2012-13-7-r57

    authors: Koch EN,Costanzo M,Bellay J,Deshpande R,Chatfield-Reed K,Chua G,D'Urso G,Andrews BJ,Boone C,Myers CL

    更新日期:2012-07-02 00:00:00

  • Molecular mechanisms of spindle function.

    abstract::The key molecules involved in regulating the assembly and function of the mitotic spindle are shared by evolutionarily divergent species. Studies in different model systems are leading to convergent conclusions about the central role of microtubule nucleation and dynamics and of kinesin-related motor proteins in spind...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2000-1-1-reviews101

    authors: Walczak CE

    更新日期:2000-01-01 00:00:00

  • Transcriptional profiling of long non-coding RNAs and novel transcribed regions across a diverse panel of archived human cancers.

    abstract:BACKGROUND:Molecular characterization of tumors has been critical for identifying important genes in cancer biology and for improving tumor classification and diagnosis. Long non-coding RNAs, as a new, relatively unstudied class of transcripts, provide a rich opportunity to identify both functional drivers and cancer-t...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2012-13-8-r75

    authors: Brunner AL,Beck AH,Edris B,Sweeney RT,Zhu SX,Li R,Montgomery K,Varma S,Gilks T,Guo X,Foley JW,Witten DM,Giacomini CP,Flynn RA,Pollack JR,Tibshirani R,Chang HY,van de Rijn M,West RB

    更新日期:2012-08-28 00:00:00

  • Genomic analysis reveals that Pseudomonas aeruginosa virulence is combinatorial.

    abstract:BACKGROUND:Pseudomonas aeruginosa is a ubiquitous environmental bacterium and an important opportunistic human pathogen. Generally, the acquisition of genes in the form of pathogenicity islands distinguishes pathogenic isolates from nonpathogens. We therefore sequenced a highly virulent strain of P. aeruginosa, PA14, a...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2006-7-10-r90

    authors: Lee DG,Urbach JM,Wu G,Liberati NT,Feinbaum RL,Miyata S,Diggins LT,He J,Saucier M,Déziel E,Friedman L,Li L,Grills G,Montgomery K,Kucherlapati R,Rahme LG,Ausubel FM

    更新日期:2006-01-01 00:00:00

  • Age and sun exposure-related widespread genomic blocks of hypomethylation in nonmalignant skin.

    abstract:BACKGROUND:Aging and sun exposure are the leading causes of skin cancer. It has been shown that epigenetic changes, such as DNA methylation, are well established mechanisms for cancer, and also have emerging roles in aging and common disease. Here, we directly ask whether DNA methylation is altered following skin aging...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0644-y

    authors: Vandiver AR,Irizarry RA,Hansen KD,Garza LA,Runarsson A,Li X,Chien AL,Wang TS,Leung SG,Kang S,Feinberg AP

    更新日期:2015-04-16 00:00:00

  • Illuminating the genome-wide activity of genome editors for safe and effective therapeutics.

    abstract::Genome editing holds remarkable promise to transform human medicine as new therapies that can directly address the genetic causes of disease. However, concerns remain about possible undesired biological consequences of genome editors, particularly the introduction of unintended 'off-target' mutations. Here, we discuss...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1610-2

    authors: Cheng Y,Tsai SQ

    更新日期:2018-12-22 00:00:00

  • Molecular orchestration of the hepatic circadian symphony.

    abstract::The circadian clock determines the rhythmic expression of many different genes throughout a 24-hour period. A recent study investigating the circadian regulation of liver proteins reveals multiple levels of regulation, including transcriptional, post-transcriptional and post-translational mechanisms. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2006-7-9-234

    authors: Albrecht U

    更新日期:2006-01-01 00:00:00

  • Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene.

    abstract:BACKGROUND:RNA sequencing has opened new avenues for the study of transcriptome composition. Significant evidence has accumulated showing that the human transcriptome contains in excess of a hundred thousand different transcripts. However, it is still not clear to what extent this diversity prevails when considering th...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2013-14-7-r70

    authors: Gonzàlez-Porta M,Frankish A,Rung J,Harrow J,Brazma A

    更新日期:2013-07-01 00:00:00

  • Single-cell RNA-seq transcriptome analysis of linear and circular RNAs in mouse preimplantation embryos.

    abstract::Circular RNAs (circRNAs) are a new class of non-polyadenylated non-coding RNAs that may play important roles in many biological processes. Here we develop a single-cell universal poly(A)-independent RNA sequencing (SUPeR-seq) method to sequence both polyadenylated and non-polyadenylated RNAs from individual cells. Thi...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0706-1

    authors: Fan X,Zhang X,Wu X,Guo H,Hu Y,Tang F,Huang Y

    更新日期:2015-07-23 00:00:00

  • Genomic studies of mood disorders -- the brain as a muscle?

    abstract::Recent genomic studies showing abnormalities in the fibroblast growth factor system in the postmortem brains of people with major depressive disorder support previous indications of a role for growth factors in mood disorders. Similar molecular pathways, volumetric changes, and the effects of exercise on mood suggest ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2005-6-4-215

    authors: Niculescu AB

    更新日期:2005-01-01 00:00:00

  • Systematic identification of genetic influences on methylation across the human life course.

    abstract:BACKGROUND:The influence of genetic variation on complex diseases is potentially mediated through a range of highly dynamic epigenetic processes exhibiting temporal variation during development and later life. Here we present a catalogue of the genetic influences on DNA methylation (methylation quantitative trait loci ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-0926-z

    authors: Gaunt TR,Shihab HA,Hemani G,Min JL,Woodward G,Lyttleton O,Zheng J,Duggirala A,McArdle WL,Ho K,Ring SM,Evans DM,Davey Smith G,Relton CL

    更新日期:2016-03-31 00:00:00

  • Recurrent insertion and duplication generate networks of transposable element sequences in the Drosophila melanogaster genome.

    abstract:BACKGROUND:The recent availability of genome sequences has provided unparalleled insights into the broad-scale patterns of transposable element (TE) sequences in eukaryotic genomes. Nevertheless, the difficulties that TEs pose for genome assembly and annotation have prevented detailed, quantitative inferences about the...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2006-7-11-r112

    authors: Bergman CM,Quesneville H,Anxolabéhère D,Ashburner M

    更新日期:2006-01-01 00:00:00

  • Networks for all.

    abstract::A report on the Cold Spring Harbor Laboratory/Wellcome Trust conference on Network Biology, Hinxton, UK, 27-31 August 2008. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2008-9-10-324

    authors: Ahnert SE,Teichmann SA

    更新日期:2008-10-27 00:00:00

  • Comparative genomics reveals the distinct evolutionary trajectories of the robust and complex coral lineages.

    abstract:BACKGROUND:Despite the biological and economic significance of scleractinian reef-building corals, the lack of large molecular datasets for a representative range of species limits understanding of many aspects of their biology. Within the Scleractinia, based on molecular evidence, it is generally recognised that there...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1552-8

    authors: Ying H,Cooke I,Sprungala S,Wang W,Hayward DC,Tang Y,Huttley G,Ball EE,Forêt S,Miller DJ

    更新日期:2018-11-02 00:00:00

  • A prediction-based resampling method for estimating the number of clusters in a dataset.

    abstract:BACKGROUND:Microarray technology is increasingly being applied in biological and medical research to address a wide range of problems, such as the classification of tumors. An important statistical problem associated with tumor classification is the identification of new tumor classes using gene-expression profiles. Tw...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2002-3-7-research0036

    authors: Dudoit S,Fridlyand J

    更新日期:2002-06-25 00:00:00

  • Genomic parasites and genome evolution.

    abstract::A report of the Second International Conference/Workshop on the Genomic Impact of Eukaryotic Transposable Elements, Pacific Grove, USA, 6-10 February 2009. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2009-10-4-306

    authors: Ivics Z

    更新日期:2009-01-01 00:00:00

  • Independent centromere formation in a capricious, gene-free domain of chromosome 13q21 in Old World monkeys and pigs.

    abstract:BACKGROUND:Evolutionary centromere repositioning and human analphoid neocentromeres occurring in clinical cases are, very likely, two stages of the same phenomenon whose properties still remain substantially obscure. Chromosome 13 is the chromosome with the highest number of neocentromeres. We reconstructed the mammali...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2006-7-10-r91

    authors: Cardone MF,Alonso A,Pazienza M,Ventura M,Montemurro G,Carbone L,de Jong PJ,Stanyon R,D'Addabbo P,Archidiacono N,She X,Eichler EE,Warburton PE,Rocchi M

    更新日期:2006-01-01 00:00:00

  • Comparative genomics of mutualistic viruses of Glyptapanteles parasitic wasps.

    abstract:BACKGROUND:Polydnaviruses, double-stranded DNA viruses with segmented genomes, have evolved as obligate endosymbionts of parasitoid wasps. Virus particles are replication deficient and produced by female wasps from proviral sequences integrated into the wasp genome. These particles are co-injected with eggs into caterp...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-12-r183

    authors: Desjardins CA,Gundersen-Rindal DE,Hostetler JB,Tallon LJ,Fadrosh DW,Fuester RW,Pedroni MJ,Haas BJ,Schatz MC,Jones KM,Crabtree J,Forberger H,Nene V

    更新日期:2008-01-01 00:00:00

  • Species-specific shifts in centromere sequence composition are coincident with breakpoint reuse in karyotypically divergent lineages.

    abstract:BACKGROUND:It has been hypothesized that rapid divergence in centromere sequences accompanies rapid karyotypic change during speciation. However, the reuse of breakpoints coincident with centromeres in the evolution of divergent karyotypes poses a potential paradox. In distantly related species where the same centromer...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-8-r170

    authors: Bulazel KV,Ferreri GC,Eldridge MD,O'Neill RJ

    更新日期:2007-01-01 00:00:00

  • Redistribution of H3K27me3 upon DNA hypomethylation results in de-repression of Polycomb target genes.

    abstract:BACKGROUND:DNA methylation and the Polycomb repression system are epigenetic mechanisms that play important roles in maintaining transcriptional repression. Recent evidence suggests that DNA methylation can attenuate the binding of Polycomb protein components to chromatin and thus plays a role in determining their geno...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2013-14-3-r25

    authors: Reddington JP,Perricone SM,Nestor CE,Reichmann J,Youngson NA,Suzuki M,Reinhardt D,Dunican DS,Prendergast JG,Mjoseng H,Ramsahoye BH,Whitelaw E,Greally JM,Adams IR,Bickmore WA,Meehan RR

    更新日期:2013-03-25 00:00:00

  • Hemispheric asymmetry in the human brain and in Parkinson's disease is linked to divergent epigenetic patterns in neurons.

    abstract:BACKGROUND:Hemispheric asymmetry in neuronal processes is a fundamental feature of the human brain and drives symptom lateralization in Parkinson's disease (PD), but its molecular determinants are unknown. Here, we identify divergent epigenetic patterns involved in hemispheric asymmetry by profiling DNA methylation in ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-01960-1

    authors: Li P,Ensink E,Lang S,Marshall L,Schilthuis M,Lamp J,Vega I,Labrie V

    更新日期:2020-03-09 00:00:00

  • Chromatin accessibility reveals insights into androgen receptor activation and transcriptional specificity.

    abstract:BACKGROUND:Epigenetic mechanisms such as chromatin accessibility impact transcription factor binding to DNA and transcriptional specificity. The androgen receptor (AR), a master regulator of the male phenotype and prostate cancer pathogenesis, acts primarily through ligand-activated transcription of target genes. Altho...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2012-13-10-r88

    authors: Tewari AK,Yardimci GG,Shibata Y,Sheffield NC,Song L,Taylor BS,Georgiev SG,Coetzee GA,Ohler U,Furey TS,Crawford GE,Febbo PG

    更新日期:2012-10-03 00:00:00

  • A genomic and evolutionary approach reveals non-genetic drug resistance in malaria.

    abstract:BACKGROUND:Drug resistance remains a major public health challenge for malaria treatment and eradication. Individual loci associated with drug resistance to many antimalarials have been identified, but their epistasis with other resistance mechanisms has not yet been elucidated. RESULTS:We previously described two mut...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/PREACCEPT-1067113631444973

    authors: Herman JD,Rice DP,Ribacke U,Silterra J,Deik AA,Moss EL,Broadbent KM,Neafsey DE,Desai MM,Clish CB,Mazitschek R,Wirth DF

    更新日期:2014-01-01 00:00:00