Finding genes in Schistosoma japonicum: annotating novel genomes with help of extrinsic evidence.

Abstract:

We have developed a novel method for estimating the parameters of hidden Markov models for gene finding in newly sequenced species. Our approach does not rely on curated training data sets, but instead uses extrinsic evidence (including paired-end ditags, which have not previously been used in gene finding) and iterative training. This new method is particularly suitable for annotating species with a large evolutionary distance to the closest annotated species. We have used our approach to produce an initial annotation of more than 16,000 genes in the newly sequenced Schistosoma japonicum draft genome. We established the high quality of our predictions by comparison to full-length cDNAs (withheld from the extrinsic evidence) and to CEGMA core genes. We also evaluated the effectiveness of the new training procedure on the Caenorhabditis elegans genome. ExonHunter and the newest parameter files for the S. japonicum genome are available for download at www.bioinformatics.uwaterloo.ca/downloads/exonhunter.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Brejová B,Vinar T,Chen Y,Wang S,Zhao G,Brown DG,Li M,Zhou Y

doi

10.1093/nar/gkp052

subject

Has Abstract

pub_date

2009-04-01 00:00:00

pages

e52

issue

7

eissn

1362-4962

issn

0305-1048

pii

gkp052

journal_volume

37

pub_type

Journal Article
  • Restriction endonuclease isoschizomers ItaI, BsoFI and Fsp4HI are characterised by differences in their sensitivities to CpG methylation.

    abstract::BsoFI, ItaI and Fsp4HI are isoschizomers of Fnu4HI (5'-GC NGC-3'). Both Fnu4HI and BsoFI have previously been shown to be inhibited by cytosine-specific methylation within the recognition sequence. Fnu4HI is inhibited if either the internal cytosine at position 2 or the external cytosine at position 5 of the restricti...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/25.16.3196

    authors: Ramsahoye BH,Burnett AK,Taylor C

    update_date:1997-08-15 00:00:00

  • Structure of a promotor on plasmid pMB9 derived from plasmid pSC101.

    abstract::The DNA sequence of a 354 basepair EcoRI-HindIII fragment of plasmid pMB9 which has originally been derived from plasmid pSC101 has been resolved. This fragment contains a promoter for transcription directed towards the EcoRI site. Escherichia coli RNA polymerase binds to a region within the EcoRI-HindIII fragment whi...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/8.7.1535

    authors: Pannekoek H,Maat J,van den Berg E,Noordermeer I

    update_date:1980-04-11 00:00:00

  • Identification of a homeobox-containing gene located between lin-45 and unc-24 on chromosome IV in the nematode Caenorhabditis elegans.

    abstract::Using two primers corresponding to helix 1 and helix 3 regions in the homeodomain, we subjected genomic DNA from Caenorhabditis elegans to amplification by the polymerase chain reaction. Sequence analysis of the amplified products revealed a new homeobox-containing gene, designated ceh-19. This gene was located betwee...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/20.12.2967

    authors: Naito M,Kohara Y,Kurosawa Y

    update_date:1992-06-25 00:00:00

  • The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse.

    abstract::The Mouse Genome Database (MGD, http://www.informatics.jax.org) is the international community resource for integrated genetic, genomic and biological data about the laboratory mouse. Data in MGD are obtained through loads from major data providers and experimental consortia, electronic submissions from laboratories a...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkr974

    authors: Eppig JT,Blake JA,Bult CJ,Kadin JA,Richardson JE,Mouse Genome Database Group.

    update_date:2012-01-01 00:00:00

  • The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution.

    abstract::We report the latest release (version 3.0) of the CATH protein domain database (http://www.cathdb.info). There has been a 20% increase in the number of structural domains classified in CATH, up to 86 151 domains. Release 3.0 comprises 1110 fold groups and 2147 homologous superfamilies. To cope with the increases in di...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkl959

    authors: Greene LH,Lewis TE,Addou S,Cuff A,Dallman T,Dibley M,Redfern O,Pearl F,Nambudiry R,Reid A,Sillitoe I,Yeats C,Thornton JM,Orengo CA

    update_date:2007-01-01 00:00:00

  • Methyl transferases induced during chemical adaptation of M. luteus.

    abstract::Three peaks of methyltransferase activity specific for MNNG alkylated DNA have been identified from extracts of chemically adapted M. luteus. They are designated as TI to TIII in order of their elution from a Sephadex G-75 column. The first one of these peaks has been purified to homogeneity. TI is an inducible, unus...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/15.22.9471

    authors: Riazuddin S,Athar A,Sohail A

    update_date:1987-11-25 00:00:00

  • Untangling the relationships between DNA repair pathways by silencing more than 20 DNA repair genes in human stable clones.

    abstract::Much effort has long been devoted to unraveling the coordinated cellular response to genotoxic insults. In view of the difficulty of obtaining human biological samples of homogeneous origin, I have established a set of stable human clones where one DNA repair gene has been stably silenced by means of RNA interference....

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkm195

    authors: Biard DS

    update_date:2007-01-01 00:00:00

  • AlleleHMM: a data-driven method to identify allele specific differences in distributed functional genomic marks.

    abstract::How DNA sequence variation influences gene expression remains poorly understood. Diploid organisms have two homologous copies of their DNA sequence in the same nucleus, providing a rich source of information about how genetic variation affects a wealth of biochemical processes. However, few computational methods have ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkz176

    authors: Chou SP,Danko CG

    update_date:2019-06-20 00:00:00

  • Flexibility and stabilization of HgII-mediated C:T and T:T base pairs in DNA duplex.

    abstract::Owing to their great potentials in genetic code extension and the development of nucleic acid-based functional nanodevices, DNA duplexes containing HgII-mediated base pairs have been extensively studied during the past 60 years. However, structural basis underlying these base pairs remains poorly understood. Herein, w...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkw1296

    authors: Liu H,Cai C,Haruehanroengra P,Yao Q,Chen Y,Yang C,Luo Q,Wu B,Li J,Ma J,Sheng J,Gan J

    update_date:2017-03-17 00:00:00

  • Cell type-specific genomics of Drosophila neurons.

    abstract::Many tools are available to analyse genomes but are often challenging to use in a cell type-specific context. We have developed a method similar to the isolation of nuclei tagged in a specific cell type (INTACT) technique [Deal,R.B. and Henikoff,S. (2010) A simple method for gene expression and chromatin profiling of ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gks671

    authors: Henry GL,Davis FP,Picard S,Eddy SR

    update_date:2012-10-01 00:00:00

  • Bioinformatic analysis of the protein/DNA interface.

    abstract::To investigate the principles driving recognition between proteins and DNA, we analyzed more than a thousand crystal structures of protein/DNA complexes. We classified protein and DNA conformations by structural alphabets, protein blocks [de Brevern, Etchebest and Hazout (2000) (Bayesian probabilistic approach for predi...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkt1273

    authors: Schneider B,Cerný J,Svozil D,Cech P,Gelly JC,de Brevern AG

    update_date:2014-03-01 00:00:00

  • Happy mapping: a proposal for linkage mapping the human genome.

    abstract::A theoretical approach for linkage mapping the genome of any higher eukaryote is described. It uses the polymerase chain reaction, oligonucleotides of random sequence and single haploid cells. Markers are defined and then the DNA of a single sperm is broken at random (eg by gamma-rays) and physically split into 3 aliq...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/17.17.6795

    authors: Dear PH,Cook PR

    update_date:1989-09-12 00:00:00

  • Characterization of the apolipoprotein B mRNA editing enzyme: no similarity to the proposed mechanism of RNA editing in kinetoplastid protozoa.

    abstract::Intestinal apolipoprotein B mRNA is edited at nucleotide 6666 by a C to U transition resulting in a translational stop codon. The enzymatic properties of the editing activity were characterised in vitro using rat enterocyte cytosolic extract. The editing activity has no nucleotide or ion cofactor requirement. It shows...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/19.13.3569

    authors: Greeve J,Navaratnam N,Scott J

    update_date:1991-07-11 00:00:00

  • Evolution of the casein multigene family: conserved sequences in the 5' flanking and exon regions.

    abstract::The rat alpha- and bovine alpha s1-casein genes have been isolated and their 5' sequences determined. The rat alpha-, beta-, gamma- and bovine alpha s1-casein genes contain similar 5' exon arrangements in which the 5' noncoding, signal peptide and casein kinase phosphorylation sequences are each encoded by separate ex...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/14.4.1883

    authors: Yu-Lee LY,Richter-Mann L,Couch CH,Stewart AF,Mackinlay AG,Rosen JM

    update_date:1986-02-25 00:00:00

  • Identification of the factors that interact with NCBP, an 80 kDa nuclear cap binding protein.

    abstract::It has been shown that the monomethylated cap structure plays important roles in pre-mRNA splicing and nuclear export of RNA. As a candidate for the factor involved in these nuclear events we have previously purified an 80 kDa nuclear cap binding protein (NCBP) from a HeLa cell nuclear extract and isolated its full-le...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/23.18.3638

    authors: Kataoka N,Ohno M,Moda I,Shimura Y

    update_date:1995-09-25 00:00:00

  • The role of zinc finger linkers in p43 and TFIIIA binding to 5S rRNA and DNA.

    abstract::Transcription factor IIIA (TFIIIA) and p43 zinc finger protein form distinct complexes with 5S ribosomal RNA in Xenopus oocytes. Additionally, TFIIIA binds the internal promoter of the 5S RNA gene and supports assembly of a transcription initiation complex. Both proteins have nine tandemly repeated zinc fingers with a...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/26.3.703

    authors: Ryan RF,Darby MK

    update_date:1998-02-01 00:00:00

  • Change of RNase P RNA function by single base mutation correlates with perturbation of metal ion binding in P4 as determined by NMR spectroscopy.

    abstract::The solution structures of two 27 nt RNA hairpins and their complexes with cobalt(III)-hexammine [Co(NH(3))(6)(3+)] were determined by NMR spectroscopy. The RNA hairpins are variants of the P4 region from Escherichia coli RNase P RNA: a U-to-A mutant changing the identity of the bulged nucleotide, and a U-to-C, C-to-U...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkh961

    authors: Schmitz M

    update_date:2004-12-02 00:00:00

  • Understanding the effect of controlling phosphorothioate chirality in the DNA gap on the potency and safety of gapmer antisense oligonucleotides.

    abstract::Therapeutic oligonucleotides are often modified using the phosphorothioate (PS) backbone modification which enhances stability from nuclease mediated degradation. However, substituting oxygen in the phosphodiester backbone with sulfur introduces chirality into the backbone such that a full PS 16-mer oligonucleotide is ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkaa031

    authors: Østergaard ME,De Hoyos CL,Wan WB,Shen W,Low A,Berdeja A,Vasquez G,Murray S,Migawa MT,Liang XH,Swayze EE,Crooke ST,Seth PP

    update_date:2020-02-28 00:00:00

  • Characteristic arrangement of nucleosomes is predictive of chromatin interactions at kilobase resolution.

    abstract::High-throughput chromosome conformation capture (3C) technologies, such as Hi-C, have made it possible to survey 3D genome structure. However, obtaining 3D profiles at kilobase resolution at low cost remains a major challenge. Therefore, we herein present an algorithm for precise identification of chromatin interactio...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkx885

    authors: Zhang H,Li F,Jia Y,Xu B,Zhang Y,Li X,Zhang Z

    update_date:2017-12-15 00:00:00

  • Structure of Actin-related protein 8 and its contribution to nucleosome binding.

    abstract::Nuclear actin-related proteins (Arps) are subunits of several chromatin remodelers, but their molecular functions within these complexes are unclear. We report the crystal structure of the INO80 complex subunit Arp8 in its ATP-bound form. Human Arp8 has several insertions in the conserved actin fold that explain its i...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gks842

    authors: Gerhold CB,Winkler DD,Lakomek K,Seifert FU,Fenn S,Kessler B,Witte G,Luger K,Hopfner KP

    update_date:2012-11-01 00:00:00

  • Analysis of the gene encoding the RNA subunit of ribonuclease P from cyanobacteria.

    abstract::The genes encoding the RNA subunit of ribonuclease P from the unicellular cyanobacterium Synechocystis sp. PCC 6803, and from the heterocyst-forming strains Anabaena sp. PCC 7120 and Calothrix sp. PCC 7601 were cloned using the homologous gene from Anacystis nidulans (Synechococcus sp. PCC 6301) as a probe. The genes ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/20.23.6331

    authors: Vioque A

    update_date:1992-12-11 00:00:00

  • ConTra v3: a tool to identify transcription factor binding sites across species, update 2017.

    abstract::Transcription factors are important gene regulators with distinctive roles in development, cell signaling and cell cycling, and they have been associated with many diseases. The ConTra v3 web server allows easy visualization and exploration of predicted transcription factor binding sites (TFBSs) in any genomic region ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkx376

    authors: Kreft L,Soete A,Hulpiau P,Botzki A,Saeys Y,De Bleser P

    update_date:2017-07-03 00:00:00

  • Influence of flanking sequences on variability in expression levels of an introduced gene in transgenic tobacco plants.

    abstract::The petunia rbcS gene SSU301 was introduced into tobacco using Agrobacterium tumefaciens-mediated transformation. The time at which rbcS expression was maximal after transfer of the tobacco plants to the greenhouse was determined. The expression level of the SSU301 gene varied up to 9 fold between individual tobacco p...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/16.19.9267

    authors: Dean C,Jones J,Favreau M,Dunsmuir P,Bedbrook J

    update_date:1988-10-11 00:00:00

  • Dynamic binding of Ku80, Ku70 and NF90 to the IL-2 promoter in vivo in activated T-cells.

    abstract::IL-2 gene expression in activated T-cells is initiated by chromatin remodeling at the IL-2 proximal promoter and conversion of a transcriptional repressor into a potent transcriptional activator. A purine-box regulator complex was purified from activated Jurkat T-cell nuclei based on sequence-specific DNA binding to t...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkm117

    authors: Shi L,Qiu D,Zhao G,Corthesy B,Lees-Miller S,Reeves WH,Kao PN

    update_date:2007-01-01 00:00:00

  • Improving the performance of DomainParser for structural domain partition using neural network.

    abstract::Structural domains are considered as the basic units of protein folding, evolution, function and design. Automatic decomposition of protein structures into structural domains, though after many years of investigation, remains a challenging and unsolved problem. Manual inspection still plays a key role in domain decomp...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkg189

    authors: Guo JT,Xu D,Kim D,Xu Y

    update_date:2003-02-01 00:00:00

  • Distinct RNA-binding modules in a single PUF protein cooperate to determine RNA specificity.

    abstract::PUF proteins, named for Drosophila Pumilio (PUM) and Caenorhabditis elegans fem-3-binding factor (FBF), recognize specific sequences in the mRNAs they bind and control. RNA binding by classical PUF proteins is mediated by a characteristic PUM homology domain (PUM-HD). The Puf1 and Puf2 proteins possess a distinct arch...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkz583

    authors: Qiu C,Dutcher RC,Porter DF,Arava Y,Wickens M,Hall TMT

    update_date:2019-09-19 00:00:00

  • CAF-1-induced oligomerization of histones H3/H4 and mutually exclusive interactions with Asf1 guide H3/H4 transitions among histone chaperones and DNA.

    abstract::Anti-silencing function 1 (Asf1) and Chromatin Assembly Factor 1 (CAF-1) chaperone histones H3/H4 during the assembly of nucleosomes on newly replicated DNA. To understand the mechanism of histone H3/H4 transfer among Asf1, CAF-1 and DNA from a thermodynamic perspective, we developed and employed biophysical approache...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gks906

    authors: Liu WH,Roemer SC,Port AM,Churchill ME

    update_date:2012-12-01 00:00:00

  • Characterization of the 3' exonuclease subunit DP1 of Methanococcus jannaschii replicative DNA polymerase D.

    abstract::The B-subunits associated with the replicative DNA polymerases are conserved from Archaea to humans, whereas the corresponding catalytic subunits are not related. The latter belong to the B and D DNA polymerase families in eukaryotes and archaea, respectively. Sequence analysis places the B-subunits within the calcine...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkh558

    authors: Jokela M,Eskelinen A,Pospiech H,Rouvinen J,Syväoja JE

    update_date:2004-04-30 00:00:00

  • RAD51AP2, a novel vertebrate- and meiotic-specific protein, shares a conserved RAD51-interacting C-terminal domain with RAD51AP1/PIR51.

    abstract::Many interacting proteins regulate and/or assist the activities of RAD51, a recombinase which plays a critical role in both DNA repair and meiotic recombination. Yeast two-hybrid screening of a human testis cDNA library revealed a new protein, RAD51AP2 (RAD51 Associated Protein 2), that interacts strongly with RAD51. ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkl665

    authors: Kovalenko OV,Wiese C,Schild D

    update_date:2006-01-01 00:00:00

  • Eukaryotic rpL10 drives ribosomal rotation.

    abstract::Ribosomes transit between two conformational states, non-rotated and rotated, through the elongation cycle. Here, we present evidence that an internal loop in the essential yeast ribosomal protein rpL10 is a central controller of this process. Mutations in this loop promote opposing effects on the natural equilibrium ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkt1107

    authors: Sulima SO,Gülay SP,Anjos M,Patchett S,Meskauskas A,Johnson AW,Dinman JD

    update_date:2014-02-01 00:00:00